Speech and Natural Language Processing
Finite State Toolkits and Regular Expressions
Finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests/
Fare is a finite-state and regular expression library for the .NET framework, written in C#.
A toolkit for training finite-state models
Java port of the C++ OpenFst library; originally forked from the CMU Sphinx project
High-level finite-state programming language built on top of OpenFst.
Spectral and other training algorithms for WFSAs.
N-shortest paths for PDT.
"Noam is a JavaScript library for working with automata and formal grammars for regular and context-free languages". Also has pretty cool examples using viz.js (https://github.com/mdaines/viz.js/).
Nice set of utilities for OpenFst (openfst-utils); includes an implementation of categorial semirings.
Toolkit for manipulating word lattices, built on top of OpenFst. Includes support for reading and writing HTK-compatible lattices.
Python interface to OpenFst
Language Modelling Toolkits
Speech Recognition
Barista is an open-source framework for concurrent speech processing.
Neural network phone duration model on top of the Kaldi speech recognition framework (Interspeech paper: https://phon.ioc.ee/dokuwiki/lib/exe/fetch.php?media=peopleicassp2014-durmodel.pdf).
Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).
SailAlign is an open-source software toolkit for robust long speech-text alignment. It implements an adaptive, iterative speech recognition and text alignment scheme that can process very long (and possibly noisy) audio and is robust to transcription errors. It is mainly written as a Perl library, but its functionality also depends…
Text-to-Speech
Machine Translation
Deep Learning
Comparison of different convolutional network implementations.
Some Python wrappers are also available.
Neural network based machine translation toolkit.
C++ implementation of LSTM (Long Short-Term Memory) in Kaldi's nnet1 framework. Used for automatic speech recognition, and possibly language modeling, etc.
Neural network toolkit for machine translation, described in this paper: https://ufal.mff.cuni.cz/pbml/102/art-baltescu-blunsom-hoang.pdf
Tool to convert RNNLMs to WFSTs
Natural Language Processing
"BLLIP Parser is a statistical natural language parser including a generative constituent parser (first-stage) and discriminative maximum entropy reranker (second-stage)."
Set expander for any language, described in this paper: http://www.cs.cmu.edu/~wcohen/postscript/icdm-2007.pdf