Speech and Natural Language Processing
"Noam is a JavaScript library for working with automata and formal grammars for regular and context-free languages". Also has pretty cool examples using viz.js <https://github.com/mdaines/viz.js/>_
A nice set of utilities for OpenFst (openfst-utils), including an implementation of categorial semirings.
Toolkit for manipulating word lattices, built on top of OpenFst. Includes support for reading and writing HTK-compatible lattices.
Neural network phone duration model built on top of the Kaldi speech recognition framework (`Interspeech paper <https://phon.ioc.ee/dokuwiki/lib/exe/fetch.php?media=peopleicassp2014-durmodel.pdf>`_).
Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).
SailAlign is an open-source software toolkit for robust long speech-text alignment. It implements an adaptive, iterative speech recognition and text alignment scheme that can process very long (and possibly noisy) audio and is robust to transcription errors. It is mainly written as a Perl library but its functionality also depends…
Bob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland.
CMUdict (the Carnegie Mellon Pronouncing Dictionary) is a free pronouncing dictionary of English.
"Decoder, aligner, and model optimizer for statistical machine translation and other structured prediction models based on (mostly) context-free formalisms"
Spearmint is a package to perform Bayesian optimization according to the algorithms outlined in the paper "Practical Bayesian Optimization of Machine Learning Algorithms" by Jasper Snoek, Hugo Larochelle and Ryan P. Adams (Advances in Neural Information Processing Systems, 2012).
Python wrappers for cuDNN are available `here <https://github.com/hannes-brt/cudnn-python-wrappers>`_.
C++ implementation of LSTM (Long Short-Term Memory) in Kaldi's nnet1 framework. Used for automatic speech recognition and possibly for language modeling.
Neural network toolkit for machine translation, described in `this paper <https://ufal.mff.cuni.cz/pbml/102/art-baltescu-blunsom-hoang.pdf>`_.
"BLLIP Parser is a statistical natural language parser including a generative constituent parser (first-stage) and discriminative maximum entropy reranker (second-stage)."
Set expander for any language, described in `this paper <http://www.cs.cmu.edu/~wcohen/postscript/icdm-2007.pdf>`_.
"CloudASR is a software platform and a public ASR webservice."