NodeCMUSphinx4 SR

Predictor

Speech recognizer based on the CMUSphinx project. Additional language models can be downloaded from Sourceforge and Voxforge. You can also learn your own dictionary and language model and reuse the standard English acoustic model. To automatically create a dictionary and language model file from smaller text files simply go to CMU lmtool page. For more details see the CMU Building Language Model tutorial. Once created you can use your dictionary and language model with an existing acoustic model e.g. en-us.

Notice: If you get bad results check that the sampling rate of your audio files match the one used to train the language model (see CMUSphinx FAQ). The included models where trained with a sampling rate of 16kHz.

Input Ports

  1. Port Type: Data
    Table with audio column

Output Ports

  1. Port Type: Data
    Audio table with recognition result