# SpeechDatasets.jl A Julia package to download and prepare speech corpus. ## Installation Make sure to add the [FAST registry](https://gitlab.lisn.upsaclay.fr/fast/registry) to your julia installation. Then, install the package as usual: ``` pkg> add SpeechDatasets ``` ## Example ``` julia> using SpeechDatasets julia> dataset = MINILIBRISPEECH("outputdir", :train) # :dev | :test ... julia> dataset = TIMIT("/path/to/timit/dir", "outputdir", :train) # :dev | :test ... julia> dataset = INADIACHRONY("/path/to/ina_wav/dir", "outputdir", "/path/to/ina_csv/dir") # ina_csv dir optional ... julia> dataset = AVID("/path/to/avid/dir", "outputdir") ... julia> for ((signal, fs), supervision) in dataset # do something end # Lexicons julia> CMUDICT("outputfile") ... julia> TIMITDICT("/path/to/timit/dir") ... ``` ## License This software is provided under the CeCILL 2.1 license (see the [`/LICENSE`](/LICENSE))