SPRAAK
|
Script to create and train acoustic models. More...
Namespaces | |
spr_train | |
Functions | |
def | spr_train.main |
Main function. More... | |
Script to create and train acoustic models.
spr_train.py <training-description-file>
config
and trainer
(see spr_pylib::train::master::create_new_trainer() ). The config
object will hold all configuration data while the trainer
object will be used to perform the different training steps. Setting a configuration value is done by assigning values to the relevant attribute of the config
object. The first call to a method from the trainer
object will trigger a sanity check of the configuration values. If during this check some values are found missing the script will abort with an error stating what value is missing. ssh
. For this you need to make sure that the account that will run the training scripts can login on the remote nodes without user interaction. For more info see the man pages on ssh
and public-private key authentication and the ssh-add
command. It is also required that all the remote nodes share the same filesystem. All data communication happens via the shared filesystem. The following steps are taken to lauch a remote process:ssh
login to the remote node. Make sure everything is configured so that no user interaction is required during the logon.remote_env_script
)nice -n 19
<exp-name>_recovery.log
. This file is located in the <metadir> directory. It is important to remove this file if one wants to redo a complete run from start to finish. This is for example needed after changes to the config file or any of the starting input files.mx_sz
by setting key oc*
to sz.<mx_sz>
.dovit
to 1.tst
to the name of the test-script. The test-script is then called with the following arguments: <exp> <model_name> "<ph_ci> <ph_cd>" "<preproc> <ppe>" <topt[1]> ... <topt[8]>The extra arguments
<topt[1...8]>
can be defined as <key>=<values>
pairs as wellconfig.exp = "RM" config.log = "%s.LOG" % config.exp config.ldir = "ldir" config.sdir = "sdir" config.mname = "mod{exp}.{ITER}/{name}{iter}" config.preproc = "PREPROC" config.mida_np = 17 config.mida_opt = "-gc MMI -gf LIN -VC -1 -niter 10 -eps 5e-4 -greedy 'Gg' -Fopt '/'" config.ph_ci = "phon.ci" config.ph_cd = "phon.cd" config.ph_arcd = "phon.arcd" config.ph_spec = "\#" config.questions = "questions" config.seg = "RM.seg" config.cor = "RM.cor" config.dic = "RM_man.dic" config.unwind = "'add_in_front=\#;add_between=[/\#];add_at_rear=\#;sent_context=\#\#;'" config.obsdir = "/data/jroelens/RM_SAM_DATA" config.suffix = "sam" config.pass_split_cnt = 1 config.local_paths.append("/volume1/") config.remote_env_script = ". ../../SETENV" \#config.host_info = [(None, 2), ("spchcl06", 2)] config.host_info = [(None, 2)] config.metadir = "%s_PROGRESS" % config.exp trainer.tied(niter = 1, tng = 1024) trainer.cdtree(niter = 1, dtmc = 512, dtlt = 1024) trainer.fvg(niter = 1)