SPRAAK
Impute a spectrum given a decomposed spectrum.
Functions | |
int | spr_vqmask_process (SprSspInfo *Info, const void *frame_in, void *frame_out) |
void | spr_vqmask_free (SprSspInfo *Info) |
int | spr_vqmask_setup (SprSspInfo *Info, const char **descript, void *aux_info) |
Impute a MEL-spectrum given
Right now all variances (sigma) are pooled and set to
Also, the speech Gaussians are assumed to have uniform prior probabilities.
The imputation estimates the underlying noise for each spectral component and clean speech codebook entry
independently, given the decomposed noisy speech spectrum
,
and the estimate of the noise mean
using the following equation:
The imputed noise and speech spectrum are and
respectively.
[vqmask] | |
---|---|
codebook <filename> | Codebook filename (a .mvg file). |
Nsil <Nsil>(0) [SpchExclSil/SpchInclSil](SpchInclSil) | Use the first <Nsil> entries for silence frames and the remaining entries for speech frames. Use either all the codebook entries (SpchInclSil; advised when the VAD has 'hangover', i.e. bridges short silence gaps) or only the non-silence ones (SpchExclSil; may work better if the VAD is excellent) for non-silence frames. |
gamma <val>(1.0) | Weight of the noise mean Gaussian (1/sqrt(var)). |
lambda <val>(1.0) | Weight of the noise linking Gaussian (1/sqrt(var)). |
mu_noise [val](0.0) ... | The means mu^(v-r)_n for the Gaussian linking n^(v) with n^(r). |
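
The Nsil option above describes how codebook entries are split between silence and non-silence frames. The following is a minimal sketch of that selection rule only, not the SPRAAK implementation: the names `SpeechMode`, `EntryRange` and `select_entries` are hypothetical and do not exist in the library, and the per-frame VAD decision is assumed to be available as a simple flag.

```c
#include <assert.h>

/* Hypothetical illustration of the Nsil / SpchInclSil / SpchExclSil rule.
 * None of these names are part of the SPRAAK API. */
typedef enum { SPCH_EXCL_SIL, SPCH_INCL_SIL } SpeechMode;

/* Half-open range of codebook entries: [first, first + count). */
typedef struct {
    int first;
    int count;
} EntryRange;

/* Select the codebook entries to use for one frame.
 *   n_entries : total number of codebook entries
 *   n_sil     : number of leading entries reserved for silence (Nsil)
 *   mode      : which entries non-silence frames may use
 *   is_speech : assumed VAD decision for the frame (non-zero = speech)
 * Note: the text does not say what happens for silence frames when
 * Nsil is 0; this sketch simply returns an empty range in that case. */
static EntryRange select_entries(int n_entries, int n_sil,
                                 SpeechMode mode, int is_speech)
{
    EntryRange r;

    assert(n_sil >= 0 && n_sil <= n_entries);

    if (!is_speech) {
        /* Silence frame: only the first n_sil entries. */
        r.first = 0;
        r.count = n_sil;
    } else if (mode == SPCH_INCL_SIL) {
        /* Speech frame, SpchInclSil: all entries, silence ones included. */
        r.first = 0;
        r.count = n_entries;
    } else {
        /* Speech frame, SpchExclSil: only the entries after the silence ones. */
        r.first = n_sil;
        r.count = n_entries - n_sil;
    }
    return r;
}
```

With a 10-entry codebook and Nsil set to 2, a silence frame would use entries 0 to 1, a speech frame would use entries 0 to 9 under SpchInclSil, and entries 2 to 9 under SpchExclSil.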