|
SPRAAK
|
Impute a spectrum given a decomposed spectrum. More...
Functions | |
| int | spr_vqmask_process (SprSspInfo *Info, const void *frame_in, void *frame_out) |
| void | spr_vqmask_free (SprSspInfo *Info) |
| int | spr_vqmask_setup (SprSspInfo *Info, const char **descript, void *aux_info) |
Impute a spectrum given a decomposed spectrum.
Impute a MEL-spectrum given
decomposed in a
and
component (
=voiced,
=residu in harmonic decomposition; other decompositions should work as well)
and variance
for the
component of the noise
that links
to
(diagonal) Gaussian distributions
modelling the decomposed clean speech Right now all variances (sigma) are pooled and set to
for
for the
component of the noise Also, the speech Gaussians are assumed to have uniform prior probabilities.
The imputation estimates the underlying noise
for each spectral component and clean speech codebook entry
independently, given the decomposed noisy speech spectrum
,
and the estimate of the noise mean
using the following equation:
The imputed noise and speech spectrum are
and
respectively.
of the best fitting codebook
of the best fitting codebook | [vqmask] | |
|---|---|
codebook <filename> | |
Codebook filename (a .mvg file). | |
Nsil <Nsil>(0) [SpchExclSil/SpchInclSil](SpchInclSil) | |
Use the first <Nsil> entries for silence frames and the remaining entries for speech frames. Use either all the codebook entries (SpchInclSil; advised when the VAD has 'hangover', i.e. bridges short silence gaps) or only the non-silence ones (SpchExclSil; may work better if the VAD is excellent) for non-silence frames. | |
gamma <val>(1.0) | |
Weight of the noise mean Gaussian (1/sqrt(var)). | |
lambda <val>(1.0) | |
Weight of the noise linking Gaussian (1/sqrt(var)). | |
mu_noise [val](0.0) ... | |
The means mu^(v-r)_n for the Gaussian linking n^(v) with n^(r). | |
1.8.6