|
SPRAAK
|
Collaboration diagram for SprCwrSearchStd:Data Fields | |
| unsigned int | load_flags |
| which parts (phone_set, lexicon, hmm, ...) are loaded More... | |
| unsigned int | modif_must_do |
| combine reset and free operations during modifications More... | |
| unsigned int | modif_have_done |
| combine reset and free operations during modifications More... | |
| char | store_seq |
| do (not) store the node/phoneme sequences during the search More... | |
| char | recog_mode |
| recognition mode More... | |
| char | lm_fwd_mode |
| type of LM-forwarding that is used More... | |
| char | sent_end_flags |
| for sentence/word end in CWR_best_sentence() More... | |
| char | wexpand_flags |
| word expand flags More... | |
| char | sdet_avail |
| the preprocessing gives output on sentence boundaries More... | |
| char | sdet_cont |
| char | sdet_new_res |
| char | gauss_sel |
| gaussian selection status More... | |
| int | sdet_count |
| int | sdet_out |
| SprCwrWordSet | sentence_begin |
| word index/indices of <s> More... | |
| SprCwrWordSet | sentence_end |
| word index/indices of </s> More... | |
| float | lm_A |
| float | lm_C |
| upgrade the language model cost: cost = LM_cost*A+C More... | |
| unsigned int | lm_cache_size |
| target size of the LM-cache More... | |
| unsigned int | lmc_hash_size |
| target size of the LM-context hashing More... | |
| float * | slmsp |
| float * | slmsp0 |
| the unshifted unigram prob array More... | |
| int * | slmsp_wset_mx |
| word with best unigram probability in a word set More... | |
| int * | fwd_comb_inst |
| definition of word sets when using full LM-forwarding More... | |
| int ** | fwd_icomb_inst |
| instructions for a specific word set More... | |
| int | fwd_wset_len |
| number of new word sets introduced for full LM-forwarding More... | |
| int | Ntokens_max |
| maximal beam width (absolute limit on the nr. of open token) More... | |
| int | Ntokens_min |
| minimal beam width (absolute limit on the nr. of open token) More... | |
| int | Ntokens_wstart |
| int | Ntokens_wend |
| int | Ntokens_wint |
| int | Nctokens_wend |
| int | Nctokens_wint |
| unsigned int | beam_corr_up |
| unsigned int | beam_corr_down |
| unsigned long long | Ntokens_av_ax |
| maximum beam size after word end expansion More... | |
| unsigned long long | Ntokens_av_bp |
| maximum beam size before pruning More... | |
| unsigned long long | Ntokens_av_ap |
| maximum beam size after pruning More... | |
| double | av_threshold |
| average pruning threshold (adaptive) More... | |
| unsigned int | frame_cnt |
| int | iframe |
| number of the current frame More... | |
| int | iframe0 |
| the number of the very first frame More... | |
| int | sub_iframe |
| int | mx_empty_word_cnt |
| double | score_offset |
| scaling offset for all scores More... | |
| float | best_WE_score |
| score of the best word final token More... | |
| float | estim_best_score |
| int | estim_best_state |
| the state to be used for updating the estim_best_score field More... | |
| float | best_score |
| score of the best token (so far) More... | |
| float | limit_score |
| best score minus the pruning threshold More... | |
| float | adapt_threshold |
| adaptive version of the pruning threshold, predicted version More... | |
| float | adapt_threshold0 |
| float | adapt_thr_prev |
| float | adapt_log_bw |
| log of the previous beam width More... | |
| float | adapt_log_bw_max |
| log of the maximum beam width More... | |
| float | adapt_thr_bw_stats [4] |
| float | threshold_max |
| maximum pruning threshold (specified by the user) More... | |
| float | threshold_frac |
| maximum increase in threshold to asure a minimum beam_width More... | |
| float | word_threshold_frac |
| float | asf |
| acoustic score scale factor More... | |
| float | H_lim0 |
| float | H_lim1 |
| float | H_factor |
| float | H_av_flt |
| float | H_av |
| float | N_av_tgt |
| maximal beam width over a long interval More... | |
| float | N_av_tgt_flt |
| maximal beam width over a long interval (go there slowly) More... | |
| float | N_av_flt |
| float | N_av |
| average beam width over a long interval More... | |
| float | N_av_ss |
| react differently on sentence starts and silences More... | |
| SprCwrSearchAdaptSpkr * | spkr_settings |
| current speaker specific settings More... | |
| SprCwrSearchAdaptSpkr * | spkr_list |
| list that contains all speaker specific settings (LRU-cache) More... | |
| int * | score_hist |
| int | score_hist_size |
| the size (nr. of tabs) of the score histogram More... | |
| SprCwrSearchResult | topn |
| SprCwrSearchResult | best |
| the best scoring sentence, word sequence or word More... | |
| double | recog_score |
| score assigned to the recognition result More... | |
| SprCwrBeamHandler * | beam_handler |
| SprCwrSearchUpdate * | update_block |
| Request an update of the decoder parameters. More... | |
things needed for this implementation (the standard implementation)
| unsigned int SprCwrSearchStd::load_flags |
which parts (phone_set, lexicon, hmm, ...) are loaded
| unsigned int SprCwrSearchStd::modif_must_do |
combine reset and free operations during modifications
| unsigned int SprCwrSearchStd::modif_have_done |
combine reset and free operations during modifications
| char SprCwrSearchStd::store_seq |
do (not) store the node/phoneme sequences during the search
| char SprCwrSearchStd::recog_mode |
recognition mode
| char SprCwrSearchStd::lm_fwd_mode |
type of LM-forwarding that is used
| char SprCwrSearchStd::sent_end_flags |
for sentence/word end in CWR_best_sentence()
| char SprCwrSearchStd::wexpand_flags |
word expand flags
| char SprCwrSearchStd::sdet_avail |
the preprocessing gives output on sentence boundaries
| char SprCwrSearchStd::sdet_cont |
the recognition stopped at a sentence boundary, further data is available (continuous loop)
| char SprCwrSearchStd::sdet_new_res |
the recognition has found some new non silence frames after a sentence boundary, and has thus produced some new results
| char SprCwrSearchStd::gauss_sel |
gaussian selection status
bit 0: do gaussian selection (1/0); bit 1: gaussian selection must be initialized
| int SprCwrSearchStd::sdet_count |
skip pause between sentences with a periodical return to the main program
| int SprCwrSearchStd::sdet_out |
the preprocessing may mark whether the frame is spch or not (sentence detection)
| SprCwrWordSet SprCwrSearchStd::sentence_begin |
word index/indices of <s>
| SprCwrWordSet SprCwrSearchStd::sentence_end |
word index/indices of </s>
| float SprCwrSearchStd::lm_A |
| float SprCwrSearchStd::lm_C |
upgrade the language model cost: cost = LM_cost*A+C
| unsigned int SprCwrSearchStd::lm_cache_size |
target size of the LM-cache
| unsigned int SprCwrSearchStd::lmc_hash_size |
target size of the LM-context hashing
| float* SprCwrSearchStd::slmsp |
max. unigram probabilities (unscaled, shifted to account for negative indices) trivial way to forward the LM influence
| float* SprCwrSearchStd::slmsp0 |
the unshifted unigram prob array
| int* SprCwrSearchStd::slmsp_wset_mx |
word with best unigram probability in a word set
| int* SprCwrSearchStd::fwd_comb_inst |
definition of word sets when using full LM-forwarding
| int** SprCwrSearchStd::fwd_icomb_inst |
instructions for a specific word set
| int SprCwrSearchStd::fwd_wset_len |
number of new word sets introduced for full LM-forwarding
| int SprCwrSearchStd::Ntokens_max |
maximal beam width (absolute limit on the nr. of open token)
| int SprCwrSearchStd::Ntokens_min |
minimal beam width (absolute limit on the nr. of open token)
| int SprCwrSearchStd::Ntokens_wstart |
number of new tokens created in word end splitting (from group to individual words) and cross word transitions (word startup)
| int SprCwrSearchStd::Ntokens_wend |
number of tokens propagated to a word end position (after observe+jump & pruning)
| int SprCwrSearchStd::Ntokens_wint |
number of tokens propagated to a word internal position (after observe+jump & pruning)
| int SprCwrSearchStd::Nctokens_wend |
number of ctokens propagated to a word end position (after observe+jump & pruning)
| int SprCwrSearchStd::Nctokens_wint |
number of ctokens propagated to a word internal position (after observe+jump & pruning)
| unsigned int SprCwrSearchStd::beam_corr_up |
nr. of correction to the threshold in order to get a larger beam
| unsigned int SprCwrSearchStd::beam_corr_down |
nr. of correction to the threshold in order to get a smaller beam
| unsigned long long SprCwrSearchStd::Ntokens_av_ax |
maximum beam size after word end expansion
| unsigned long long SprCwrSearchStd::Ntokens_av_bp |
maximum beam size before pruning
| unsigned long long SprCwrSearchStd::Ntokens_av_ap |
maximum beam size after pruning
| double SprCwrSearchStd::av_threshold |
average pruning threshold (adaptive)
| unsigned int SprCwrSearchStd::frame_cnt |
nr. of frames used to make up the token scores (also used to calculate some statistics)
| int SprCwrSearchStd::iframe |
number of the current frame
| int SprCwrSearchStd::iframe0 |
the number of the very first frame
| int SprCwrSearchStd::sub_iframe |
sub divide one frame as to have an ordining for the mepty words
| int SprCwrSearchStd::mx_empty_word_cnt |
maximum number of empty words that can be insterted in a recognized string
| double SprCwrSearchStd::score_offset |
scaling offset for all scores
| float SprCwrSearchStd::best_WE_score |
score of the best word final token
| float SprCwrSearchStd::estim_best_score |
score of the best token refering to a non word final lexicon node, used to estimate the best score in the next frame
| int SprCwrSearchStd::estim_best_state |
the state to be used for updating the estim_best_score field
| float SprCwrSearchStd::best_score |
score of the best token (so far)
| float SprCwrSearchStd::limit_score |
best score minus the pruning threshold
| float SprCwrSearchStd::adapt_threshold |
adaptive version of the pruning threshold, predicted version
| float SprCwrSearchStd::adapt_threshold0 |
adaptive version of the pruning threshold, predicted version without acoustic entropy correction
| float SprCwrSearchStd::adapt_thr_prev |
adaptive version of the pruning threshold applied for the previous frame
| float SprCwrSearchStd::adapt_log_bw |
log of the previous beam width
| float SprCwrSearchStd::adapt_log_bw_max |
log of the maximum beam width
| float SprCwrSearchStd::adapt_thr_bw_stats[4] |
learn the relation between y=delta(threshold) and x=f(delta(log(bw)),log(bw),H_av); adapt_thr_bw_stats stores [xy=flt1_w(x*y),xx=flt1_w(x*x),flt2(o=y-xy/xx),o] statistics with flt1_w() a 1st order weighted filter, flt2() a 1st order filter; see beam2threshold_update() for details about f() and the filters flt1() and flt2()
| float SprCwrSearchStd::threshold_max |
maximum pruning threshold (specified by the user)
| float SprCwrSearchStd::threshold_frac |
maximum increase in threshold to asure a minimum beam_width
| float SprCwrSearchStd::word_threshold_frac |
fraction of the threshold used when new words should be started
| float SprCwrSearchStd::asf |
acoustic score scale factor
| float SprCwrSearchStd::H_lim0 |
acoustic entropy based pruning: entropies below this value are OK
| float SprCwrSearchStd::H_lim1 |
acoustic entropy based pruning: entropies above this value get maximal correction of the threshold
| float SprCwrSearchStd::H_factor |
acoustic entropy based pruning: decrease pruning theshold with max(min(H_av_flt,H_lim1)-H_lim0,0)*H_factor
| float SprCwrSearchStd::H_av_flt |
acoustic entropy based pruning: time constant of the first order averaging filter
| float SprCwrSearchStd::H_av |
acoustic entropy based pruning: the average entropy over the last H_av frames (first order filter)
| float SprCwrSearchStd::N_av_tgt |
maximal beam width over a long interval
| float SprCwrSearchStd::N_av_tgt_flt |
maximal beam width over a long interval (go there slowly)
| float SprCwrSearchStd::N_av_flt |
average beam width over a long interval: time constant of the first order averaging filter
| float SprCwrSearchStd::N_av |
average beam width over a long interval
| float SprCwrSearchStd::N_av_ss |
react differently on sentence starts and silences
| SprCwrSearchAdaptSpkr* SprCwrSearchStd::spkr_settings |
current speaker specific settings
| SprCwrSearchAdaptSpkr* SprCwrSearchStd::spkr_list |
list that contains all speaker specific settings (LRU-cache)
| int* SprCwrSearchStd::score_hist |
histogram, used in an algorithme to quickly find the Ntokens_max best hypotheses
| int SprCwrSearchStd::score_hist_size |
the size (nr. of tabs) of the score histogram
| SprCwrSearchResult SprCwrSearchStd::topn |
the best (sorted array) (partial) recognition results in CWR and IWR mode
| SprCwrSearchResult SprCwrSearchStd::best |
the best scoring sentence, word sequence or word
| double SprCwrSearchStd::recog_score |
score assigned to the recognition result
| SprCwrBeamHandler* SprCwrSearchStd::beam_handler |
trap-function to extract information on the content of the search beam at every frame
| SprCwrSearchUpdate* SprCwrSearchStd::update_block |
Request an update of the decoder parameters.
1.8.6