SPRAAK
SprCwrSearchStd Struct Reference

Data Fields

unsigned int load_flags
 which parts (phone_set, lexicon, hmm, ...) are loaded

unsigned int modif_must_do
 combine reset and free operations during modifications

unsigned int modif_have_done
 combine reset and free operations during modifications

char store_seq
 do (not) store the node/phoneme sequences during the search

char recog_mode
 recognition mode

char lm_fwd_mode
 type of LM-forwarding that is used

char sent_end_flags
 for sentence/word end in CWR_best_sentence()

char wexpand_flags
 word expand flags

char sdet_avail
 the preprocessing gives output on sentence boundaries

char sdet_cont

char sdet_new_res

char gauss_sel
 gaussian selection status
 
int sdet_count
 
int sdet_out
 
SprCwrWordSet sentence_begin
 word index/indices of <s>

SprCwrWordSet sentence_end
 word index/indices of </s>

float lm_A

float lm_C
 rescale the language model cost: cost = LM_cost*A+C

unsigned int lm_cache_size
 target size of the LM-cache

unsigned int lmc_hash_size
 target size of the LM-context hashing

float * slmsp

float * slmsp0
 the unshifted unigram prob array

int * slmsp_wset_mx
 word with best unigram probability in a word set

int * fwd_comb_inst
 definition of word sets when using full LM-forwarding

int ** fwd_icomb_inst
 instructions for a specific word set

int fwd_wset_len
 number of new word sets introduced for full LM-forwarding

int Ntokens_max
 maximal beam width (absolute limit on the nr. of open tokens)

int Ntokens_min
 minimal beam width (absolute limit on the nr. of open tokens)
 
int Ntokens_wstart
 
int Ntokens_wend
 
int Ntokens_wint
 
int Nctokens_wend
 
int Nctokens_wint
 
unsigned int beam_corr_up
 
unsigned int beam_corr_down
 
unsigned long long Ntokens_av_ax
 maximum beam size after word end expansion

unsigned long long Ntokens_av_bp
 maximum beam size before pruning

unsigned long long Ntokens_av_ap
 maximum beam size after pruning

double av_threshold
 average pruning threshold (adaptive)

unsigned int frame_cnt

int iframe
 number of the current frame

int iframe0
 the number of the very first frame
 
int sub_iframe
 
int mx_empty_word_cnt
 
double score_offset
 scaling offset for all scores

float best_WE_score
 score of the best word final token

float estim_best_score

int estim_best_state
 the state to be used for updating the estim_best_score field

float best_score
 score of the best token (so far)

float limit_score
 best score minus the pruning threshold

float adapt_threshold
 adaptive version of the pruning threshold, predicted version

float adapt_threshold0

float adapt_thr_prev

float adapt_log_bw
 log of the previous beam width

float adapt_log_bw_max
 log of the maximum beam width

float adapt_thr_bw_stats [4]

float threshold_max
 maximum pruning threshold (specified by the user)

float threshold_frac
 maximum increase in threshold to assure a minimum beam_width

float word_threshold_frac

float asf
 acoustic score scale factor
 
float H_lim0
 
float H_lim1
 
float H_factor
 
float H_av_flt
 
float H_av
 
float N_av_tgt
 maximal beam width over a long interval

float N_av_tgt_flt
 maximal beam width over a long interval (go there slowly)

float N_av_flt

float N_av
 average beam width over a long interval

float N_av_ss
 react differently on sentence starts and silences

SprCwrSearchAdaptSpkr * spkr_settings
 current speaker specific settings

SprCwrSearchAdaptSpkr * spkr_list
 list that contains all speaker specific settings (LRU-cache)

int * score_hist

int score_hist_size
 the size (nr. of tabs) of the score histogram

SprCwrSearchResult topn

SprCwrSearchResult best
 the best scoring sentence, word sequence or word

double recog_score
 score assigned to the recognition result

SprCwrBeamHandler * beam_handler

SprCwrSearchUpdate * update_block
 Request an update of the decoder parameters.
 

Detailed Description

things needed for this implementation (the standard implementation)

Field Documentation

unsigned int SprCwrSearchStd::load_flags

which parts (phone_set, lexicon, hmm, ...) are loaded

unsigned int SprCwrSearchStd::modif_must_do

combine reset and free operations during modifications

unsigned int SprCwrSearchStd::modif_have_done

combine reset and free operations during modifications

char SprCwrSearchStd::store_seq

do (not) store the node/phoneme sequences during the search

char SprCwrSearchStd::recog_mode

recognition mode

char SprCwrSearchStd::lm_fwd_mode

type of LM-forwarding that is used

char SprCwrSearchStd::sent_end_flags

for sentence/word end in CWR_best_sentence()

char SprCwrSearchStd::wexpand_flags

word expand flags

char SprCwrSearchStd::sdet_avail

the preprocessing gives output on sentence boundaries

char SprCwrSearchStd::sdet_cont

the recognition stopped at a sentence boundary, further data is available (continuous loop)

char SprCwrSearchStd::sdet_new_res

the recognition has found some new non-silence frames after a sentence boundary, and has thus produced some new results

char SprCwrSearchStd::gauss_sel

gaussian selection status

bit 0: do gaussian selection (1/0); bit 1: gaussian selection must be initialized
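
The bit layout above can be read with ordinary mask tests. The following is a minimal sketch; the mask and helper names are hypothetical and not part of the SPRAAK API, only the two bit positions come from the description.

```c
#include <stdbool.h>

/* Hypothetical mask names; only the bit positions are taken from the
 * field description. */
enum {
    GAUSS_SEL_ENABLED   = 1 << 0, /* bit 0: do gaussian selection (1/0) */
    GAUSS_SEL_NEED_INIT = 1 << 1  /* bit 1: gaussian selection must be initialized */
};

/* True when gaussian selection is switched on. */
static bool gauss_sel_enabled(char gauss_sel)
{
    return (gauss_sel & GAUSS_SEL_ENABLED) != 0;
}

/* True when gaussian selection still has to be initialized. */
static bool gauss_sel_needs_init(char gauss_sel)
{
    return (gauss_sel & GAUSS_SEL_NEED_INIT) != 0;
}

/* Clear the "must be initialized" bit, leaving the other bits untouched. */
static char gauss_sel_mark_initialized(char gauss_sel)
{
    return (char)(gauss_sel & ~GAUSS_SEL_NEED_INIT);
}
```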

int SprCwrSearchStd::sdet_count

skip pause between sentences with a periodic return to the main program

int SprCwrSearchStd::sdet_out

the preprocessing may mark whether the frame is speech or not (sentence detection)

SprCwrWordSet SprCwrSearchStd::sentence_begin

word index/indices of <s>

SprCwrWordSet SprCwrSearchStd::sentence_end

word index/indices of </s>

float SprCwrSearchStd::lm_A
float SprCwrSearchStd::lm_C

rescale the language model cost: cost = LM_cost*A+C
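
As an illustration of the formula, the affine mapping could be written as below. The function name is hypothetical; only the relation cost = LM_cost*A+C is taken from the description.

```c
/* Affine mapping of a raw language model cost: cost = LM_cost*A + C.
 * Hypothetical helper, shown only to make the roles of lm_A (scale)
 * and lm_C (offset) explicit. */
static float lm_rescale_cost(float lm_cost, float lm_A, float lm_C)
{
    return lm_cost * lm_A + lm_C;
}
```

With lm_A = 1 and lm_C = 0 the cost is passed through unchanged.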

unsigned int SprCwrSearchStd::lm_cache_size

target size of the LM-cache

unsigned int SprCwrSearchStd::lmc_hash_size

target size of the LM-context hashing

float* SprCwrSearchStd::slmsp

max. unigram probabilities (unscaled, shifted to account for negative indices); a trivial way to forward the LM influence

float* SprCwrSearchStd::slmsp0

the unshifted unigram prob array
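
A shifted pointer next to an unshifted one is a common C idiom for arrays that must accept negative indices. The sketch below shows that idiom in isolation. The type and function names are hypothetical, and it is an assumption that slmsp and slmsp0 relate to each other in exactly this way.

```c
#include <stdlib.h>

/* Hypothetical pair of pointers into one allocation. */
typedef struct {
    float *base;    /* unshifted array; this is what free() must receive */
    float *shifted; /* base + n_neg; valid indices are [-n_neg, n_pos) */
} ShiftedArray;

/* Allocate n_neg + n_pos zeroed floats and set up both views.
 * On allocation failure both pointers are NULL. */
static ShiftedArray shifted_array_alloc(int n_neg, int n_pos)
{
    ShiftedArray a;
    a.base = calloc((size_t)(n_neg + n_pos), sizeof *a.base);
    a.shifted = a.base ? a.base + n_neg : NULL;
    return a;
}
```

Because the shifted pointer still points inside the allocated block, indexing it with values down to -n_neg is well defined.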

int* SprCwrSearchStd::slmsp_wset_mx

word with best unigram probability in a word set

int* SprCwrSearchStd::fwd_comb_inst

definition of word sets when using full LM-forwarding

int** SprCwrSearchStd::fwd_icomb_inst

instructions for a specific word set

int SprCwrSearchStd::fwd_wset_len

number of new word sets introduced for full LM-forwarding

int SprCwrSearchStd::Ntokens_max

maximal beam width (absolute limit on the nr. of open tokens)

int SprCwrSearchStd::Ntokens_min

minimal beam width (absolute limit on the nr. of open tokens)

int SprCwrSearchStd::Ntokens_wstart

number of new tokens created in word end splitting (from group to individual words) and cross word transitions (word startup)

int SprCwrSearchStd::Ntokens_wend

number of tokens propagated to a word end position (after observe+jump & pruning)

int SprCwrSearchStd::Ntokens_wint

number of tokens propagated to a word internal position (after observe+jump & pruning)

int SprCwrSearchStd::Nctokens_wend

number of ctokens propagated to a word end position (after observe+jump & pruning)

int SprCwrSearchStd::Nctokens_wint

number of ctokens propagated to a word internal position (after observe+jump & pruning)

unsigned int SprCwrSearchStd::beam_corr_up

nr. of corrections to the threshold in order to get a larger beam

unsigned int SprCwrSearchStd::beam_corr_down

nr. of corrections to the threshold in order to get a smaller beam

unsigned long long SprCwrSearchStd::Ntokens_av_ax

maximum beam size after word end expansion

unsigned long long SprCwrSearchStd::Ntokens_av_bp

maximum beam size before pruning

unsigned long long SprCwrSearchStd::Ntokens_av_ap

maximum beam size after pruning

double SprCwrSearchStd::av_threshold

average pruning threshold (adaptive)

unsigned int SprCwrSearchStd::frame_cnt

nr. of frames used to make up the token scores (also used to calculate some statistics)

int SprCwrSearchStd::iframe

number of the current frame

int SprCwrSearchStd::iframe0

the number of the very first frame

int SprCwrSearchStd::sub_iframe

subdivide one frame so as to have an ordering for the empty words

int SprCwrSearchStd::mx_empty_word_cnt

maximum number of empty words that can be inserted in a recognized string

double SprCwrSearchStd::score_offset

scaling offset for all scores

float SprCwrSearchStd::best_WE_score

score of the best word final token

float SprCwrSearchStd::estim_best_score

score of the best token referring to a non-word-final lexicon node, used to estimate the best score in the next frame

int SprCwrSearchStd::estim_best_state

the state to be used for updating the estim_best_score field

float SprCwrSearchStd::best_score

score of the best token (so far)

float SprCwrSearchStd::limit_score

best score minus the pruning threshold
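
A minimal sketch of how such a limit is typically used in beam pruning follows. The function names are hypothetical, and it is an assumption that higher scores are better (as with log-likelihoods), which is what "best score minus the pruning threshold" suggests.

```c
#include <stdbool.h>

/* limit = best score minus the pruning threshold. */
static float compute_limit_score(float best_score, float threshold)
{
    return best_score - threshold;
}

/* Assumption: higher scores are better, so a token is kept when its
 * score does not fall below the limit. */
static bool token_survives(float token_score, float limit_score)
{
    return token_score >= limit_score;
}
```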

float SprCwrSearchStd::adapt_threshold

adaptive version of the pruning threshold, predicted version

float SprCwrSearchStd::adapt_threshold0

adaptive version of the pruning threshold, predicted version without acoustic entropy correction

float SprCwrSearchStd::adapt_thr_prev

adaptive version of the pruning threshold applied for the previous frame

float SprCwrSearchStd::adapt_log_bw

log of the previous beam width

float SprCwrSearchStd::adapt_log_bw_max

log of the maximum beam width

float SprCwrSearchStd::adapt_thr_bw_stats[4]

learn the relation between y=delta(threshold) and x=f(delta(log(bw)),log(bw),H_av); adapt_thr_bw_stats stores [xy=flt1_w(x*y),xx=flt1_w(x*x),flt2(o=y-xy/xx),o] statistics with flt1_w() a 1st order weighted filter, flt2() a 1st order filter; see beam2threshold_update() for details about f() and the filters flt1() and flt2()

float SprCwrSearchStd::threshold_max

maximum pruning threshold (specified by the user)

float SprCwrSearchStd::threshold_frac

maximum increase in threshold to assure a minimum beam_width

float SprCwrSearchStd::word_threshold_frac

fraction of the threshold used when new words should be started

float SprCwrSearchStd::asf

acoustic score scale factor

float SprCwrSearchStd::H_lim0

acoustic entropy based pruning: entropies below this value are OK

float SprCwrSearchStd::H_lim1

acoustic entropy based pruning: entropies above this value get maximal correction of the threshold

float SprCwrSearchStd::H_factor

acoustic entropy based pruning: decrease pruning threshold with max(min(H_av_flt,H_lim1)-H_lim0,0)*H_factor
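
The correction term in this formula can be sketched as a small clamp. The function name is hypothetical, and the first argument stands for whatever averaged entropy value the formula refers to; how the decoder actually applies the result to the threshold is not shown here.

```c
/* Amount by which the pruning threshold is decreased:
 *   max(min(H, H_lim1) - H_lim0, 0) * H_factor
 * Entropies below H_lim0 give no correction; entropies above H_lim1
 * give the maximal correction (H_lim1 - H_lim0) * H_factor. */
static float entropy_threshold_correction(float H, float H_lim0,
                                          float H_lim1, float H_factor)
{
    float capped = H < H_lim1 ? H : H_lim1;   /* min(H, H_lim1) */
    float excess = capped - H_lim0;
    if (excess < 0.0f)
        excess = 0.0f;                        /* max(..., 0) */
    return excess * H_factor;
}
```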

float SprCwrSearchStd::H_av_flt

acoustic entropy based pruning: time constant of the first order averaging filter

float SprCwrSearchStd::H_av

acoustic entropy based pruning: the average entropy over the last H_av frames (first order filter)

float SprCwrSearchStd::N_av_tgt

maximal beam width over a long interval

float SprCwrSearchStd::N_av_tgt_flt

maximal beam width over a long interval (go there slowly)

float SprCwrSearchStd::N_av_flt

average beam width over a long interval: time constant of the first order averaging filter

float SprCwrSearchStd::N_av

average beam width over a long interval
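
For readers unfamiliar with first order averaging filters, one common form is sketched below. This is an assumption about the general technique, not a statement of the exact update used by the decoder; the function name is hypothetical and the time constant is expressed in frames.

```c
/* One step of a first order (exponential) averaging filter:
 *   avg <- avg + (sample - avg) / time_const
 * A larger time constant makes the average react more slowly.
 * Hypothetical helper; time_const is assumed to be >= 1. */
static float first_order_avg(float avg, float sample, float time_const)
{
    return avg + (sample - avg) / time_const;
}
```

Called once per frame with the current beam width as the sample, such a filter would track the average beam width over roughly time_const frames.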

float SprCwrSearchStd::N_av_ss

react differently on sentence starts and silences

SprCwrSearchAdaptSpkr* SprCwrSearchStd::spkr_settings

current speaker specific settings

SprCwrSearchAdaptSpkr* SprCwrSearchStd::spkr_list

list that contains all speaker specific settings (LRU-cache)

int* SprCwrSearchStd::score_hist

histogram, used in an algorithm to quickly find the Ntokens_max best hypotheses
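
Histogram pruning in general avoids a full sort: scores are binned by their distance to the best score, and the bins are accumulated until the token limit would be exceeded. The sketch below shows that general idea under stated assumptions (higher scores are better, uniform bin width, hypothetical function and parameter names); the binning actually used by the decoder may differ.

```c
#include <string.h>

/* Fill hist (n_bins entries) with the number of scores per bin, where
 * bin 0 holds the scores closest to best_score. Return how many leading
 * bins can be kept without keeping more than n_max tokens; tokens in
 * later bins would be pruned. Scores further away than the last bin are
 * counted in the last bin. */
static int hist_bins_to_keep(const float *scores, int n_scores,
                             float best_score, float bin_width,
                             int *hist, int n_bins, int n_max)
{
    int i, kept = 0;

    memset(hist, 0, (size_t)n_bins * sizeof *hist);
    for (i = 0; i < n_scores; i++) {
        int bin = (int)((best_score - scores[i]) / bin_width);
        if (bin < 0)
            bin = 0;
        if (bin >= n_bins)
            bin = n_bins - 1;
        hist[bin]++;
    }
    for (i = 0; i < n_bins; i++) {
        if (kept + hist[i] > n_max)
            break;              /* adding this bin would exceed the limit */
        kept += hist[i];
    }
    return i;
}
```

The cost is linear in the number of tokens plus the number of bins, at the price of a cut-off that is only as precise as the bin width.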

int SprCwrSearchStd::score_hist_size

the size (nr. of tabs) of the score histogram

SprCwrSearchResult SprCwrSearchStd::topn

the best (sorted array) (partial) recognition results in CWR and IWR mode

SprCwrSearchResult SprCwrSearchStd::best

the best scoring sentence, word sequence or word

double SprCwrSearchStd::recog_score

score assigned to the recognition result

SprCwrBeamHandler* SprCwrSearchStd::beam_handler

trap-function to extract information on the content of the search beam at every frame

SprCwrSearchUpdate* SprCwrSearchStd::update_block

Request an update of the decoder parameters.