SPRAAK
Corpus File

A corpus file contains information on a database. It gives the transcription of the data in a file or a part of a file. The data itself may be sampled data. Corpus files are typically stored with the extension '.cor'.

A corpus file abstracts away the root directory and the file name extensions. Hence, the same corpus file can be used for the sampled data as well as for all data derived from it, such as feature data, labels, etc.
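
As a rough illustration of this abstraction, the sketch below combines one entry name with different root directories and extensions. The helper `resolve_path`, the directory names and the extensions are hypothetical and chosen only for this example; they are not part of SPRAAK.

```python
from pathlib import PurePosixPath


def resolve_path(root, entry_name, extension):
    """Join a root directory, a corpus entry name and an extension.

    Hypothetical helper, not a SPRAAK function.
    """
    return str(PurePosixPath(root) / (entry_name + extension))


# The same entry name addresses the sampled data and data derived from it.
# Directories and extensions below are made up for the example.
sample_path = resolve_path("/data/wav", "tst", ".wav")
feature_path = resolve_path("/data/feat", "tst", ".feat")
```

Only the root directory and the extension change between the two calls; the entry name taken from the corpus file stays the same.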

Keys

The important header keys are:

Data

The first four fields in a corpus entry have a predefined meaning. Additional fields are optional and can be interpreted on a program-specific basis. Hence, a corpus entry takes the form:

FILENAME TRANSCRIPTION  F1 F2  [OPT_DATA]
Example
.spr
DATA    CORPUS
DIM1    5
#
1       one     0 -1 C
347     three_four_seven 0 -1 F
tst     This    0  24 M
-       is      25 39 M
-       a_test  40 70 M
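
A minimal sketch of how such a file could be read, under assumptions inferred from the example rather than stated in the text: a line holding only '#' ends the header, F1 and F2 are integers, and a file name of '-' repeats the file name of the previous entry. The names `CorpusEntry` and `parse_corpus` are hypothetical and not part of SPRAAK.

```python
from typing import List, NamedTuple


class CorpusEntry(NamedTuple):
    filename: str
    transcription: str
    f1: int
    f2: int
    opt_data: List[str]


def parse_corpus(text):
    """Parse the entries of a corpus file given as a string (sketch only)."""
    lines = [line.strip() for line in text.splitlines()]
    # Assumption: a line holding only '#' separates the header from the data.
    start = lines.index("#") + 1 if "#" in lines else 0
    entries = []
    previous = None
    for line in lines[start:]:
        fields = line.split()
        if len(fields) < 4:
            continue  # skip blank or incomplete lines
        name = fields[0]
        # Assumption: '-' reuses the file name of the previous entry.
        if name == "-" and previous is not None:
            name = previous
        previous = name
        entries.append(
            CorpusEntry(name, fields[1], int(fields[2]), int(fields[3]), fields[4:])
        )
    return entries


EXAMPLE = """.spr
DATA    CORPUS
DIM1    5
#
1       one     0 -1 C
347     three_four_seven 0 -1 F
tst     This    0  24 M
-       is      25 39 M
-       a_test  40 70 M
"""

entries = parse_corpus(EXAMPLE)
```

The header keys are skipped here instead of being interpreted, and the optional fields are kept as plain strings so that each program can interpret them in its own way.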
Remarks & Limitations: