A processor for text data where the datalines are pointers to text files; it also normalizes the text.
Public Attributes
- normalizer
- alphabet
- max_length
- sequence_length_histogram
- nonesymbol
- conf
§ __init__()
def nabu.processing.processors.textfile_processor.TextFileProcessor.__init__(self, conf)
TextFileProcessor constructor.
- Parameters
| conf | processor configuration as a configparser |
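As a rough illustration of what "processor configuration as a configparser" can look like, the sketch below builds a configuration object in memory. The section name `processor` and the option names `normalizer`, `alphabet` and `nonesymbol` are assumptions derived from the attribute names listed on this page, not a confirmed nabu configuration format. The helper `build_processor_conf` and the normalizer name `lowercase` are hypothetical, and the example uses the Python 3 `configparser` module.

```python
import configparser


def build_processor_conf(normalizer, alphabet, nonesymbol="None"):
    """Build an in-memory processor configuration (hypothetical layout)."""
    conf = configparser.ConfigParser()
    # Section and option names are assumptions based on the attribute names.
    conf["processor"] = {
        "normalizer": normalizer,
        "alphabet": " ".join(alphabet),  # store the alphabet space separated
        "nonesymbol": nonesymbol,
    }
    return conf


conf = build_processor_conf("lowercase", ["a", "b", "c", "<space>"])

# A constructor could then read the options back like this.
alphabet = conf.get("processor", "alphabet").split(" ")
normalizer_name = conf.get("processor", "normalizer")
```

In practice such a configuration would more likely be read from a file with `conf.read(path)`; the in-memory version only keeps the example self-contained.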
§ __call__()
def nabu.processing.processors.textfile_processor.TextFileProcessor.__call__(self, dataline)
Process the data in dataline.
- Parameters
| dataline | a pointer to the text file to process |
- Returns
- The normalized text as a string
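The call behaviour described above (take a pointer to a text file, return the normalized text) can be sketched as follows. This is not the nabu implementation: `SketchTextFileProcessor` and `simple_normalizer` are hypothetical stand-ins, the normalization rule (lowercase, map spaces to `<space>`, drop symbols outside the alphabet) is an assumption, and the bookkeeping of `max_length` and `sequence_length_histogram` is only a guess at how those attributes might be maintained.

```python
import os
import tempfile


def simple_normalizer(text, alphabet):
    """Hypothetical normalizer: lowercase, mark spaces, drop unknown symbols."""
    tokens = ["<space>" if c == " " else c for c in text.strip().lower()]
    return " ".join(t for t in tokens if t in alphabet)


class SketchTextFileProcessor:
    """Illustrative stand-in for a text file processor (not the nabu class)."""

    def __init__(self, alphabet):
        self.alphabet = alphabet
        self.max_length = 0
        self.sequence_length_histogram = []

    def __call__(self, dataline):
        # The dataline is assumed to be a path to a plain text file.
        with open(dataline.strip()) as fid:
            text = fid.read()
        normalized = simple_normalizer(text, self.alphabet)

        # Track sequence length statistics (assumed metadata bookkeeping).
        seq_length = len(normalized.split(" ")) if normalized else 0
        self.max_length = max(self.max_length, seq_length)
        missing = seq_length + 1 - len(self.sequence_length_histogram)
        if missing > 0:
            self.sequence_length_histogram.extend([0] * missing)
        self.sequence_length_histogram[seq_length] += 1
        return normalized


with tempfile.TemporaryDirectory() as tmpdir:
    path = os.path.join(tmpdir, "utt1.txt")
    with open(path, "w") as fid:
        fid.write("Ab c!\n")
    processor = SketchTextFileProcessor(["a", "b", "c", "<space>"])
    result = processor(path)
```

Making the processor callable keeps the caller's loop simple: each dataline is passed straight to the object, and the statistics accumulate on the instance as a side effect.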
§ write_metadata()
def nabu.processing.processors.textfile_processor.TextFileProcessor.write_metadata(self, datadir)
Write the processor metadata to disk.
- Parameters
| datadir | the directory where the metadata should be written |
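A minimal sketch of writing such metadata to a directory is shown below. Which values are written, the file names and the plain-text format are all assumptions chosen to mirror the attribute names on this page; the function `write_metadata_sketch` is hypothetical and does not reproduce the actual nabu file layout.

```python
import os
import tempfile


def write_metadata_sketch(datadir, max_length, alphabet, histogram):
    """Write processor metadata as small text files (assumed layout)."""
    os.makedirs(datadir, exist_ok=True)
    with open(os.path.join(datadir, "max_length"), "w") as fid:
        fid.write(str(max_length))
    with open(os.path.join(datadir, "alphabet"), "w") as fid:
        fid.write(" ".join(alphabet))
    with open(os.path.join(datadir, "sequence_length_histogram"), "w") as fid:
        fid.write(" ".join(str(count) for count in histogram))


with tempfile.TemporaryDirectory() as tmpdir:
    write_metadata_sketch(tmpdir, 4, ["a", "b"], [0, 0, 0, 0, 1])
    written = sorted(os.listdir(tmpdir))
    # Read one value back to check that it round-trips.
    with open(os.path.join(tmpdir, "max_length")) as fid:
        stored_max_length = int(fid.read())
```

Storing each value in its own file keeps the metadata easy to inspect by hand, at the cost of a few more files per data directory.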