tools/build_vocab.lua
build_vocab.lua
options:
-h [<boolean>]
(default:false
)
This help.-md [<boolean>]
(default:false
)
Dump help in Markdown format.-config <string>
(default:''
)
Load options from this file.-save_config <string>
(default:''
)
Save options to this file.
Vocabulary options¶
-data <string>
(required)
Data file.-save_vocab <string>
(required)
Vocabulary dictionary prefix.-vocab_size <table>
(default:50000
)
List of source vocabularies size:word[ feat1[ feat2[ ...] ] ]
. If = 0, vocabularies are not pruned.-words_min_frequency <table>
(default:0
)
List of source words min frequency:word[ feat1[ feat2[ ...] ] ]
. If = 0, vocabularies are pruned by size.-keep_frequency [<boolean>]
(default:false
)
Keep frequency of words in dictionary.-idx_files [<boolean>]
(default:false
)
If set, each line of the data file starts with a first field which is the index of the sentence.
Logger options¶
-log_file <string>
(default:''
)
Output logs to a file under this path instead of stdout - if file name ending with json, output structure json.-disable_logs [<boolean>]
(default:false
)
If set, output nothing.-log_level <string>
(accepted:DEBUG
,INFO
,WARNING
,ERROR
,NONE
; default:INFO
)
Output logs at this level and above.