Load separate files
The data.Field parameters are listed in the torchtext documentation.
When you call build_vocab, torchtext adds <unk> to the vocabulary list. Set unk_token=None if you want to remove it. If sequential=True (the default), it also adds <pad> to the vocabulary. By default, <unk> and <pad> are placed at the beginning of the vocabulary list.
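To make the ordering concrete, here is a minimal plain-Python sketch of the behaviour described above. It is not torchtext's implementation: build_vocab_sketch is a hypothetical helper, and the tie-breaking rule (alphabetical among equally frequent words) is an assumption made only so the output is deterministic.

```python
from collections import Counter


def build_vocab_sketch(sentences, unk_token="<unk>", pad_token="<pad>", sequential=True):
    """Hypothetical stand-in that mimics the special-token handling described above."""
    # Padding is only relevant for sequential data, so drop it otherwise.
    if not sequential:
        pad_token = None
    # Special tokens go first; None means "do not add this token".
    specials = [tok for tok in (unk_token, pad_token) if tok is not None]
    counts = Counter(tok for sent in sentences for tok in sent)
    # Most frequent words first; ties are broken alphabetically (an assumption).
    words = sorted(counts, key=lambda w: (-counts[w], w))
    return specials + [w for w in words if w not in specials]


corpus = [["hello", "world"], ["hello", "torchtext"]]

default_vocab = build_vocab_sketch(corpus)
no_unk_vocab = build_vocab_sketch(corpus, unk_token=None)
label_vocab = build_vocab_sketch(
    [["pos"], ["neg"], ["pos"]], unk_token=None, sequential=False
)

print(default_vocab)  # ['<unk>', '<pad>', 'hello', 'torchtext', 'world']
print(no_unk_vocab)   # ['<pad>', 'hello', 'torchtext', 'world']
print(label_vocab)    # ['pos', 'neg']
```

The last call mirrors a typical label field: with unk_token=None and sequential=False, no special tokens are added, so the indices map directly to the classes.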