Commit Graph

8 Commits

Author SHA1 Message Date
qianlong 4f16f036be Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp
add CaseFold, NormalizeUTF8

add RegexReplace

add RegexTokenizer

add BasicTokenizer

add WordpieceTokenizer

add BertTokenizer
2020-06-17 15:47:04 +08:00
chenjianping af8108c9e1 support host reduce scatter and allgather 2020-06-04 17:20:47 +08:00
xulei2020 163b6b7ea7 add jieba c++ code 2020-05-20 15:55:12 +08:00
xiefangqi c937bad53f minddata support voc 2020-05-15 13:24:03 +08:00
chenjianping 1286767d0e support building on windows 2020-04-15 09:36:48 +00:00
zhoufeng f385f2a48b default build command
"-z" (minddata) and "-M on" (gpu) by default in build.sh
2020-04-09 10:50:22 +08:00
Wei Luning 73ba399364 remove ge depend in cpu 2020-04-02 19:38:42 +08:00
zhunaipan 930a1fb0a8 initial version
Signed-off-by: leonwanghui <leon.wanghui@huawei.com>
2020-03-27 22:54:54 +08:00