Skip to content

Latest commit

 

History

History
52 lines (38 loc) · 2 KB

README.md

File metadata and controls

52 lines (38 loc) · 2 KB

Syntaxnet for Russian

Google's SyntaxNet Parser and POS tagger with a model for Russian language.

Usage


1. Single parse using shell:

echo "мама мыла раму" | docker run --rm -i inemo/syntaxnet_rus
...
Input: Name this boat
Parse (CONLL format):
1       мама    _       NOUN    _       Animacy=Anim|Case=Nom|Gender=Fem|Number=Sing|fPOS=NOUN++        2       nsubj   _       _
2       мыла    _       VERB    _       Aspect=Imp|Gender=Fem|Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin|Voice=Act|fPOS=VERB++        0  ROOT     _       _
3       раму    _       NOUN    _       Animacy=Inan|Case=Acc|Gender=Fem|Number=Sing|fPOS=NOUN++        2       dobj    _       _

2. Standalone SyntaxNet server that does not recreate models (stays alive) (unstable):

docker run --shm-size=1024m -i --rm -p 8111:9999 inemo/syntaxnet_rus python /root/models/syntaxnet/bazel-bin/syntaxnet/parser_eval.runfiles/__main__/syntaxnet/api/syntaxnet_rus_api.py --host=0.0.0.0 --port=9999

Note that, although the current container installs model for Russian, the implemented server can be used for any language (any model trained in SyntaxNet).

2.1 You also can use the server in conjunction with SyntaxNet python wrapper.

2.2 You can use telnet to talk with parser (be aware about escape problems of unicode in telnet, e.g., 'маму' will not work by default via telnet):

telnet localhost 8111
мама мыла
1       мама    _       NOUN    _       Animacy=Anim|Case=Nom|Gender=Fem|Number=Sing|fPOS=NOUN++        2       nsubj   _       _
2       мыла    _       VERB    _       Aspect=Imp|Gender=Fem|Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin|Voice=Act|fPOS=VERB++        0  ROOT     _       _

Updating


docker login
docker build -t inemo/syntaxnet_rus --no-cache . && docker push inemo/syntaxnet_rus