-
Notifications
You must be signed in to change notification settings - Fork 4
IndraTranslation
inen is a transfer grammar from INDRA (in) to ERG (en) (see LogonTransfer).
inen is in tm folder, along with other transfer grammars, which can be downloaded from the Github.
For INDRA, we cloned the tm folder to grammar folder.
~/grammar$ git clone http://github.com/sanghoun/tm.git
1. update and compile the source grammar INDRA.
~/ind$ ace -g ace/config.tdl -G ind.dat
2. update and compile the target grammar ERG.
~logon/lingo/erg$ ace -g ace/config.tdl -G erg.dat
3. update and compile the transfer grammar inen.
~/grammar/tm/inen$ ace -g ace/config.tdl -G inen.dat
4. translating by INPUT | PARSING | TRANSFER | GENERATION
$ echo "anjing menggonggong" | ace -g YOUR_INDRA_DIRECTORY/ind.dat | ace -g YOUR_INEN_DIRECTORY/inen.dat | ace -g YOUR_ERG_DIRECTORY/erg.dat -e
1. Edit in.vpm under ~/grammar/tm/inen to transfer e.g.:
- the underspecified tense in Indonesian into present tense in English,
- third person (underspecified for number) in Indonesian into third person plural in English,
- perfect aspect in Indonesian into past tense in English.
E.TENSE : TENSE
tense >> pres
PNG.PERNUM : PERS NUM
3rd >> 3 pl
E.ASPECT : TENSE
perf >> past
2. In out.vpm
TENSE : TENSE
* >> *
PERS : PERS
* >> *
NUM : NUM
* >> *
3. In test.mtr, map menggonggong to bark and anjing to dog
bark_mtr := monotonic_mtr &
[ INPUT.RELS < [ PRED "_menggonggong_v_rel" ] >,
OUTPUT.RELS < +copy+ & [ PRED "_bark_v_1_rel" ] > ].
dog_mtr := monotonic_mtr &
[ INPUT.RELS < [ PRED "_anjing_n_rel" ] >,
OUTPUT.RELS < +copy+ & [ PRED "_dog_n_1_rel" ] > ].
udef_mtr := monotonic_mtr &
[ INPUT.RELS < [ PRED "exist_q_rel" ] >,
OUTPUT.RELS < [ PRED udef_q_rel ] > ].
If we find a problem in transferring and generating the translation, we should do debugging.
1. Check the want-to-be MRS in the target grammar ERG, save it in a file.
2. Compare with the MRS in the source grammar INDRA and in the transfer grammar inen.
$ echo "anjing menggonggong" | ace -g YOUR_INDRA_DIRECTORY/ind.dat -Tf -vv | less
$ echo "anjing menggonggong" | ace -g YOUR_INDRA_DIRECTORY/ind.dat | ace -g YOUR_INEN_DIRECTORY/inen.dat -Tf -vv | less
3. Edit the grammar(s) or transfer rules
Home | Forum | Discussions | Events