Problems with the ACE+LKB in the logon tree


#1

Trying to follow the instructions on page http://moin.delph-in.net/LogonAnswer, the first command failed with an error:

$ answer --erg --compile
reading configuration       from `/home/user/logon/lingo/erg/ace/config.tdl'
reading types               from `/home/user/logon/lingo/erg/ace/../mtr.tdl'
multibyte decoding failed: Invalid or incomplete multibyte or wide character
on string `config übertag-emission-path'

Any idea? What is strange here is that ACE version in the LOGON tree is the same I am using in the MacOS, 0.9.30. I was able to compile the grammar in the MacOS. But this error above happens in the LOGON tree runing in a Linux system. The cause of the error is the --compile argument that executes the code:

user@381fc3ed1319:~/logon$ /home/user/logon/lingo/answer/bin/linux.x86.64/ace -G /home/user/logon/lingo/erg/erg.dat -g /home/user/logon/lingo/erg/ace/config.tdl
reading configuration       from `/home/user/logon/lingo/erg/ace/config.tdl'
reading types               from `/home/user/logon/lingo/erg/ace/../mtr.tdl'
multibyte decoding failed: Invalid or incomplete multibyte or wide character
on string `config übertag-emission-path'
user@381fc3ed1319:~/logon$ /home/user/logon/lingo/answer/bin/linux.x86.64/ace -V
ACE version 0.9.30
compiled at 23:05:01 on Nov 21 2018

#2

Problem solved! The error was related to the locales. I changed the Docker file that I am using to run the Linux environment adding locales setttings.

BTW, I added a link to this repository in http://moin.delph-in.net/LkbMacintosh. This is just some small improvements over the initial Padro contribution.


#3

The problem with compilation was solved, but the parsing didn’t execute as expected. The error is below. It seems that inside the docker (http://github.com/own-pt/docker-logon), for some reason the systems are not establishing the connection. Does anyone have any idea?

I am trying to follow the documentation from http://moin.delph-in.net/LogonAnswer

$ ./parse --binary --erg+tnt/ace --protocol 2 --best 1 --limit 0 --count 8 --text lingo/erg/etc/repsol-100.txt
International Allegro CL Enterprise Edition
10.0 [64-bit Linux (x86-64)] (Jun 10, 2017 21:22)
Copyright (C) 1985-2015, Franz Inc., Oakland, CA, USA.  All Rights Reserved.

This standard runtime copy of Allegro CL was built by:
   [TC13152] Universitetet i Oslo (IFI)

; Loading /home/user/logon/dot.tsdbrc

[changing package from "COMMON-LISP-USER" to "TSDB"]
TSNLP(1): NIL
TSNLP(2): 5
TSNLP(3): 2
TSNLP(4): 5
TSNLP(5): NIL
TSNLP(6): T
TSNLP(7):
"/home/user/logon/lingo/lkb/src/tsdb/home/"
TSNLP(8): 0
TSNLP(9): NIL
TSNLP(10): 1
TSNLP(11): 2
TSNLP(12): ; Loading /home/user/logon/lingo/erg/lkb/script
set-coding-system(): activated UTF8.
;   Loading /home/user/logon/lingo/erg/Version.lsp
;   Loading /home/user/logon/lingo/erg/lkb/globals.lsp
;   Loading /home/user/logon/lingo/erg/lkb/user-fns.lsp
;   Loading /home/user/logon/lingo/erg/lkb/checkpaths.lsp
;   Loading /home/user/logon/lingo/erg/lkb/patches.lsp

Reading in type file fundamentals
Reading in type file tmt
Reading in type file lextypes
[17:09:23] gc-after-hook(): {L#637 N=5.4M O=0 E=100%} [S=2.4G R=102M].

Reading in type file syntax
[17:09:25] gc-after-hook(): {L#638 N=7.1M O=0 E=98%} [S=2.4G R=232M].

Reading in type file ctype
Reading in type file lexrules
Reading in type file auxverbs
[17:09:26] gc-after-hook(): {L#639 N=9.2M O=0 E=98%} [S=2.4G R=352M].

Reading in type file mtr
Reading in type file dt
Checking type hierarchy
Checking for unique greatest lower bounds
Expanding constraints
[17:09:32] gc-after-hook(): {L#640 N=55M O=5.2K E=99%} [S=2.4G R=352M].

Making constraints well formed
[17:09:33] gc-after-hook(): {L#641 N=72M O=4.9M E=82%} [S=2.4G R=357M].
[17:09:33] gc-after-hook(): {L#642 N=80M O=1.9M E=66%} [S=2.4G R=359M].
[17:09:34] gc-after-hook(): {L#643 N=87M O=2.2M E=68%} [S=2.4G R=361M].
[17:09:34] gc-after-hook(): {L#644 N=56M O=36M E=72%} [S=2.4G R=397M].
[17:09:34] gc-after-hook(): 137M tenured; forcing global gc().
[17:09:35] gc-after-hook(): {GR#8 N=33M O=0 E=100%} [S=2.4G R=424M].
[17:09:36] gc-after-hook(): {L#645 N=67M O=0 E=0%} [S=2.4G R=424M].
[17:09:37] gc-after-hook(): {L#646 N=72M O=14M E=72%} [S=2.4G R=438M].
[17:09:37] gc-after-hook(): {L#647 N=79M O=11M E=52%} [S=2.4G R=448M].
63385568 bytes have been tenured, next gc will be global.
See the documentation for variable EXCL:*GLOBAL-GC-BEHAVIOR* for more information.
[17:09:39] gc-after-hook(): {G#647 N=78M O=0 E=91%} [S=2.4G R=458M].
[17:09:39] gc-after-hook(): {L#648 N=93M O=0 E=0%} [S=2.4G R=458M].
[17:09:40] gc-after-hook(): {L#649 N=71M O=30M E=73%} [S=2.4G R=476M].
[17:09:41] gc-after-hook(): {L#650 N=71M O=18M E=31%} [S=2.4G R=495M].

Expanding defaults
Type file checked successfully
Computing display ordering
Reading in cached leaf types
Cached leaf types read
Reading in cached lexicon (main)
Cached lexicon read
Reading in rules file constructions
Reading in lexical rules file inflr
[17:09:42] gc-after-hook(): {L#651 N=75M O=15M E=61%} [S=2.4G R=510M].
66493600 bytes have been tenured, next gc will be global.
See the documentation for variable EXCL:*GLOBAL-GC-BEHAVIOR* for more information.

Reading in lexical rules file inflr-pnct
Reading in root file roots
Reading in lexical rules file lexrinst
Reading in parse node file parse-nodes
;   Loading /home/user/logon/lingo/erg/lkb/mrsglobals.lsp
;   Loading /home/user/logon/lingo/erg/lkb/eds.lsp
;   Loading /home/user/logon/lingo/erg/www/setup.lsp
; cpu time (non-gc) 13.460000 sec user, 0.100000 sec system
; cpu time (gc)     10.210000 sec user, 0.300000 sec system
; cpu time (total)  23.670000 sec user, 0.400000 sec system
; real time  21.263803 sec (113.2%)
; space allocation:
;  25,979,662 cons cells, 681,547,840 other bytes, 0 static bytes
; Page Faults: major: 0 (gc: 56918), minor: 146109 (gc: 56918)
;   Loading /home/user/logon/lingo/erg/rpp/setup.lsp
read-repp(): reading file `xml.rpp'.
read-repp(): reading file `latex.rpp'.
read-repp(): reading file `ascii.rpp'.
read-repp(): reading file `html.rpp'.
read-repp(): reading file `wiki.rpp'.
read-repp(): reading file `lgt.rpp'.
read-repp(): reading file `gml.rpp'.
read-repp(): reading file `robustness.rpp'.
read-repp(): reading file `quotes.rpp'.
read-repp(): reading file `ptb.rpp'.
read-repp(): reading file `lkb.rpp'.
read-repp(): reading file `micro.rpp'.
read-repp(): reading file `tokenizer.rpp'.
read-heads() reading file `rules.hds'.
read-model(): reading file `jhpstg.g.mem'.
[17:09:44] gc-after-hook(): {G#651 N=63M O=0 E=89%} [S=2.4G R=537M].
read-semi(): reading file `erg.smi'.
read-semi(): reading file `hierarchy.smi'.
read-semi(): reading file `abstract.smi'.
read-semi(): reading file `surface.smi'.
[17:09:46] gc-after-hook(): {L#652 N=85M O=0 E=0%} [S=2.4G R=537M].
[17:09:46] gc-after-hook(): {L#653 N=89M O=5.0M E=83%} [S=2.4G R=537M].
read-vpm(): reading file `semi.vpm'.
read-vpm(): reading file `abstract.vpm'.
;   Loading /home/user/logon/lingo/erg/lkb/mt.lsp
read-transfer-rules(): reading file `paraphraser.mtr'.
read-transfer-rules(): reading file `idioms.mtr'.
[17:09:47] gc-after-hook(): {L#654 N=86M O=11M E=75%} [S=2.4G R=537M].
read-transfer-rules(): reading file `trigger.mtr'.
[17:09:48] gc-after-hook(): {L#655 N=70M O=20M E=68%} [S=2.4G R=537M].
read-transfer-rules(): reading file `generation.mtr'.

Building rule filter
[17:09:51] gc-after-hook(): {L#656 N=26M O=49M E=89%} [S=2.4G R=566M].
[17:09:51] gc-after-hook(): 173M tenured; forcing global gc().
[17:09:52] gc-after-hook(): {GR#11 N=17M O=0 E=100%} [S=2.4G R=574M].
[17:09:59] gc-after-hook(): {L#657 N=22M O=0 E=4%} [S=2.4G R=574M].

Building lr connections table
Constructing lr table for non-morphological rules
Grammar input complete
NIL
TSNLP(13):
  275 skeletons in `/home/user/logon/lingo/lkb/src/tsdb/skeletons/english/'.
NIL
TSNLP(14):
[t40002] BEGIN
[t40003] BEGIN
[t40004] BEGIN
[t40005] BEGIN
[t40006] BEGIN
[t40007] BEGIN
[t40008] BEGIN
[t40009] BEGIN
[t40002] EOF
  [17:10:21] wait-for-clients(): client exit on `100932dfc896' <40002>
[t40004] EOF
  [17:10:21] wait-for-clients(): client exit on `100932dfc896' <40004>
[t40003] EOF
  [17:10:21] wait-for-clients(): client exit on `100932dfc896' <40003>
[t40005] EOF
  [17:10:21] wait-for-clients(): client exit on `100932dfc896' <40005>
[t40006] EOF
  [17:10:21] wait-for-clients(): client exit on `100932dfc896' <40006>
[t40007] EOF
  [17:10:21] wait-for-clients(): client exit on `100932dfc896' <40007>
[t40009] EOF
  [17:10:21] wait-for-clients(): client exit on `100932dfc896' <40009>
[t40008] EOF
  [17:10:21] wait-for-clients(): client exit on `100932dfc896' <40008>
cat: /tmp/.parse.tmp..3035: No such file or directory

It seems the problem is related to PVM, something that I didn’t install?


#4

I am having a hard time getting something like this to run at my end. I’m not sure why, but the logon/lingo/erg/ directory is somewhat inconsistant in the LOGON tree I have, apparently.

The messages you are seeing suggest that ACE is crashing immediately upon being spawned by [incr tsdb()]. Perhaps try something like this:

logon$ bin/answer --erg --compile

logon$ echo “The dog slept.” | bin/answer --erg

If you get a parse result from that, ACE is working properly. At my end, I had to use --terg instead of --erg, to switch to the logon/lingo/terg/ version, which wasn’t corrupted. You could try that too (which would be --terg+tnt/ace with the ./parse script)?