Hi! I have a test corpus with this registry:
##
## registry entry for corpus PRUEBA
##
# long descriptive name for the corpus
NAME "Una pruebáñ"
# corpus ID (must be lowercase in registry!)
ID prueba
# path to binary data files
HOME /opt/cwb/data/prueba
# optional info file (displayed by "info;" command in CQP)
INFO /opt/cwb/data/prueba/.info
# corpus properties provide additional information about the corpus:
##:: charset = "utf8" # character encoding of corpus data
##:: language = "??" # insert ISO code for language (de, en, fr, ...)
##
## p-attributes (token annotations)
##
ATTRIBUTE word
ATTRIBUTE FORM
ATTRIBUTE LEMMA
ATTRIBUTE TAG
ATTRIBUTE SHORT_TAG
ATTRIBUTE MSD
ATTRIBUTE NEC
ATTRIBUTE SENSE
ATTRIBUTE SYNTAX
ATTRIBUTE DEPHEAD
ATTRIBUTE DEPREL
ATTRIBUTE COREF
ATTRIBUTE TOKENID
##
## s-attributes (structural markup)
##
# <text id=".."> ... </text>
STRUCTURE text
STRUCTURE text_id # [annotations]
# <p> ... </p>
STRUCTURE p
# <s> ... </s>
STRUCTURE s
# Yours sincerely, the Encode tool.
And when I do info PRUEBA
I get this output:
Size: 21
Charset: utf8
Properties:
language = '??'
charset = 'utf8'
No further information available about PRUEBA
So I wonder two things:
info
doesn't output more information like ATTRIBUTE
and STRUCTURE
?Thanks
show cd
or usingcwb-describe-corpus -s
on the command line.Thanks for your response!
cwb-describe-corpus -s
works perfectly.For
show cd
I still get incomplete information:You seem to have forgotten to activate the corpus:
but
Thanks, sorry for the mistake