
Raw text corpora were kindly provided or licensed by ECI, DSL, Oxford University, Linguateca, ATILF, NILC, the Danish parliament and others. For a full list of corpus credits and references see our copyright page, which is also linked from the individual corpus pages.
Grammatical corpus annotation, both morphosyntactic tags (CG) and tree-structures (PSG), was performed with Eckhard Bick's VISL parsers: PALAVRAS (Portuguese), PALAVRAS-HIS (Spanish), DanGram (Danish), GerGram (German), EspGram (Esperanto), ItaGram (Italian) and FrAG (French), which are all accessible online (including file upload service). Annotation of German and English also involved morphological, and in the latter case syntactic, modules licensed by Lingsoft. For French, the PoS-tagging stage used H. Schmid's and A. Stein's DecisionTreeTagger. For Romanian, the morphological annotation was performed with Dan Tufis' probabilistic MSD tagger.
Treebank revision was supervised work involving, among others, the following VISL-students: Susanna Afonsoand Raquel Marchi (Portuguese), Ina Størner Rasmussen, Camilla Pedersen, Dorte Lønsmann and Kim Ebensgaard Jensen (Danish), and Ane Dybro Johansen (French). The treebank projects had funding support by Linguateca (Portuguese), The Nordic Council of Ministers (Danish) and ATILF (French).
More information on the VISL project as well as live grammatical analysis and a number of grammar teaching tools are available at the VISL main site or its research oriented beta version.