Martin Reynaert
Tilburg University & Meertens Institute Amsterdam
The construction of a 500-million-word reference corpus of contemporary written Dutch
N Oostdijk, M Reynaert, V Hoste, I Schuurman
Essential speech and language technology for Dutch, 219-247, 2013
Non-interactive OCR post-correction for giga-scale digitization projects
M Reynaert
International Conference on Intelligent Text Processing and Computational …, 2008
Text induced spelling correction
M Reynaert
COLING 2004: Proceedings of the 20th International Conference on …, 2004
FoLiA: A practical XML format for linguistic annotation – a descriptive and comparative study
M van Gompel, M Reynaert
Computational Linguistics in the Netherlands Journal 3, 63-81, 2013
Character confusion versus focus word-based correction of spelling and OCR variants in corpora
MWC Reynaert
International Journal on Document Analysis and Recognition (IJDAR) 14 (2 …, 2011
From D-Coi to SoNaR: A reference corpus for Dutch
NHJ Oostdijk, M Reynaert, P Monachesi, G Noord, R Ordelman, ...
Marrakech, Morocco: ELRA, 2008
Learning to predict pitch accents and prosodic boundaries in Dutch
E Marsi, M Reynaert, A Van Den Bosch, W Daelemans, V Hoste
41st Annual meeting of the Association for Computational Linguistics, 489-496, 2003
All, and only, the Errors: more Complete and Consistent Spelling and OCR-Error Correction Evaluation.
M Reynaert
LREC, 2008
Balancing SoNaR: IPR versus processing issues in a 500-million-word written Dutch Reference Corpus
M Reynaert, NHJ Oostdijk, OD Clercq, H Heuvel, F Jong
Malta: European Language Resources Association (ELRA), 2010
Nederlab: Towards a single portal and research environment for diachronic Dutch text corpora
H Brugman, M Reynaert, N van der Sijs, R van Stipriaan, ETK Sang, ...
Proceedings of the Tenth International Conference on Language Resources and …, 2016
Historical spelling normalization. A comparison of two statistical methods: TICCL and VARD2
M Reynaert, I Hendrickx, R Marquilhas
Proceedings of the Second Workshop on Annotation of Corpora for Research in …, 2012
Corpus-Induced Corpus Clean-up.
M Reynaert
LREC, 87-92, 2006
Combining information sources for memory-based pitch accent placement
E Marsi, B Busser, W Daelemans, V Hoste, M Reynaert, A Bosch
Seventh International Conference on Spoken Language Processing, 2002
Synergy of Nederlab and @PhilosTEI: diachronic and multilingual Text-Induced Corpus Clean-up
M Reynaert
Paris: European Language Resources Association (ELRA), 2014
Parallel identification of the spelling variants in corpora
M Reynaert
Proceedings of the Third Workshop on Analytics For Noisy Unstructured Text …, 2009
TICCLops: Text-Induced Corpus Clean-up as online processing system
M Reynaert
Dublin, Ireland: Dublin City University and Association for Computational …, 2014
OpenSoNaR: user-driven development of the SoNaR corpus interfaces
M Reynaert, M Camp, M Zaanen
Dublin, Ireland: Dublin City University and Association for Computational …, 2014
FoLiA in Practice. The Infrastructure of a Linguistic Annotation Format
M Gompel, K Sloot, M Reynaert, APJ van den Bosch
London: Ubiquity Press, 2017
On OCR ground truths and OCR post-correction gold standards, tools and formats
M Reynaert
Proceedings of the First International Conference on Digital Access to …, 2014
Multilingual text induced spelling correction
M Reynaert
Proceedings of the Workshop on Multilingual Linguistic Resources, 110-117, 2004