The Annotated CBC4Kids Reading Comprehension Corpus

About
- A corpus of English news, provided by CBC and targeted at teenagers
- Collected and annotated by MITRE with questions and their correct answers, similar to remedial tests
- Converted to XML
- Enriched with multi-layer markup (POS, parse trees, lemmata etc.) by the question answering group at Edinburgh
- Purpose: automatic reading comprehension evaluation
Publications
- Leidner, Jochen L., Tiphaine Dalmas, Bonnie Webber, Johan Bos and Claire Grover (2003). Automatic Multi-Layer Corpus Annotation for Evaluating Question Answering Methods: CBC4Kids. Proceedings of the Third Workshop on Linguistically Interpreted Corpora (LINC-3) held at the Tenth Annual Meeting of the European Chapter of the Association for Computational Linguistics 2003 (EACL'03) Budapest, Hungary, pp. 39-46. [PDF] [slides][BibTeX]
- Dalmas, Tiphaine, Jochen L. Leidner, Bonnie Webber, Claire Grover and Johan Bos (2003). Annotated Corpora for Reading Comprehension and Question Answering Evaluation. Proceedings of the Workshop on Question Answering held at the Tenth Annual Meeting of the European Chapter of the Association for Computational Linguistics 2003 (EACL'03), Budapest, Hungary, pp. 13-19. [PDF] [BibTeX]
- Dalmas, Tiphaine, Jochen L. Leidner, Bonnie Webber, Claire Grover and Johan Bos (2004). Annotating CBC4Kids: A Corpus for Reading Comprehension and Question Answering Evaluation. Technical Report EDI-INF-RR-0204, School of Informatics, University of Edinburgh.
Availability
- CBC4Kids is freely available for research purposes from The MITRE Corporation
- To obtain the annotated CBC4Kids corpus, please contact Dr. Lisa Ferro at:
Links
- The MITRE Corporation
- The Canadian Broadcasting Corporation (CBC)
page maintained by Jochen L. Leidner