Celebrity Corpus

The “Celebrity” corpus consists of 150 news articles annotated with three semantic relations of the biographic domain. The corpus is provided in two formats, a CoNLL-like format (plain-text files with tabular-separated values) and an XML-based format. Files in the XML-based format can be loaded with the Recon tool.

Use

The DFKI Celebrity Corpus is released as CC-BY NC 4.0. If you use this data, you should cite the accompanying paper:

Annotating Relation Mentions in Tabloid Press. Hong Li, Sebastian Krause, Feiyu Xu, and Hans Uszkoreit. Proceedings of LREC, 2014. (bib) (pdf)

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
celebrity-corpus-v1.1-2013-10-08		celebrity-corpus-v1.1-2013-10-08
.gitattributes		.gitattributes
README		README
README.MD		README.MD
celebrity-corpus-v1.1-2013-10-08.zip		celebrity-corpus-v1.1-2013-10-08.zip
paper.bib		paper.bib
paper.pdf		paper.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

celebrity-corpus-v1.1-2013-10-08

celebrity-corpus-v1.1-2013-10-08

.gitattributes

.gitattributes

README

README

README.MD

README.MD

celebrity-corpus-v1.1-2013-10-08.zip

celebrity-corpus-v1.1-2013-10-08.zip

paper.bib

paper.bib

paper.pdf

paper.pdf

Repository files navigation

Celebrity Corpus

Use

About

Releases

Packages

Languages

DFKI-NLP/celebrity-corpus

Folders and files

Latest commit

History

Repository files navigation

Celebrity Corpus

Use

About

Topics

Resources

Stars

Watchers

Forks

Languages