Linguistic Data Consortium (LDC) Dataset archive

Collection items

2015-2016 CoNLL Shared Task
LDC_Catalogue_ID: LDC2017T13 Downloaded 09/01/2020

Shared with the University by
Miss Eleonora Gandolfi
Multilingual ATIS
LDC_Catalogue_ID: LDC2019T04 Downloaded 09/01/2020

Shared with the University by
Miss Eleonora Gandolfi
BOLT Chinese-English Word Alignment and Tagging -- SMS/Chat Training
LDC_Catalogue_ID: LDC2019T13 Downloaded 09/01/2020

Shared with the University by
Miss Eleonora Gandolfi
BOLT English Treebank - Discussion Forum
LDC_Catalogue_ID: LDC2019T15 Downloaded 09/01/2020

Shared with the University by
Miss Eleonora Gandolfi
TimeBank 1.2
LDC_Catalogue_ID: LDC2006T08 Downloaded 09/01/2020

Shared with the University by
Miss Eleonora Gandolfi
Unified Linguistic Annotation Text Collection
LDC_Catalogue_ID: LDC2009T07 Downloaded 28/11/2019

Shared with the University by
Miss Eleonora Gandolfi
FactBank 1.0
LDC_Catalogue_ID: LDC2009T23 Downloaded 29/11/2019

Shared with the University by
Miss Eleonora Gandolfi
SemEval-2010 Task 1 OntoNotes English: Coreference Resolution in Multiple Languages
LDC_Catalogue_ID: LDC2011T01 Downloaded 28/11/2019

Shared with the University by
Miss Eleonora Gandolfi
OntoNotes Release 5.0
LDC_Catalogue_ID: LDC2013T19 Downloaded 14/06/2019

Shared with the University by
Miss Eleonora Gandolfi
Penn Discourse Treebank Version 3.0
LDC_Catalogue_ID: LDC2019T05 Downloaded 28/11/2019

Shared with the University by
Miss Eleonora Gandolfi
Treebank-3 (Three "map" files are available in a compressed file)
LDC_Catalogue_ID: LDC99T42 Downloaded 06/06/2019

Shared with the University by
Miss Eleonora Gandolfi
2007 CoNLL Shared Task - Arabic & English
LDC_Catalogue_ID: LDC2018T08 Downloaded 28/11/2019

Shared with the University by
Miss Eleonora Gandolfi
DEFT Chinese Committed Belief Annotation
LDC_Catalogue_ID: LDC2019T03 Downloaded 28/11/2019

Shared with Selected Users by
Miss Eleonora Gandolfi
2009 CoNLL Shared Task Part 1
LDC_Catalogue_ID: LDC2012T03 Downloaded 28/11/2019

Shared with the University by
Miss Eleonora Gandolfi
Machine Reading Phase 1 NFL Scoring Training Data
LDC_Catalogue_ID: LDC2019T14 Downloaded 28/11/2019

Shared with the University by
Miss Eleonora Gandolfi
Treebank-3
LDC_Catalogue_ID: LDC99T42 Downloaded 06/06/2019

Shared with the University by
Miss Eleonora Gandolfi
2006 CoNLL Shared Task - Ten Languages
LDC_Catalogue_ID: LDC2015T11 Downloaded 28/11/2019

Shared with the University by
Miss Eleonora Gandolfi
Phrase Detectives Corpus Version 2
LDC_Catalogue_ID: LDC2019T10 Downloaded 28/11/2019

Shared with the University by
Miss Eleonora Gandolfi
DEFT English Committed Belief Annotation
LDC_Catalogue_ID: LDC2019T16 Downloaded 28/11/2019

Shared with the University by
Miss Eleonora Gandolfi
Treebank-3 (Coordination Annotation for the Penn Treebank)
LDC_Catalogue_ID: LDC99T42 Downloaded 06/06/2019

Shared with the University by
Miss Eleonora Gandolfi
2006 CoNLL Shared Task - Arabic & Czech
LDC_Catalogue_ID: LDC2015T12 Downloaded 28/11/2019

Shared with the University by
Miss Eleonora Gandolfi
DEFT Spanish Committed Belief Annotation
LDC_Catalogue_ID: LDC2019T09 Downloaded 28/11/2019

Shared with the University by
Miss Eleonora Gandolfi
2007 CoNLL Shared Task - Basque, Catalan, Czech & Turkish
LDC_Catalogue_ID: LDC2018T06 Downloaded 28/11/2019

Shared with the University by
Miss Eleonora Gandolfi
2007 CoNLL Shared Task - Greek, Hungarian & Italian
LDC_Catalogue_ID: LDC2018T07 Downloaded 28/11/2019

Shared with the University by
Miss Eleonora Gandolfi
2009 CoNLL Shared Task Part 2
LDC_Catalogue_ID: LDC2012T04 Downloaded 28/11/2019

Shared with the University by
Miss Eleonora Gandolfi
TIPSTER Complete
LDC_Catalogue_ID: LDC93T3A Downloaded 09/01/2020

Shared with Selected Users by
Miss Eleonora Gandolfi
TAC Relation Extraction Dataset
LDC_Catalogue_ID: LDC2018T24 Downloaded 28/11/2019

Shared with the University by
Miss Eleonora Gandolfi
TAC KBP Entity Discovery and Linking - Comprehensive Evaluation Data 2016-2017
LDC_Catalogue_ID: LDC2019T19 Downloaded 09/01/2020

Shared with the University by
Miss Eleonora Gandolfi
Domain-Specific Hyponym Relations
LDC_Catalogue_ID: LDC2014T07 Downloaded 28/11/2019

Shared with the University by
Miss Eleonora Gandolfi
Noisy TIMIT
LDC_Catalogue_ID: LDC2017S04 Downloaded 09/01/2020

Shared with the University by
Dr Stuart Middleton
USC-SFI MALACH Interviews and Transcripts English – Speech Recognition Edition
LDC_Catalogue_ID: LDC2019S11 Downloaded 09/01/2020

Shared with the University by
Dr Stuart Middleton
TIMIT Acoustic-Phonetic Continuous Speech Corpus
LDC_Catalogue_ID: LDC93S1 Downloaded 24/01/2020

Shared with the University by
Dr Stuart Middleton
Global TIMIT Thai
LDC_Catalogue_ID: LDC2022S13 Downloaded 05/01/2023

Shared with the University by
Dr Stuart Middleton
Global TIMIT Mandarin Chinese
LDC_Catalogue_ID: LDC2021S03 Downloaded 16/12/2021

Shared with the University by
Dr Stuart Middleton
SRI Speech-Based Collaborative Learning Corpus
LDC_Catalogue_ID: LDC2019S01 Downloaded 09/01/2020

Shared with the University by
Dr Stuart Middleton
English Gigaword
LDC_Catalogue_ID: LDC2003T05 Downloaded 21/07/2021

Shared with the University by
Dr Stuart Middleton
BOLT English Treebank - SMS/Chat
LDC_Catalogue_ID: LDC2021T03 Downloaded 16/12/2021

Shared with the University by
Dr Stuart Middleton
BOLT Egyptian Arabic Treebank - SMS/Chat
LDC_Catalogue_ID: LDC2021T17 Downloaded 16/12/2021

Shared with the University by
Dr Stuart Middleton
Second DIHARD Challenge Development - SEEDLingS
LDC_Catalogue_ID: LDC2021S11 Downloaded 16/12/2021

Shared with the University by
Dr Stuart Middleton
BOLT Egyptian Arabic SMS/Chat Parallel Training Data
LDC_Catalogue_ID: LDC2021T15 Downloaded 16/12/2021

Shared with the University by
Dr Stuart Middleton
BOLT Chinese SMS/Chat Parallel Training Data
LDC_Catalogue_ID: LDC2021T11 Downloaded 16/12/2021

Shared with the University by
Dr Stuart Middleton
X-SRL: Parallel Cross-lingual Semantic Role Labeling
LDC_Catalogue_ID: LDC2021T09 Downloaded 16/12/2021

Shared with the University by
Dr Stuart Middleton
BOLT Egyptian Arabic PropBank and Sense -- Discussion Forum, SMS/Chat, and Conversational Telephone Speech
LDC_Catalogue_ID: LDC2021T18 Downloaded 16/12/2021

Shared with the University by
Dr Stuart Middleton
Chinese Abstract Meaning Representation 2.0
LDC_Catalogue_ID: LDC2021T13 Downloaded 16/12/2021

Shared with the University by
Dr Stuart Middleton
TAC KBP English Surprise Slot Filling -- Comprehensive Training and Evaluation Data 2010
LDC_Catalogue_ID: LDC2021T06 Downloaded 16/12/2021

Shared with the University by
Dr Stuart Middleton
Penn Discourse Treebank Version 2.0 - German Translation
LDC_Catalogue_ID: LDC2021T05 Downloaded 16/12/2021

Shared with the University by
Dr Stuart Middleton
BOLT English Translation Treebank - Chinese SMS/Chat
LDC_Catalogue_ID: LDC2021T19 Downloaded 16/12/2021

Shared with the University by
Dr Stuart Middleton
ACE 2005 Multilingual Training Corpus
LDC_Catalogue_ID: LDC2006T06 Downloaded 17/11/2021

Shared with the University by
Dr Stuart Middleton
Second DIHARD Challenge Development - Eleven Sources
LDC_Catalogue_ID: LDC2021S10 Downloaded 16/12/2021

Shared with the University by
Dr Stuart Middleton
ESPADA
LDC_Catalogue_ID: LDC2021T10 Downloaded 16/12/2021

Shared with the University by
Dr Stuart Middleton
TAC KBP English Sentiment Slot Filling -- Comprehensive Training and Evaluation Data 2013-2014
LDC_Catalogue_ID: LDC2021T08 Downloaded 16/12/2021

Shared with the University by
Dr Stuart Middleton
BOLT Egyptian Arabic Treebank - Conversational Telephone Speech
LDC_Catalogue_ID: LDC2021T12 Downloaded 16/12/2021

Shared with the University by
Dr Stuart Middleton
Second DIHARD Challenge Evaluation - SEEDLingS
LDC_Catalogue_ID: LDC2022S07 Downloaded 05/01/2023

Shared with the University by
Dr Stuart Middleton
NUBUC
LDC_Catalogue_ID: LDC2022S04 Downloaded 05/01/2023

Shared with the University by
Dr Stuart Middleton
Abstract Meaning Representation (AMR) Annotation Release 3.0
LDC_Catalogue_ID: LDC2020T02 Downloaded 05/01/2023

Shared with the University by
Dr Stuart Middleton
Spoken Digits in Hindi and Indian English
LDC_Catalogue_ID: LDC2022S03 Downloaded 05/01/2023

Shared with the University by
Dr Stuart Middleton
BOLT English Translation Treebank - Egyptian Arabic SMS/Chat
LDC_Catalogue_ID: LDC2022T06 Downloaded 05/01/2023

Shared with the University by
Dr Stuart Middleton
Qatari Corpus of Argumentative Writing
LDC_Catalogue_ID: LDC2022T04 Downloaded 05/01/2023

Shared with the University by
Dr Stuart Middleton
2017 NIST OpenSAT Pilot - SSSF
LDC_Catalogue_ID: LDC2022S01 Downloaded 05/01/2023

Shared with the University by
Dr Stuart Middleton
Abstract Meaning Representation 2.0 - Four Translations
LDC_Catalogue_ID: LDC2020T07 Downloaded 05/01/2023

Shared with the University by
Dr Stuart Middleton
AttImam
LDC_Catalogue_ID: LDC2022T02 Downloaded 05/01/2023

Shared with the University by
Dr Stuart Middleton
Second DIHARD Challenge Evaluation - Eleven Sources
LDC_Catalogue_ID: LDC2022S06 Downloaded 05/01/2023

Shared with the University by
Dr Stuart Middleton
MASRI Synthetic
LDC_Catalogue_ID: LDC2022S08 Downloaded 05/01/2023

Shared with the University by
Dr Stuart Middleton
Third DIHARD Challenge Evaluation
LDC_Catalogue_ID: LDC2022S14 Downloaded 05/01/2023

Shared with the University by
Dr Stuart Middleton
Third DIHARD Challenge Development
LDC_Catalogue_ID: LDC2022S12 Downloaded 05/01/2023

Shared with the University by
Dr Stuart Middleton
CAMIO Transcription Languages
LDC_Catalogue_ID: LDC2022T07 Downloaded 05/01/2023

Shared with the University by
Dr Stuart Middleton


Datasets downloaded by the LDC contact for the University of Southampton: Stuart E. Middleton (sem03@soton.ac.uk).

Datasets can be used under an LDC License (non-commercial) by all University of Southampton staff and students.

To look up help for a dataset (a description of the dataset and links to samples), go to https://catalog.ldc.upenn.edu/<LDC_catalogue_ID>
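The lookup can be sketched in a few lines of Python. This is only an illustration, assuming each entry keeps the "LDC_Catalogue_ID: <ID> Downloaded DD/MM/YYYY" form used in the listing; the names CATALOG_BASE, ENTRY_RE, parse_entry and catalog_url are hypothetical and not part of any LDC tooling.

```python
import re
from datetime import date, datetime
from typing import Tuple

# Base of the lookup URL given in the archive description.
CATALOG_BASE = "https://catalog.ldc.upenn.edu/"

# Assumed shape of an entry line, e.g.
# "LDC_Catalogue_ID: LDC2017T13 Downloaded 09/01/2020".
# The ID pattern is inferred from the IDs in this listing
# (LDC99T42, LDC93T3A, LDC2017S04), not from an official specification.
ENTRY_RE = re.compile(
    r"LDC_Catalogue_ID:\s*(LDC\d{2,4}[A-Z]\d+[A-Z]?)\s+"
    r"Downloaded\s+(\d{2}/\d{2}/\d{4})"
)


def parse_entry(line: str) -> Tuple[str, date]:
    """Return (catalogue ID, download date) for one entry line."""
    match = ENTRY_RE.search(line)
    if match is None:
        raise ValueError("not a catalogue entry line: %r" % line)
    catalogue_id, downloaded = match.groups()
    # Dates in the listing are day-first (e.g. 28/11/2019).
    return catalogue_id, datetime.strptime(downloaded, "%d/%m/%Y").date()


def catalog_url(catalogue_id: str) -> str:
    """Build the LDC catalogue page URL for a catalogue ID."""
    return CATALOG_BASE + catalogue_id


if __name__ == "__main__":
    cid, when = parse_entry("LDC_Catalogue_ID: LDC2013T19 Downloaded 14/06/2019")
    print(cid, when.isoformat(), catalog_url(cid))
```

Parsing the date with an explicit day-first format avoids misreading entries such as 09/01/2020 as 1 September.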
