Replication data and results for the paper "Keyness in song lyrics. Challenges of highly clumpy data"

Contributing person
datacite.contributor.ProjectLeader

Meier-Vieracker, Simon (orcid: 0000-0002-0141-9327)

Contributing person
datacite.contributor.RightsHolder

TU Dresden

Countries to which the data refer
datacite.geolocation.iso3166

GERMANY

Description of the data
datacite.resourceType

Corpus (Songkorpus) Results of keyword analysis Code to obtain results

Type of the data
datacite.resourceTypeGeneral

Text

Type of the data
datacite.resourceTypeGeneral

Dataset

Type of the data
datacite.resourceTypeGeneral

Software

Total size of the dataset
datacite.size

178370889

Author
dc.contributor.author

Frommherz, Yannick

Author
dc.contributor.author

Langenhorst, Jan

Author
dc.contributor.author

Meier-Vieracker, Simon

Upload date
dc.date.accessioned

2023-05-08T09:56:16Z

Publication date
dc.date.available

2023-05-08T09:56:16Z

Publication date
dc.date.available

2026-06-05T13:12:54Z

Data of data creation
dc.date.created

2022-2023

Publication date
dc.date.issued

2023-05-08

Abstract of the dataset
dc.description.abstract

Replication data and results for Langenhorst, Jan/Frommherz, Yannick/Meier-Vieracker, Simon: Keyness in song lyrics. Challenges of highly clumpy data. In: Journal for Language Technology and Computational Linguistics. This data set contains * Python module used to obtain results presented in paper * Jupyter Notebooks to generate results and create plots * Complete keyword/key-ngram lists * A 'shuffled' version of the songkorpus used in our analysis (replication of ngrams not possible, only keywords)

Public reference to this page
dc.identifier.uri

https://opara.zih.tu-dresden.de/handle/123456789/2543

Public reference to this page
dc.identifier.uri

https://doi.org/10.25532/OPARA-220

dc.language
dc.language

eng

Publisher
dc.publisher

Technische Universität Dresden

Licence
dc.rights

Attribution 4.0 International

URI of the licence text
dc.rights.uri

http://creativecommons.org/licenses/by/4.0/

Specification of the discipline(s)
dc.subject.classification

1::11::104

Specification of the discipline(s)
dc.subject.classification

4::44::408

Specification of the discipline(s)
dc.subject.classification

4::44::409

Title of the dataset
dc.title

Replication data and results for the paper "Keyness in song lyrics. Challenges of highly clumpy data"

Project title
opara.project.title

Keyness in song lyrics. Challenges of highly clumpy data.

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
songkorpus-KLD-keywords.zip
Size:
170.11 MB
Format:
Description:
Replication data
Attribution 4.0 International