Data

Title: Sports-Related Emotion Corpus (SREC)
Contributors: Valentina Sintsova and Pearl Pu
Description:

This dataset contains a set of sports-related tweets manually annotated with emotion categories. The annotation was performed by workers from the Amazon Mechanical Turk platform. This dataset was the basis for the OlympLex emotion lexicon.

More details on the collection and annotation process can be found in: 
Valentina Sintsova, Claudiu Musat, and Pearl Pu. Fine-Grained Emotion Recognition in Olympic Tweets Based on Human Computation. In Proceedings of the NAACL/HLT Workshop on Computational Approaches to Subjectivity and Sentiment Analysis (WASSA), ACL, 2013.

Unfortunately, by Twitter terms of service, we cannot share this dataset directly. Thus, we share those tweets via their identifiers. In the current distribution (version 1.2), we provide the annotation for 1265 tweets for which we have the Twitter identifiers, instead of all 1957 tweets used in the paper.

Links: Download SREC
Title: Pseudo-labeled Emotional Data: EMO-Hash Data
Contributors: Valentina Sintsova and Pearl Pu
Description:

Tweets with explicit emotional hashtags

We collected 17.6 million tweets with explicit emotional hashtags corresponding to the GEW emotion categories, by using Twitter Streaming API between 27th February and 26th May of 2014. Among them, we extracted 1,729,980 tweets that had those hashtags at the end of the text, were not repeated, were no retweets, did not contain URLs, and were assigned to only one emotion category. Using 500,000 of these pseudo-labeled tweets, we built the PMI-Hash emotion lexicon, as described above.

Unfortunately, by Twitter terms of service, we cannot share this dataset directly. Thus, we share those tweets via their identifiers.

Links: Download EMO-Hash
Title: Food and Mood Hashtag Dataset (FMHashtag2016)
Contributors: Onur Yürüten and Pearl Pu
Description:

This dataset was curated from Twitter using keywords we have obtained from the emotion lexicons curated in an earlier study. We distribute this dataset in Parquet format (75.5 MB, approximately 41000 tweets)

Links: Download FMHashtag2016
Title: Food and Mood Emoji Dataset (FMEmoji)
Contributors: Onur Yürüten and Pearl Pu
Description:

This dataset was curated from Twitter using the popular emojis used in the social media website.

We distribute this dataset in Parquet format (3.13 GB, approximately 4M tweets)

Links: Download FMEmoji