Feature Engineering for Twitter-based Applications

TitleFeature Engineering for Twitter-based Applications
Publication TypeJournal
Year of Publication2017
AuthorsSanjaya Wijeratne, Amit Sheth, Shreyansh Bhatt, Lakshika Balasuriya, Hussein S. Al-Olimat, Gaur, M, Amir Hossein Yazdavar, Krishnaprasad Thirunarayan
IssueFeature Engineering for Machine Learning and Data Analytics, Editors. Guozhu Dong and Huan Liu
Pagination35
Date Published2017
PublisherChapman and Hall. Data Mining and Knowledge Discovery Series
KeywordsDepression, emoji, Emotion Analysis, gang member identification, location extraction, Sentiment Analysis, twitris, twitter, twitter features
Abstract

This chapter presents studies concerning feature engineering for Twitter-based applications. It first discusses how Twitter data can be downloaded from the Twitter Application Programming Interface (API) and the kinds of data available in the downloaded tweets. Then, it discusses various textual features, image and video features, Twitter metadata-related features, and network features that can be extracted. Next, it discusses the uses of different feature types along with an analysis on why certain features perform well in the context of informal short text messages typically found in tweets. It then presents five real-world Twitter applications that utilize the different feature types. For each application, it also highlights the features that perform well in the corresponding application setting. Finally, it concludes the chapter by discussing Twitris, a real-time semantic social web analytics platform that has already been commercialized, and its use of Twitter features.

Full Text Citation

Sanjaya Wijeratne, Amit Sheth, Shreyansh Bhatt, Lakshika Balasuriya, Hussein Al-Olimat, Manas Gaur, Amir Hossein Yazdavar, Krishnaprasad Thirunarayan. "Feature Engineering for Twitter-based Applications", in Feature Engineering for Machine Learning and Data Analytics. Editors. Guozhu Dong and Huan Liu. Chapman and Hall/CRC Data Mining and Knowledge Discovery Series. December 2017.

Projects: 
Context-Aware Harassment Detection on Social Media
Harassment
HazardSEES