Talaia is a platform for monitoring social media and digital press. A configurable crawler gathers content related to user-defined domains or topics. The crawled data is processed by the EliXa Sentiment Analysis system. A Django-powered interface provides data visualization to support user-driven analysis of the data. This paper presents the architecture of the system and describes its components in detail. To demonstrate the validity of the approach, two real use cases are presented: one in the cultural domain and one in the political domain. An evaluation of the sentiment analysis task in both scenarios is also provided, showing the system's capacity for domain adaptation.