Advanced search

Machine learning for WebSensors: algorithms and applications

Grant number: 14/08996-0
Support type:Regular Research Grants
Duration: August 01, 2014 - July 31, 2016
Field of knowledge:Physical Sciences and Mathematics - Computer Science
Principal Investigator:Solange Oliveira Rezende
Home Institution: Instituto de Ciências Matemáticas e de Computação (ICMC). Universidade de São Paulo (USP). São Carlos, SP, Brazil
Assoc. researchers: Bruno Magalhães Nogueira ; Gustavo Enrique de Almeida Prado Alves Batista ; Ricardo Marcondes Marcacini ; Veronica Oliveira de Carvalho


The popularization of textual content published in web platforms has motivated the development of methods for automatic knowledge extraction from texts. Particularly, a new range of applications and studies have been proposed to use the web as a powerful "social sensor". This allows to identify and monitor events published in news portals and social networks as epidemics detection, sentiment analysis, and political and economic indicators. Currently, the construction of a websensor is a complex task, since it depends of domain specialists to define the parameters of the sensors, i.e., search queries, filters and monitoring textual content from web. Moreover, for some problems there is no comprehension about the phenomenons to monitor, which limits the application of websensors. In this research project we investigate the use of machine learning methods to support the building of websensors. The basic idea is to use a sample of textual document from a problem and apply semi/non supervised learning methods to extract patterns from texts and support the generation of websensors. Thus, we hope to reduce the dependency of specialist domains to define parameters for websensor. Besides, each websensor represents a phenomenon related to a problem and it can be monitored during the time to be used as support to decision making. (AU)

Articles published in Agência FAPESP about the research grant
Algorithms facilitate automated classification of web texts 

Scientific publications
(References retrieved automatically from Web of Science and SciELO through information on FAPESP grants and their corresponding numbers as mentioned in the publications by the authors)
MANZATO, MARCELO G.; DOMINGUES, MARCOS A.; FORTES, ARTHUR C.; SUNDERMANN, CAMILA V.; D'ADDIO, RAFAEL M.; CONRADO, MERLEY S.; REZENDE, SOLANGE O.; PIMENTEL, MARIA G. C. Mining unstructured content for recommender systems: an ensemble approach. INFORMATION RETRIEVAL JOURNAL, v. 19, n. 4, p. 378-415, AUG 2016. Web of Science Citations: 0.
ROSSI, RAFAEL GERALDELI; LOPES, ALNEU DE ANDRADE; REZENDE, SOLANGE OLIVEIRA. Optimization and label propagation in bipartite heterogeneous networks to improve transductive classification of texts. INFORMATION PROCESSING & MANAGEMENT, v. 52, n. 2, p. 217-257, MAR 2016. Web of Science Citations: 4.
CORREA, GERALDO N.; MARCACINI, RICARDO M.; HRUSCHKA, EDUARDO R.; REZENDE, SOLANGE O. Interactive textual feature selection for consensus clustering. PATTERN RECOGNITION LETTERS, v. 52, p. 25-31, JAN 15 2015. Web of Science Citations: 1.

Please report errors in scientific publications list by writing to: