Several branches of computer vision heavily rely (but we could even say depend) on the availability of large datasets of labelled images. While such labeling is usually done by hand, a powerful help can be obtained from Internet and its related tools. In this paper we address the problem of automatically generating a set of images representing an object class, given the name of the class. We exploit semantic technologies, such as lexical resources and ontologies, in order to improve the search performances by using a standard web search engine. We will also discuss an application to the automatic building of a training set for a classification framework. Preliminary experiments are provided for 10 classes from the public CalTech256 dataset and results show an average increment in classification accuracy of about 10%. © 2013 ACM.
"Tell me more": How semantic technologies can help refining internet image search
Publication type:
Contributo in atti di convegno
Source:
Workshop on Video and Image Ground Truth in Computer Vision Applications (VIGTA '13), San Pietroburgo, 15/07/2013
info:cnr-pdr/source/autori:Setti, Francesco; Porello, Daniele; Ferrario, Roberta; Abdulhak, Sami Abduljalil; Cristani, Marco/congresso_nome:Workshop on Video and Image Ground Truth in Computer Vision Applications (VIGTA '13)/congresso_luogo:San Pietroburg
Date:
2013
Resource Identifier:
http://www.cnr.it/prodotto/i/298465
https://dx.doi.org/10.1145/2501105.2501110
info:doi:10.1145/2501105.2501110
http://www.scopus.com/record/display.url?eid=2-s2.0-84885230992&origin=inward
urn:isbn:9781450321693
Language:
Eng