World-scale mining of objects and events from community photo collections

Quack, Till; Leibe, Bastian; Gool, Luc Van

doi:10.1145/1386352.1386363

Cited by 236 publications

(250 citation statements)

References 32 publications

(44 reference statements)

Supporting

Mentioning

240

Contrasting

Unclassified

Order By: Relevance

“…In [10,11] tags and visual information together with geo-location are used for object (e.g. monuments) and event extraction.…”

Section: Multi-modal Analysis Approachesmentioning

confidence: 99%

“…However, more advanced techniques and applications [9,10,13,22,23,24,25] have also been presented, capable of processing more kinds of input modalities enabling the spatio-temporal and situational dimension. Scalability is addressed by a wide range of applications; however the amount of works enabling the real-time aspect is still very limited.…”

Section: Real-time Applicationsmentioning

confidence: 99%

See 1 more Smart Citation

Extracting Emergent Semantics from Large-Scale User-Generated Content

Kompatsiaris

Diplaris

Papadopoulos

2012

Advances in Intelligent and Soft Computing

View full text Add to dashboard Cite

Abstract. This paper presents a survey of novel technologies for uncovering implicit knowledge through the analysis of user-contributed content in Web2.0 applications. The special features of emergent semantics are herein described, along with the various dimensions that the techniques should be able to handle. Consequently a series of application domains is given where the extracted information can be consumed. The relevant techniques are reviewed and categorised according to their capability for scaling, multi-modal analysis, social networks analysis, semantic representation, real-time and spatio-temporal processing. A showcase of such an emergent semantics extraction application, namely ClustTour, is also presented, and open issues and future challenges in this new field are discussed.

show abstract

“…In [10,11] tags and visual information together with geo-location are used for object (e.g. monuments) and event extraction.…”

Section: Multi-modal Analysis Approachesmentioning

confidence: 99%

Section: Real-time Applicationsmentioning

confidence: 99%

Extracting Emergent Semantics from Large-Scale User-Generated Content

Kompatsiaris

Diplaris

Papadopoulos

2012

Advances in Intelligent and Soft Computing

View full text Add to dashboard Cite

show abstract

“…Another application that combines textual and visual techniques has been proposed by Quack et al [20]. They developed a system that crawls photos on the internet and identifies clusters of images referring to a common object (physical items on fixed locations), and events (special social occasions taking place at certain times).…”

Section: Combined Analysis Of Geographical Context and Visual Contentmentioning

confidence: 99%

“…Gammeter et al [9] extends this idea towards object-based auto-annotation of holiday photos in a large database that includes landmark buildings, statues, scenes, pieces of art, with help of external resources such as Wikipedia. In both [20] and [9], GPS coordinates are used to pre-cluster objects which may not be always available.…”

Section: Combined Analysis Of Geographical Context and Visual Contentmentioning

confidence: 99%

Geotag propagation in social networks based on user trust model

Ivanov

Vajda

Lee

et al. 2010

Multimed Tools Appl

View full text Add to dashboard Cite

In the past few years sharing photos within social networks has become very popular. In order to make these huge collections easier to explore, images are usually tagged with representative keywords such as persons, events, objects, and locations. In order to speed up the time consuming tag annotation process, tags can be propagated based on the similarity between image content and context. In this paper, we present a system for efficient geotag propagation based on a combination of object duplicate detection and user trust modeling. The geotags are propagated by training a graph based object model for each of the landmarks on a small tagged image set and finding its duplicates within a large untagged image set. Based on the established correspondences between these two image sets and the reliability of the user, tags are propagated from the tagged to the untagged images. The user trust modeling reduces the risk of propagating wrong tags caused by spamming or faulty annotation. The effectiveness of the proposed method is demonstrated through a set of experiments on an image database containing various landmarks.

show abstract

“…But, though they are still developing, vision based methods are quite powerful and when combined with textual methods, very effective automated systems can be achieved. A good application of combined use of textual and visual techniques is proposed by Quack et al in [11]. Objective of the work persented in [11] is to provide a system that automatically forms high quality image databases using the large-scale internet sources.…”

Section: Related Workmentioning

confidence: 99%

Tag Suggestr: Automatic Photo Tag Expansion Using Visual Information for Photo Sharing Websites

Küçüktunç

Sevil

Tosun

et al. 2008

Semantic Multimedia

View full text Add to dashboard Cite

Abstract. In this paper, we propose an automatic photo tag expansion system for the community photo collections, such as Flickr 1 . Our aim is to suggest relevant tags for a target photograph uploaded to the system by a user, by incorporating the visual and textual cues from other related photographs. As the first step, the system requires the user to add only a few initial tags for each uploaded photo. These initial tags are used to retrieve related photos including the same tags in their tag lists. Then the set of candidate tags collected from a large pool of photos is weighted according to the similarity of the target photo to the retrieved photo including the tag. Finally, the tags in the highest rankings are used to automatically expand the tags of the target photo. The experimental results on Flickr photos show that, the use of visual similarity of semantically relevant photos to recommend tags improves the quality of suggested tags compared to only text-based systems.

show abstract

World-scale mining of objects and events from community photo collections

Cited by 236 publications

References 32 publications

Extracting Emergent Semantics from Large-Scale User-Generated Content

Extracting Emergent Semantics from Large-Scale User-Generated Content

Geotag propagation in social networks based on user trust model

Tag Suggestr: Automatic Photo Tag Expansion Using Visual Information for Photo Sharing Websites

Contact Info

Product

Resources

About