Categories: CODE-MM - ITEC Publications [rss] [bib] [xml]

2015
[29]	Manfred Jürgen Primus, Klaus Schoeffmann, Laszlo Böszörmenyi, Instrument Classification in Laparoscopic Videos, In 13th International Workshop on Content-Based Multimedia Indexing (Tomas Skopal, Jakub Lokoc, eds.), IEEE Computer Society, Los Alamitos, CA, USA, pp. 1-6, 2015. [bib][url] [doi] [abstract] Abstract: In medical endoscopy more and more surgeons record videos of their interventions in a long-term storage archive for later retrieval. In order to allow content-based search in such endoscopic video archives, the video data needs to be indexed first. However, even the very basic step of content-based indexing, namely content segmentation, is already very challenging due to the special characteristics of such video data. Therefore, we propose to use instrument classification to enable semantic segmentation of laparoscopic videos. In this paper, we evaluate the performance of such an instrument classification approach. Our results show satisfying performance for all instruments used in our evaluation.
[28]	Jakub Lokoc, Bernd Münzer, Klaus Schoeffmann, Manfred del Fabro, Manfred Jürgen Primus, Tomas Skopal, Jan Lansky, What are the Salient Keyframes in Short Casual Videos? An Extensive User Study using a new Video Dataset, In Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW) (Matteo Cesana, ed.), IEEE, Los Alamitos, CA, pp. 1-6, 2015. [bib]
2014
[27]	M Zaharieva, M Riegler, M Del Fabro, Multimodal Synchronization of Image Galleries, In Working Notes Proceedings of the MediaEval 2014 Workshop (F De Natale, V Mezaris, N Conci, eds.), CEUR-WS, Vol-1263, pp. 1-2, 2014. [bib]
[26]	M Zaharieva, M Schopfhauser, M Del Fabro, M Zeppelzauer, Clustering and Retrieval of Social Events in Flickr, In Working Notes Proceedings of the MediaEval 2014 Workshop (G Petkos, S Papadopoulos, G Rizzo, V Mezaris, R Troncy, eds.), CEUR-WS, Vol-1263, pp. 1-2, 2014. [bib]
[25]	Claudiu Cobarzan, Marco Andrea Hudelist, Manfred Del Fabro, Content-Based Video Browsing with Collaborating Mobile Clients, In MultiMedia Modeling, 20th Anniversary International Conference (C Gurrin, F Hopfgartner, W Hurst, H Johansen, H Lee, N O'Connor, eds.), Springer, Berlin, Germany, pp. 402-406, 2014. [bib]
2013
[24]	Manfred Del Fabro, Laszlo Böszörmenyi, State-of-the-art and future challenges in video scene detection: a survey, In Multimedia Systems, Springer-Verlag, vol. 19, no. 5, Berlin, Heidelberg, New York, pp. 427-454, 2013. [bib]
[23]	Matthias Zeppelzauer, Maia Zaharieva, Manfred Del Fabro, Unsupervised Clustering of Social Events, In MediaEval 2013 - Multimedia Benchmark Workshop (Martha Larson, Xavier Anguera, Timo Reuter, Gareth Jones, Bogdan Ionescu, Markus Schedl, Tomas Piatrik, Claudia Hauff, Mohammad Soleymani, eds.), CEUR-WS.org/Vol-1043, Aachen, Germany, pp. 1-2, 2013. [bib] [pdf]
[22]	Klaus Schoeffmann, David Ahlström, Werner Bailer, Claudiu Cobarzan, Frank Hopfgartner, Kevin McGuinness, Cathal Gurrin, Christian Frisson, Duy-Dinh Le, Manfred Del Fabro, Hongliang Bai, Wolfgang Weiss, The Video Browser Showdown: a live evaluation of interactive video search tools, In International Journal of Multimedia Information Retrieval, Springer, Berlin, Germany, pp. 1-15, 2013. [bib]
[21]	Manfred Jürgen Primus, Klaus Schoeffmann, Laszlo Böszörmenyi, Segmentation of Recorded Endoscopic Videos by Detecting Significant Motion Changes, In 11th International Workshop on Content-Based Multimedia Indexing (Laszlo Czuni, ed.), IEEE Computer Society, Los Alamitos, CA, USA, pp. 223-228, 2013. [bib] [pdf] [abstract] Abstract: In the medical domain it has become common to store recordings of endoscopic surgeries or procedures. The storage of these endoscopic videos provides not only evidence of the work of the surgeons but also facilitates research, the training of new surgeons and supports explanations to the patients. However, an endoscopic video archive, where tens or hundreds of new videos are added each day, needs content-based analysis in order to provide content-based search. A fundamental first step in content analysis is the segmentation of the video. We propose a new method for segmentation of endoscopic videos, based on spatial and temporal differences of motion in these videos. Through an evaluation with 20 videos we show that our approach provides reasonable performance.
[20]	Manfred Del Fabro, Klaus Schoeffmann, Mario Guggenberger, Mario Taschwer, A Filtering Tool to Support Interactive Search in Internet Video Archives, In 11th International Workshop on Content-Based Multimedia Indexing (Laszlo Czuni, ed.), IEEE Computer Society, Los Alamitos, CA, USA, pp. 7-10, 2013. [bib]
[19]	Manfred Del Fabro, Bernd Münzer, Laszlo Böszörmenyi, AAU Video Browser with Augmented Navigation Bars, In Advances in Multimedia Modeling (Shipeng Li, Abdulmotaleb El-Saddik, Meng Wang, Tao Mei, Nicu Sebe, Shuicheng Yan, Richang Hong, Cathal Gurrin, eds.), Springer, Berlin Heidelberg, pp. 544-546, 2013. [bib] [doi] [abstract] Abstract: We present an improved version of last year’s winner of the Video Browser Showdown. In a preprocessing step video segments are detected and clustered in several latent classes of similar content based on color and motion information. The navigation bars of our video browser are then augmented with different colors indicating where elements of the detected clusters are located. As humans are able to classify the content of clusters fast, they can benefit from this information when browsing through a video.
[18]	Manfred Del Fabro, Bernd Münzer, Laszlo Böszörmenyi, Smart Video Browsing With Augmented Navigation Bars, In Advances in Multimedia Modeling (Shipeng Li, Abdulmotaleb El-Saddik, Meng Wang, Tao Mei, Nicu Sebe, Shuicheng Yan, Richang Hong, Cathal Gurrin, eds.), Springer, Berlin Heidelberg, pp. 88-98, 2013. [bib] [doi] [abstract] Abstract: While accuracy and speed get a lot of attention in video retrieval research, the investigation of interactive retrieval tools gets less attention and is often regarded as trivial. We want to show that even simple ideas have potential to improve the retrieval performance by giving some automated support to the browsing user. We present a video browsing concept where video segments are clustered in several latent classes of similar content. The navigation bars of our video browser are augmented with different colors indicating where elements of these clusters are located. As humans are able to classify the content of clusters fast, they can benefit from this information when browsing a video. We present a study where we investigated how humans can be supported in different video browsing tasks with a color-based and a motion-based clustering of video content.
[17]	Werner Bailer, Klaus Schoeffmann, David Ahlström, Wolfgang Weiss, Manfred Del Fabro, Interactive Evaluation of Video Browsing Tools, In Advances in Multimedia Modeling (Shipeng Li, Abdulmotaleb El-Saddik, Meng Wang, Tao Mei, Nicu Sebe, Shuicheng Yan, Richang Hong, Cathal Gurrin, eds.), Springer, Berlin Heidelberg, pp. 81-91, 2013. [bib] [doi] [abstract] Abstract: The Video Browser Showdown (VBS) is a live competition for evaluating video browsing tools regarding their efficiency at known-item search (KIS) tasks. The first VBS was held at MMM 2012 with eight teams working on 14 tasks, of which eight were completed by expert users and six by novices. We describe the details of the competition, analyze results regarding the performance of tools, the differences between the tasks and the nature of the false submissions.
2012
[16]	Mathias Lux, Jochen Huber, Why did you record this video? An exploratory study on user intentions for video production, In Image Analysis for Multimedia Interactive Services (WIAMIS), 2012 13th International Workshop on (Noel O'Connor, Petros Daras, Fernando Pereira, eds.), IEEE, Los Alamitos, CA, USA, pp. 1-4, 2012. [bib] [doi] [abstract] Abstract: Why do people record videos and share them? While the question seems to be simple, user intentions have not yet been investigated for video production and sharing. A general taxonomy would lead to adapted information systems and multimedia interfaces tailored to the users' intentions. We contribute (1) an exploratory user study with 20 participants, examining the various facets of user intentions for video production and sharing in detail and (2) a novel set of user intention clusters for video production, grounded empirically in our study results. We further reflect existing work in specialized domains (i.e. video blogging and mobile phone cameras) and show that prevailing models used in other multimedia fields (e.g. photography) cannot be used as-is to reason about video recording and sharing intentions.
[15]	Anita Sobe, Wilfried Elmenreich, Manfred Del Fabro, Self-organizing content sharing at social events, In European Meeting on Cybernetics and Systems Research Book of Abstracts (Robert Bichler, Stefan Blachfellner, Wolfgang Hofkirchner, eds.), EMCSR, Vienna, pp. 197-200, 2012. [bib][url]
[14]	Klaus Schoeffmann, David Ahlström, Using a Cylindrical Interface for Image Browsing to Improve Visual Search Performance, In Proceedings of The 13th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS 2012) (Noel O'Connor, Petros Daras, Fernando Pereira, eds.), IEEE, Los Alamitos, CA, USA, pp. 1-4, 2012. [bib] [abstract] Abstract: In this paper we evaluate a 3D cylindrical interface that arranges image thumbnails by visual similarity for the purpose of image browsing. Through a user study we compare the performance of this interface to the performance of a common scrollable 2D list of thumbnails in a grid arrangement. Our evaluation shows that the 3D Cylinder interface enables significantly faster visual search and is the preferred search interface for the majority of tested users.
[13]	Klaus Schoeffmann, Marco Andrea Hudelist, Gerald Schaefer, Manfred Del Fabro, Mobile Image Browsing on a 3D Globe, In Proceedings of the 2nd ACM International Conference on Multimedia Retrieval (H S Ip Horace, Yong Rui, eds.), ACM, New York, NY, USA, pp. 61:1-61:2, 2012. [bib][url] [doi] [abstract] Abstract: With users increasingly using their mobile devices such as smartphones as digital photo albums, effective methods for managing these collections are becoming increasingly important. Standard solutions provide only limited facilities for organising, browsing and searching image collections on mobile devices, making it challenging and time-consuming to locate images of interest. In this demo paper, we present an intuitive interface for organising and browsing image collections on mobile devices. Images are arranged on a 3D globe according to colour similarity. To avoid image overlap image thumbnails are placed on a regular grid structure while large image collections are organised using a hierarchical data structure. Through multi-touch user interaction image browsing can be performed in an intuitive and effective manner.
[12]	Klaus Schoeffmann, David Ahlström, Laszlo Böszörmenyi, 3D Storyboards for Interactive Visual Search, In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME 2012) (Jian Zhang, Dan Schonfeld, David Dagan Feng, Jianfei Cai Nanyang, Alan Hanjalic, Enrico Magli, Mark Pickering, Gerald Friedland, Xian-Sheng Hua, eds.), IEEE Computer Society, Los Alamitos, CA, USA, pp. 848-853, 2012. [bib] [abstract] Abstract: Interactive image and video search tools typically use a grid-like arrangement of thumbnails for preview purpose. Such a display, which is commonly known as storyboard, provides limited flexibility at interactive search and it does not optimally exploit the available screen estate. In this paper we design and evaluate alternatives to the common two-dimensional storyboard. We take advantage of 3D graphics in order to present image thumbnails in cylindrical arrangements. Through a user study we evaluate the performance of these interfaces in terms of visual search time and subjective performance.
[11]	Alexander Müller, Mathias Lux, Laszlo Böszörmenyi, The video summary GWAP: summarization of videos based on a social game, In Proceedings of the 12th International Conference on Knowledge Management and Knowledge Technologies (Stefanie Lindstaedt, Michael Granitzer, eds.), ACM, New York, NY, USA, pp. 15:1-15:7, 2012. [bib][url] [doi]
[10]	Christopher Mueller, Martin Smole, Klaus Schoeffmann, A Demonstration of A Hierarchical Multi-Layout 3D Video Browser, In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME 2012) (Jian Zhang, Dan Schonfeld, David Dagan Feng, Jianfei Cai Nanyang, Alan Hanjalic, Enrico Magli, Mark Pickering, Gerald Friedland, Xian-Sheng Hua, eds.), IEEE Computer Society, Los Alamitos, CA, USA, pp. 665, 2012. [bib] [pdf] [abstract] Abstract: This paper demonstrates a novel 3D Video Browser (3VB) that enables interactive search within a single video as well as video collections by utilizing 3D projection and an intuitive interaction. The browsing approach is based on hierarchical search, which means that the user can split a video into several segments. The 3VB disposes a convenient interface that allows flexible arrangement of video segments in the 3D space. It allows for concurrent playback of video segments and flexible inspection of these segments at any desired level of detail through convenient user interaction.
[9]	Oge Marques, Mathias Lux, Visual information retrieval using Java and LIRE, In Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval (William Hersh, Jamie Callan, Yoelle Maarek, Mark Sanderson, eds.), ACM, New York, NY, USA, pp. 1193-1193, 2012. [bib][url] [doi]
[8]	Mathias Lux, Mario Taschwer, Oge Marques, Classification of photos based on good feelings: ACM MM 2012 multimedia grand challenge submission, In Proceedings of the 20th ACM international conference on Multimedia (Kiyoharu Aizawa, Noboru Babaguchi, John Smith, eds.), ACM, New York, NY, USA, pp. 1367-1368, 2012. [bib][url] [doi]
[7]	Mathias Lux, Mario Taschwer, Oge Marques, A closer look at photographers' intentions: a test dataset, In Proceedings of the ACM multimedia 2012 workshop on Crowdsourcing for multimedia (Kiyoharu Aizawa, Noboru Babaguchi, John Smith, eds.), ACM, New York, NY, USA, pp. 17-18, 2012. [bib][url] [doi]
[6]	Mathias Lux, Mario Guggenberger, Alexander Müller, Finding Image Regions with Human Computation and Games with a Purpose, In Proceedings of the Eighth Artificial Intelligence and Interactive Digital Entertainment International Conference (AIIDE 2012) (Mark Riedl, Gita Sukthankar, eds.), Association for the Advancement of Artificial Intelligence (AAAI Press), Palo Alto, California, USA, pp. 220, 2012. [bib][url] [abstract] Abstract: Manual image annotation is a tedious and time-consuming task, while automated methods are error prone and limited in their results. Human computation, and especially games with a purpose, have shown potential to create high quality annotations by "hiding the complexity" of the actual annotation task and employing the "wisdom of the crowds". In this demo paper we present two games with a single purpose: finding regions in images that correspond to given terms. We discuss approach, implementation, and preliminary results of our work and give an outlook to immediate future work.
[5]	Marian Kogler, Mathias Lux, Robust image retrieval using bag of visual words with fuzzy codebooks and fuzzy assignment, In i-KNOW '12 Proceedings of the 12th International Conference on Knowledge Management and Knowledge Technologies (Stefanie Lindstaedt, ed.), ACM, New York, NY, USA, pp. 34.1 - 34.4, 2012. [bib][url] [doi] [abstract] Abstract: Content-based retrieval systems leverage low level features such as color, texture or local information of images to find similar images to a respective query image. In recent years the Bag of Visual Words (BoVW) approach, which relies on quantized visual information around local image patches, has gained importance in image retrieval. In this paper we focus on fuzzy algorithms, in order to improve the descriptiveness of image descriptors. We extend the BoVW approach by applying fuzzy clustering and fuzzy assignment to take a step towards more effective visual descriptors, which are matched against each other in content-based similarity searches.