In medical endoscopy more and more surgeons record videos of their interventions in a long-term storage archive for later retrieval. In order to allow content-based search in such endoscopic video archives, the video data needs to be indexed first. However, even the very basic step of content-based indexing, namely content segmentation, is already very challenging due to the special characteristics of such video data. Therefore, we propose to use instrument classification to enable semantic segmentation of laparoscopic videos. In this paper, we evaluate the performance of such an instrument classification approach. Our results show satisfying performance for all instruments used in our evaluation.
Los Alamitos, CA, USA
Primus, Manfred Jürgen
Schoeffmann, Klaus
Böszörmenyi, Laszlo
13th International Workshop on Content-Based Multimedia Indexing
10.1109/CBMI.2015.7153616
Skopal, Tomas
Lokoc, Jakub
978-1-4673-6870-4
EN
Prague, Czech Republic
jun
1-6
IEEE Computer Society
2015.06.12
registered
Instrument Classification in Laparoscopic Videos
http://siret.ms.mff.cuni.cz/cbmi2015/
2015
Los Alamitos, CA
Lokoc, Jakub
Münzer, Bernd
Schoeffmann, Klaus
del Fabro, Manfred
Primus, Manfred Jürgen
Skopal, Tomas
Lansky, Jan
Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)
Cesana, Matteo
EN
Turin, Italy
jun
1-6
IEEE
2015.06.29
registered
What are the Salient Keyframes in Short Casual Videos? An Extensive User Study using a new Video Dataset
2015
Vol-1263
Zaharieva, M
Riegler, M
Del Fabro, M
Working Notes Proceedings of the MediaEval 2014 Workshop
De Natale, F
Mezaris, V
Conci, N
EN
Barcelona, Spain
oct
1-2
CEUR-WS
2014.10.16
registered
Multimodal Synchronization of Image Galleries
2014
Vol-1263
Zaharieva, M
Schopfhauser, M
Del Fabro, M
Zeppelzauer, M
Working Notes Proceedings of the MediaEval 2014 Workshop
Petkos, G
Papadopoulos, S
Rizzo, G
Mezaris, V
Troncy, R
EN
Barcelona, Spain
oct
1-2
CEUR-WS
2014.10.16
registered
Clustering and Retrieval of Social Events in Flickr
2014
Berlin, Germany
Cobarzan, Claudiu
Hudelist, Marco Andrea
Del Fabro, Manfred
MultiMedia Modeling, 20th Anniversary International Conference
Gurrin, C
Hopfgartner, F
Hurst, W
Johansen, H
Lee, H
O'Connor, N
EN
Dublin, Ireland
jan
402-406
Springer
2014.01.07
poster
Content-Based Video Browsing with Collaborating Mobile Clients
2014
Berlin, Heidelberg, New York
Del Fabro, Manfred
Böszörmenyi, Laszlo
0942-4962
Multimedia Systems
EN
feb
5
427-454
Springer-Verlag
State-of-the-art and future challenges in video scene detection: a survey
19
2013
Aachen, Germany
Zeppelzauer, Matthias
Zaharieva, Maia
Del Fabro, Manfred
MediaEval 2013 - Multimedia Benchmark Workshop
Larson, Martha
Anguera, Xavier
Reuter, Timo
Jones, Gareth
Ionescu, Bogdan
Schedl, Markus
Piatrik, Tomas
Hauff, Claudia
Soleymani, Mohammad
EN
Barcelona, Spain
oct
1-2
https://www.itec.aau.at/bib/files/mediaeval2013_submission_37.pdf
CEUR-WS.org/Vol-1043
2013.10.19
poster
Unsupervised Clustering of Social Events
2013
Berlin, Germany
Schoeffmann, Klaus
Ahlström, David
Bailer, Werner
Cobarzan, Claudiu
Hopfgartner, Frank
McGuinness, Kevin
Gurrin, Cathal
Frisson, Christian
Le, Duy-Dinh
Del Fabro, Manfred
Bai, Hongliang
Weiss, Wolfgang
International Journal of Multimedia Information Retrieval
EN
dec
1-15
Springer
The Video Browser Showdown: a live evaluation of interactive video search tools
2013
In the medical domain it has become common to store recordings of endoscopic surgeries or procedures. The storage of these endoscopic videos provides not only evidence of the work of the surgeons but also facilitates research, the training of new surgeons and supports explanations to the patients. However, an endoscopic video archive, where tens or hundreds of new videos are added each day, needs content-based analysis in order to provide content-based search. A fundamental first step in content analysis is the segmentation of the video. We propose a new method for segmentation of endoscopic videos, based on spatial and temporal differences of motion in these videos. Through an evaluation with 20 videos we show that our approach provides reasonable performance.
Los Alamitos, CA, USA
Primus, Manfred Jürgen
Schoeffmann, Klaus
Böszörmenyi, Laszlo
11th International Workshop on Content-Based Multimedia Indexing
Czuni, Laszlo
EN
Veszprem, Hungary
jun
223-228
https://www.itec.aau.at/bib/files/CBMI_2013_39.pdf
IEEE Computer Society
2013.06.18
registered
Segmentation of Recorded Endoscopic Videos by Detecting Significant Motion Changes
2013
Los Alamitos, CA, USA
Del Fabro, Manfred
Schoeffmann, Klaus
Guggenberger, Mario
Taschwer, Mario
11th International Workshop on Content-Based Multimedia Indexing
Czuni, Laszlo
EN
Veszprem, Hungary
jun
7-10
IEEE Computer Society
2013.06.18
poster
A Filtering Tool to Support Interactive Search in Internet Video Archives
2013
We present an improved version of last year’s winner of the Video Browser Showdown. In a preprocessing step video segments are detected and clustered in several latent classes of similar content based on color and motion information. The navigation bars of our video browser are then augmented with different colors indicating where elements of the detected clusters are located. As humans are able to classify the content of clusters fast, they can benefit from this information when browsing through a video.
Berlin Heidelberg
Del Fabro, Manfred
Münzer, Bernd
Böszörmenyi, Laszlo
Advances in Multimedia Modeling
10.1007/978-3-642-35728-2_9
Li, Shipeng
El-Saddik, Abdulmotaleb
Wang, Meng
Mei, Tao
Sebe, Nicu
Yan, Shuicheng
Hong, Richang
Gurrin, Cathal
978-3-642-35727-5
978-3-642-35728-2
EN
Huangshan, China
jan
544-546
Springer
Lecture Notes in Computer Science Volume 7733
19th International Conference, MMM 2013, Huangshan, China, January 7-9, 2013, Proceedings, Part II
2013.01.08
poster
AAU Video Browser with Augmented Navigation Bars
2013
While accuracy and speed get a lot of attention in video retrieval research, the investigation of interactive retrieval tools gets less attention and is often regarded as trivial. We want to show that even simple ideas have potential to improve the retrieval performance by giving some automated support to the browsing user. We present a video browsing concept where video segments are clustered in several latent classes of similar content. The navigation bars of our video browser are augmented with different colors indicating where elements of these clusters are located. As humans are able to classify the content of clusters fast, they can benefit from this information when browsing a video. We present a study where we investigated how humans can be supported in different video browsing tasks with a color-based and a motion-based clustering of video content.
Berlin Heidelberg
Del Fabro, Manfred
Münzer, Bernd
Böszörmenyi, Laszlo
Advances in Multimedia Modeling
10.1007/978-3-642-35728-2_9
Li, Shipeng
El-Saddik, Abdulmotaleb
Wang, Meng
Mei, Tao
Sebe, Nicu
Yan, Shuicheng
Hong, Richang
Gurrin, Cathal
978-3-642-35727-5
978-3-642-35728-2
EN
Huangshan, China
jan
88-98
Springer
Lecture Notes in Computer Science Volume 7733
19th International Conference, MMM 2013, Huangshan, China, January 7-9, 2013, Proceedings, Part II
2013.01.08
registered
Smart Video Browsing With Augmented Navigation Bars
2013
The Video Browser Showdown (VBS) is a live competition for evaluating video browsing tools regarding their efficiency at known-item search (KIS) tasks. The first VBS was held at MMM 2012 with eight teams working on 14 tasks, of which eight were completed by expert users and six by novices. We describe the details of the competition, analyze results regarding the performance of tools, the differences between the tasks and the nature of the false submissions.
Berlin Heidelberg
Bailer, Werner
Schoeffmann, Klaus
Ahlström, David
Weiss, Wolfgang
Del Fabro, Manfred
Advances in Multimedia Modeling
10.1007/978-3-642-35728-2_9
Li, Shipeng
El-Saddik, Abdulmotaleb
Wang, Meng
Mei, Tao
Sebe, Nicu
Yan, Shuicheng
Hong, Richang
Gurrin, Cathal
978-3-642-35724-4
978-3-642-35725-1
EN
Huangshan, China
jan
81-91
Springer
Lecture Notes in Computer Science Volume 7732
19th International Conference, MMM 2013, Huangshan, China, January 7-9, 2013, Proceedings, Part I
2013.01.07
registered
Interactive Evaluation of Video Browsing Tools
2013
Why do people record videos and share them? While the question seems to be simple, user intentions have not yet been investigated for video production and sharing. A general taxonomy would lead to adapted information systems and multimedia interfaces tailored to the users' intentions. We contribute (1) an exploratory user study with 20 participants, examining the various facets of user intentions for video production and sharing in detail and (2) a novel set of user intention clusters for video production, grounded empirically in our study results. We further reflect existing work in specialized domains (i.e. video blogging and mobile phone cameras) and show that prevailing models used in other multimedia fields (e.g. photography) cannot be used as-is to reason about video recording and sharing intentions.
Los Alamitos, CA, USA
Lux, Mathias
Huber, Jochen
Image Analysis for Multimedia Interactive Services (WIAMIS), 2012 13th International Workshop on
10.1109/WIAMIS.2012.6226758
O'Connor, Noel
Daras, Petros
Pereira, Fernando
978-1-4673-0789-5
978-1-4673-0791-8
2158-5873
Communication, Networking & Broadcasting ; Components, Circuits, Devices & Systems ; Computing & Processing (Hardware/Software) ; Signal Processing & Analysis
EN
Dublin, Ireland
jan
IEEE
1-4
IEEE
2012.05.25
registered
Why did you record this video? An exploratory study on user intentions for video production
2012
Vienna
Sobe, Anita
Elmenreich, Wilfried
Del Fabro, Manfred
European Meeting on Cybernetics and Systems Research Book of Abstracts
Bichler, Robert
Blachfellner, Stefan
Hofkirchner, Wolfgang
EN
Vienna, Austria
apr
197-200
EMCSR
2012.04.11
registered
Self-organizing content sharing at social events
http://www.emcsr.net/wp-content/uploads/2012/EMCSR_Book_of_Abstracts_V2.pdf
2012
In this paper we evaluate a 3D cylindrical interface that arranges image thumbnails by visual similarity for the purpose of image browsing. Through a user study we compare the performance of this interface to the performance of a common scrollable 2D list of thumbnails in a grid arrangement. Our evaluation shows that the 3D Cylinder interface enables significantly faster visual search and is the preferred search interface for the majority of tested users.
Los Alamitos, CA, USA
Schoeffmann, Klaus
Ahlström, David
Proceedings of The 13th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS 2012)
O'Connor, Noel
Daras, Petros
Pereira, Fernando
EN
Dublin, Ireland
may
1-4
IEEE
2012.05.23
registered
Using a Cylindrical Interface for Image Browsing to Improve Visual Search Performance
2012
With users increasingly using their mobile devices such as smartphones as digital photo albums, effective methods for managing these collections are becoming increasingly important. Standard solutions provide only limited facilities for organising, browsing and searching image collections on mobile devices, making it challenging and time-consuming to locate images of interest. In this demo paper, we present an intuitive interface for organising and browsing image collections on mobile devices. Images are arranged on a 3D globe according to colour similarity. To avoid image overlap image thumbnails are placed on a regular grid structure while large image collections are organised using a hierarchical data structure. Through multi-touch user interaction image browsing can be performed in an intuitive and effective manner.
New York, NY, USA
Schoeffmann, Klaus
Hudelist, Marco Andrea
Schaefer, Gerald
Del Fabro, Manfred
Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
10.1145/2324796.2324866
Horace, H S Ip
Rui, Yong
978-1-4503-1329-2
EN
Hong Kong, China
jun
61:1-61:2
ACM
2012.06.08
poster
Mobile Image Browsing on a 3D Globe
http://dl.acm.org/citation.cfm?id=2324866
2012
Interactive image and video search tools typically use a grid-like arrangement of thumbnails for preview purpose. Such a display, which is commonly known as storyboard, provides limited flexibility at interactive search and it does not optimally exploit the available screen estate. In this paper we design and evaluate alternatives to the common two-dimensional storyboard. We take advantage of 3D graphics in order to present image thumbnails in cylindrical arrangements. Through a user study we evaluate the performance of these interfaces in terms of visual search time and subjective performance.
Los Alamitos, CA, USA
Schoeffmann, Klaus
Ahlström, David
Böszörmenyi, Laszlo
Proceedings of the IEEE International Conference on Multimedia and Expo (ICME 2012)
Zhang, Jian
Schonfeld, Dan
Feng, David Dagan
Nanyang, Jianfei Cai
Hanjalic, Alan
Magli, Enrico
Pickering, Mark
Friedland, Gerald
Hua, Xian-Sheng
EN
Melbourne, Australia
July
848-853
IEEE Computer Society
2012.07.12
registered
3D Storyboards for Interactive Visual Search
2012
New York, NY, USA
Müller, Alexander
Lux, Mathias
Böszörmenyi, Laszlo
Proceedings of the 12th International Conference on Knowledge Management and Knowledge Technologies
10.1145/2362456.2362476
Lindstaedt, Stefanie
Granitzer, Michael
games with a purpose, human computation, video retrieval, video summarization
EN
jan
15:1-15:7
ACM
i-KNOW '12
none
The video summary GWAP: summarization of videos based on a social game
http://doi.acm.org/10.1145/2362456.2362476
2012
This paper demonstrates a novel 3D Video Browser (3VB) that enables interactive search within a single video as well as video collections by utilizing 3D projection and an intuitive interaction. The browsing approach is based on hierarchical search, which means that the user can split a video into several segments. The 3VB disposes a convenient interface that allows flexible arrangement of video segments in the 3D space. It allows for concurrent playback of video segments and flexible inspection of these segments at any desired level of detail through convenient user interaction.
Los Alamitos, CA, USA
Mueller, Christopher
Smole, Martin
Schoeffmann, Klaus
Proceedings of the IEEE International Conference on Multimedia and Expo (ICME 2012)
Zhang, Jian
Schonfeld, Dan
Feng, David Dagan
Nanyang, Jianfei Cai
Hanjalic, Alan
Magli, Enrico
Pickering, Mark
Friedland, Gerald
Hua, Xian-Sheng
EN
Melbourne, Australia
jul
665
https://www.itec.aau.at/bib/files/A_Demo_of_a_Hierarchical_Multi-Layout_3D_Video_Browser.pdf
IEEE Computer Society
2012.07.10
registered
A Demonstration of A Hierarchical Multi-Layout 3D Video Browser
2012
New York, NY, USA
Marques, Oge
Lux, Mathias
Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
10.1145/2348283.2348538
Hersh, William
Callan, Jamie
Maarek, Yoelle
Sanderson, Mark
content-based image retrieval, image search, java, visual information retrieval
EN
Portland, Oregon, USA
jan
1193-1193
ACM
SIGIR '12
2012.08.12
registered
Visual information retrieval using Java and LIRE
http://doi.acm.org/10.1145/2348283.2348538
2012
New York, NY, USA
Lux, Mathias
Taschwer, Mario
Marques, Oge
Proceedings of the 20th ACM international conference on Multimedia
10.1145/2393347.2396488
Aizawa, Kiyoharu
Babaguchi, Noboru
Smith, John
affection, image classification, image search, user intentions
EN
Nara, Japan
jan
1367-1368
ACM
MM '12
2012.11.01
registered
Classification of photos based on good feelings: ACM MM 2012 multimedia grand challenge submission
http://doi.acm.org/10.1145/2393347.2396488
2012
New York, NY, USA
Lux, Mathias
Taschwer, Mario
Marques, Oge
Proceedings of the ACM multimedia 2012 workshop on Crowdsourcing for multimedia
10.1145/2390803.2390811
Aizawa, Kiyoharu
Babaguchi, Noboru
Smith, John
digital photos, user intentions
EN
Nara, Japan
jan
17-18
ACM
CrowdMM '12
2012.10.29
registered
A closer look at photographers' intentions: a test dataset
http://doi.acm.org/10.1145/2390803.2390811
2012
Manual image annotation is a tedious and time-consuming task, while automated methods are error prone and limited in their results. Human computation, and especially games with a purpose, have shown potential to create high quality annotations by "hiding the complexity" of the actual annotation task and employing the "wisdom of the crowds". In this demo paper we present two games with a single purpose: finding regions in images that correspond to given terms. We discuss approach, implementation, and preliminary results of our work and give an outlook to immediate future work.
Palo Alto, California, USA
Lux, Mathias
Guggenberger, Mario
Müller, Alexander
Proceedings of the Eighth Artificial Intelligence and Interactive Digital Entertainment International Conference (AIIDE 2012)
Riedl, Mark
Sukthankar, Gita
978-1-57735-582-3
Games with a Purpose; Human Computation
EN
jan
220
Association for the Advancement of Artificial Intelligence (AAAI Press)
none
Finding Image Regions with Human Computation and Games with a Purpose
http://www.aaai.org/ocs/index.php/AIIDE/AIIDE12/paper/view/5474
2012
Content-based retrieval systems leverage low level features such as color, texture or local information of images to find similar images to a respective query image. In recent years the Bag of Visual Words (BoVW) approach, which relies on quantized visual information around local image patches, has gained importance in image retrieval. In this paper we focus on fuzzy algorithms, in order to improve the descriptiveness of image descriptors. We extend the BoVW approach by applying fuzzy clustering and fuzzy assignment to take a step towards more effective visual descriptors, which are matched against each other in content-based similarity searches.
New York, NY, USA
Kogler, Marian
Lux, Mathias
i-KNOW '12 Proceedings of the 12th International Conference on Knowledge Management and Knowledge Technologies
10.1145/2362456.2362498
Lindstaedt, Stefanie
bag of visual words, content based image retrieval, fuzzy, visual information retrieval
EN
jan
34.1 - 34.4
ACM
i-KNOW '12
none
Robust image retrieval using bag of visual words with fuzzy codebooks and fuzzy assignment
http://doi.acm.org/10.1145/2362456.2362498
2012
In this report we describe our approach to the known-item search task for TRECVID 2012. We describe how we index available metadata and how we gain additional information about the videos using content-based analysis. A rule-based query expansion and query reduction method is applied to increase the number of relevant videos in automatic runs. Furthermore, we describe an approach for quick, interactive filtering of large result sets. We outline how the parameters of our system were tuned for the IACC dataset and discuss our TRECVID 2012 KIS results.
Gaithersburg, USA
Del Fabro, Manfred
Lux, Mathias
Schoeffmann, Klaus
Taschwer, Mario
Proceedings of TRECVID 2012
Over, Paul
Awad, George
Michel, Martial
Fiscus, Jonathan
Sanders, Greg
Shaw, Barbara
Kraaij, Wessel
Smeaton, Alan
Quénot, Georges
EN
Gaithersburg, USA
nov
11
National Institute of Standards and Technology (NIST)
2012.11.28
poster
ITEC-UNIKLU Known-Item Search Submission 2012
http://www-nlpir.nist.gov/projects/tvpubs/tv.pubs.org.html
2012
In this paper, we investigate whether community-contributed multimedia content can be used to make video summaries of social events. We implemented an event summarization algorithm that uses photos from Flickr and videos from YouTube to compose summaries of well-known society events, which took place in the last three years. The comparison with a manually obtained ground truth shows a good coverage of the most important situations of these events. We do not claim to produce the best summaries possible, which may be compared to the work of a human director, but we analyze what can be achieved with community-contributed content by now.
France
Del Fabro, Manfred
Sobe, Anita
Böszörmenyi, Laszlo
Proceedings of the Fourth International Conferences on Advances in Multimedia (MMEDIA 2012)
Davies, Philip
Newell, David
978-1-61208-195-3
video summarization. event summarization. social media. real-life events. video retrieval. image retrieval. multimedia entertainment.
EN
Chamonix Mont-Blanc, France
apr
119-126
https://www.itec.aau.at/bib/files/mmedia_2012_6_30_40058.pdf
IARIA
2012.05.02
registered
Summarization of Real-Life Events Based on Community-Contributed Content
http://www.thinkmind.org/download.php?articleid=mmedia_2012_6_30_40058
2012
Los Alamitos, CA, USA
Ahlström, David
Schoeffmann, Klaus
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops
Zhang, Jian
Schonfeld, Dan
Feng, David Dagan
Nanyang, Jianfei Cai
Hanjalic, Alan
Magli, Enrico
Pickering, Mark
Friedland, Gerald
Hua, Xian-Sheng
EN
Melbourne, Australia
jul
546-551
IEEE Computing Society
2012.07.13
registered
A Visual Search User Study on the Influences of Aspect Ratio Distortion of Preview Thumbnails
2012
Default image browsing interfaces on touch-based mobile devices provide limited support for image search tasks. To facilitate fast and convenient searches we propose an alternative interface that takes advantage of 3D graphics and arranges images on a rotatable globe according to color similarity. In a user study we compare the new design to the iPad's image browser. Results collected from 24 participants show that for color-sorted image collections the globe can reduce search time by 23% without causing more errors and that it is perceived as being fun to use and preferred over the standard browsing interface by 70% of the participants.
New York, USA
Ahlström, David
Hudelist, Marco Andrea
Schoeffmann, Klaus
Schaefer, Gerald
Proceedings of the 20th ACM international conference on Multimedia
Babaguchi, Noboru
Aizawa, Kiyoharu
Smith, John
978-1-4503-1089-5
EN
Nara, Japan
nov
pp. 925-928
ACM Digital Library
2012.10.31
registered
A User Study on Image Browsing on Touchscreens
http://dl.acm.org/citation.cfm?id=2393347&coll=DL&dl=ACM&CFID=159013035&CFTOKEN=94655035
2012