[16] | Manfred Del Fabro, Laszlo Böszörmenyi, State-of-the-art and future challenges in video scene detection: a survey, In Multimedia Systems, Springer-Verlag, vol. 19, no. 5, Berlin, Heidelberg, New York, pp. 427-454, 2013.
[bib] |
[15] | Roland Tusch, Felix Pletzer, Armin Kraetschmer, Laszlo Böszörmenyi, Bernhard Rinner, Thomas Mariacher, Manfred Harrer, Efficient Level of Service Classification for Traffic Monitoring in the Compressed Video Domain, In ICME '12 Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops (Jian Zhang, Dan Schonfeld, David Feng Deagan, eds.), IEEE, Piscataway (NJ), pp. 967-972, 2012.
[bib] [doi] [abstract]
Abstract: This paper presents a new method for estimating the level of service (LOS) on motorways in the compressed video domain. The method performs statistical computations on motion vectors of MPEG4 encoded video streams within a predefined region of interest to determine a set of four motion features describing the speed and density of the traffic stream. These features are fed into a Gaussian radial basis function network to classify the corresponding LOS. To improve the classification results, vectors of moving objects are clustered and outliers are eliminated. The proposed method is designed to be executed on a server system, where a large number of camera live streams can be analyzed in parallel in real-time. Evaluations with a comprehensive set of real-world training and test data from an Austrian motorway have shown an average accuracy of 86.7% on the test data set for classifying all four LOS levels. With a mean execution time of 48 microseconds per frame on a common server, hundreds of video streams can be analyzed in real-time.
|
[14] | Roland Tusch, Felix Pletzer, Vijay Mudunuri, Armin Kraetschmer, Karuna Sabbavarapu, Marian Kogler, Laszlo Böszörmenyi, Bernhard Rinner, Manfred Harrer, Thomas Mariacher, P Hrassnig, LOOK2 - A Video-based System for Real-time Notification of Relevant Traffic Events., In ICMEW '12 Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops (Jian Zhang, Dan Schonfeld, Deagan David Feng, eds.), IEEE, Piscataway (NJ), pp. 670, 2012.
[bib] [doi] [abstract]
Abstract: We demonstrate our novel video-based real-time traffic event notification and verification system LOOK2. It generates fast and reliable traffic information about relevant traffic state and road conditions changes on observed roads. It utilizes installed road-side sensors providing low-level traffic and environmental data, as well as video sensors which gain high-level traffic information from live video analysis. Spatio-temporal data fusion is applied on all available traffic and environmental data to gain reliable traffic information. This traffic information is published by a DATEXII compliant web service to a web-based traffic desk application. Road network and traffic channel operators receive real-time and relevant traffic event notifications by using this application. The system also enables a visual verification of the notified situations.
|
[13] | Manfred Del Fabro, Laszlo Böszörmenyi, AAU Video Browser: Non-Sequential Hierarchical Video Browsing without Content Analysis, In Advances in Multimedia Modeling (Klaus Schoeffmann, Bernard Merialdo, Alexander Hauptmann, Chong-Wah Ngo, Yiannis Andreopoulos, Christian Breiteneder, eds.), Springer, Berlin, Heidelberg, New York, pp. 639-641, 2012.
[bib] [doi] [pdf] [abstract]
Abstract: We participate in the Video Browser Showdown with our easy-to-use video browsing tool. It can be used for getting a quick overview of videos as well as for simple Known Item Search (KIS) tasks. It offers a parallel and a tree-like browsing interface for navigating through the content of single videos or even small video collections in a hierarchical, non-sequential manner. We want to validate whether simple KIS tasks can be completed without a time consuming content analysis in advance.
|
[12] | Manfred Del Fabro, Laszlo Böszörmenyi, Summarization and Presentation of Real-Life Events Using Community-Contributed Content, In Advances in Multimedia Modeling (Klaus Schoeffmann, Bernard Merialdo, Alexander Hauptmann, Chong-Wah Ngo, Yiannis Andreopoulos, Christian Breiteneder, eds.), Springer, Berlin, Heidelberg, New York, pp. 630-632, 2012.
[bib] [doi] [pdf] [abstract]
Abstract: We present an algorithm for the summarization of social events with community-contributed content from Flickr and YouTube. A clustering algorithm groups content related to the searched event. Date information, GPS coordinates, user ratings and visual features are used to select relevant photos and videos. The composed event summaries are presented with our video browser.
|
[11] | Anita Sobe, Wilfried Elmenreich, Laszlo Böszörmenyi, Replication for Bio-inspired Delivery in Unstructured Peer-to-Peer Networks, In Proceedings of the Ninth Workshop on intelligent solutions for embedded systems (Markus Kucera, Thomas Waas, eds.), IEEE, Los Alamitos, CA, USA, pp. 6, 2011.
[bib] |
[10] | Klaus Schoeffmann, Manfred Del Fabro, Hierarchical Video Browsing with a 3D Carousel, In Proceedings of the ACM International Conference on Multimedia (Selcuk Candan, Sethuraman Panchanthan, Balakrishnan Prabhakaran Prabhakaran, eds.), ACM Pre, Scottsdale, AZ, USA, pp. 1609-1612, 2011.
[bib] |
[9] | Manfred Del Fabro, Non-Sequential Decomposition, Composition and Presentation of Multimedia Content, PhD thesis, Klagenfurt University, pp. 168, 2011.
[bib] [abstract]
Abstract: This thesis discusses three major issues that arise in the context of non-sequential usage of multimedia content, i.e. a usage, where users only access content that is interesting for them. These issues are (1) semantically meaningful segmentation of videos, (2) composition of new video streams with content from different sources and (3) non-sequential presentation of multimedia content. A semantically meaningful segmentation of videos can be achieved by partitioning a video into scenes. This thesis gives a comprehensive survey of scene segmentation approaches, which were published in the last decade. The presented approaches are categorized based on the underlying mechanisms used for the segmentation. The characteristics that are common for each category as well as the strengths and weaknesses of the presented algorithms are stated. Additionally, an own scene segmentation approach for sports videos with special properties is introduced. Scenes are extracted based on recurring patterns in the motion information of a video stream. Furthermore, different approaches in the context of real-life events are presented for the composition of new video streams based on content from multiple sources. Community-contributed photos and videos are used to generate video summaries of social events. The evaluation shows that by using content provided by a crowd of people a new and richer view of an event can be created. This thesis introduces a new concept for this emerging view, which is called ``The Vision of Crowds''. The presentation of such newly, composed video streams is described with a simple but powerful formalism. It provides a great flexibility in defining the temporal and spatial arrangement of content. Additionally, a video browsing application for the hierarchical, non-sequential exploration of video content is introduced. It is able to interpret the formal description of compositions and can be adapted for different purposes with plug-ins.
|
[8] | Manfred Del Fabro, Laszlo Böszörmenyi, The Vision of Crowds: Social Event Summarization Based on User- Generated Multimedia Content, In ACM CHI 2011 Workshop – Data Collection By The People For The People (Christine Robson, Sean Kandel, Jeff Heer, Jeff Pierce, eds.), published on workshop homepage, http://databythepeople.com/ (May 2011), pp. 1-5, 2011.
[bib] [pdf] [abstract]
Abstract: In this position paper we introduce the idea of generating a superior view of a large social event, based on user-generated -- crowdsourced -- content. Instead of just collecting and making them available in a raw form (as social platforms like YouTube), we automatically generate semantically coherent summarizations of the entire event. The individual consuming user gets thus a compact view generated by a large number of producing users. We call this idea the "Vision of Crowds". A case study has been conducted at a social event where we used user-generated content to automatically generate live reports about that event. Furthermore, we have implemented a GUI that allows users to interactively compose personalized video summaries, based on the user-generated data collected at the case study.
|
[7] | Anita Sobe, Wilfried Elmenreich, Laszlo Böszörmenyi, Towards a self-organizing replication model for non-sequential media access, In Proceedings of the 18th International Conference on Multimedea 2010 (Alberto Del Bimbo, Shih-Fu Chang, Arnold Smeulders, eds.), ACM, New York, pp. 3-8, 2010.
[bib] |
[6] | Manfred Del Fabro, Klaus Schoeffmann, Laszlo Böszörmenyi, Instant Video Browsing: A Tool for Fast Non-sequential Hierarchical Video Browsing, In Proceedings of HCI in Work and Learning, Life and Leisure 6th Symposium of the Workgroup Human-Computer Interaction and Usability Engineering (Gerhard Leitner, Martin Hitz, Andreas Holzinger, eds.), Springer Verlag GmbH, Berlin, Heidelberg, New York, pp. 443-446, 2010.
[bib] [doi] [pdf] [abstract]
Abstract: We introduce an easy-to-use video browsing tool which assists users in getting a quick overview of videos as well as in finding segments of interest. It provides a parallel and a tree-based view for browsing the content of videos -- or even video collections -- in a hierarchical, non-sequential manner. The tool has a plug-in architecture and can be extended both by further presentation methods and by video analysis algorithms.
|
[5] | Manfred Del Fabro, Laszlo Böszörmenyi, Video Scene Detection Based on Recurring Motion Patterns, In Proceedings of the Second International Conference on Advances in Multimedia (MMEDIA 2010) (Laszlo Böszörmenyi, Dumitru Burdescu, Philip Davies, David Newell, eds.), IEEE, Washington (DC), pp. 113-118, 2010.
[bib] [doi] [pdf] [abstract]
Abstract: We present an algorithm for video scene detection based on the identification of recurring motion sequences within a video stream. The motion information is extracted in the compressed domain of H.264/AVC videos, no full decoding of the video stream is needed. Based on the motion information our algorithm identifies sequences of adjacent frames with similar motion. Throughout all identified motion sequences we are searching for recurring patterns of similar ones. The most recurring pattern is used for the segmentation of the video stream into scenes. The evaluation shows promising results.
|
[4] | Mathias Lux, An Evaluation of Metrics for Retrieval of MPEG-7 Semantic Descriptions, In Multimedia, 2009. ISM '09. 11th IEEE International Symposium on (Jeffrey Tsai, Ramesh Jain, eds.), IEEE, Los Alamitos, CA, USA, pp. 546-551, 2009.
[bib] [doi] [abstract]
Abstract: MPEG-7 is an extensive multimedia metadata standard covering a huge number of aspects of metadata. However, as with most metadata standards details of usage and application of the standards are – at least partially – open to interpretation. In case of MPEG-7storage and transmission of high level metadata on concept level are defined but retrieval methods are not proposed. So if for instance a user annotates photos using the MPEG-7 semantic description scheme, there are no standardized ways to retrieve the photos based on the annotation. In this paper we propose metrics for retrieval based on the MPEG-7 semantic description scheme and evaluate them in a digital photo retrieval scenario.
|
[3] | Marian Kogler, Manfred Del Fabro, Mathias Lux, Klaus Schoeffmann, Laszlo Böszörmenyi, Global vs. Local Feature in Video Summarization: Experimental Results, In Proceedings of the 10th International Workshop of the Multimedia Metadata Community on Semantic Multimedia Database Technologies (SeMuDaTe'09) in conjunction with the 4th International Conference on Semantic and Digital Media Technologies (SAMT 2009) (Klamma Ralf, Kosch Harald, Mathias Lux, Stegmaier Florian, eds.), http://ceur-ws.org, Aachen, Germany, pp. 6, 2009.
[bib][url] |
[2] | Christoph Kofler, Mathias Lux, Dynamic presentation adaptation based on user intent classification., In MM '09 Proceedings of the 17th ACM international conference on Multimedia (Wen Gao, Yong Tui, Alan Hanjalic, eds.), NA, NA, pp. 1117-1118, 2009.
[bib][url] [doi] [abstract]
Abstract: Results of internet searches are typically presented as lists. When searching for digital photos different search result presentations however offer different benefits. If users are primarily interested in the visual content of images a thumbnail grid may be more appropriate than a list. For people searching photos taken at a specific place image metadata in the result presentation is of interest too. In this paper we present an application which monitors a user's behavior while searching for digital photos and classifies the user's intention. Based on the intention, the result is adapted to support the user in an optimal way.
|
[1] | Christoph Kofler, Mathias Lux, An Exploratory Study on the Explicitness of User Intentions in Digital Photo Retrieval., In Proceedings of I-KNOW ’09 and I-SEMANTICS ’09 (Klaus Tochtermann, Hermann Maurer, eds.), TU Graz & Know Center, Graz, Austria, pp. 208-214, 2009.
[bib][url] [abstract]
Abstract: Search queries are typically interpreted as specification of information need of a user. Typically the search query is either interpreted as is or based on the context of a user, being for instance a user profile, his/her previously undertaken searches or any other background information. The actual intent of the user – the goal s/he wants to achieve with information retrieval – is an important part of a user’s context. In this paper we present the results of an exploratory study on the interplay between the goals of users and their search behavior in multimedia retrieval.
|