Categories: DMS - page 4 - ITEC Publications [rss] [bib] [xml]

[163]	David Ahlström, Klaus Schoeffmann, A Visual Search User Study on the Influences of Aspect Ratio Distortion of Preview Thumbnails, In Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops (Jian Zhang, Dan Schonfeld, David Dagan Feng, Jianfei Cai Nanyang, Alan Hanjalic, Enrico Magli, Mark Pickering, Gerald Friedland, Xian-Sheng Hua, eds.), IEEE Computing Society, Los Alamitos, CA, USA, pp. 546-551, 2012. [bib]
[162]	David Ahlström, Marco Andrea Hudelist, Klaus Schoeffmann, Gerald Schaefer, A User Study on Image Browsing on Touchscreens, In Proceedings of the 20th ACM international conference on Multimedia (Noboru Babaguchi, Kiyoharu Aizawa, John Smith, eds.), ACM Digital Library, New York, USA, pp. pp. 925-928, 2012. [bib][url] [abstract] Abstract: Default image browsing interfaces on touch-based mobile devices provide limited support for image search tasks. To facilitate fast and convenient searches we propose an alternative interface that takes advantage of 3D graphics and arranges images on a rotatable globe according to color similarity. In a user study we compare the new design to the iPad's image browser. Results collected from 24 participants show that for color-sorted image collections the globe can reduce search time by 23% without causing more errors and that it is perceived as being fun to use and preferred over the standard browsing interface by 70% of the participants.
2011
[161]	Stefan Wieser, Laszlo Böszörmenyi, Decentralized topology aggregation for QoS estimation in large overlay networks, In 2011 IEEE International Symposium on Network Computing and Applications (NCA2011) (Olsen Rasmus Løvenstein, Romano Paolo, Tsuchiya Tatsuhiro, eds.), IEEE, Cambridge, USA, pp. 298-302, 2011. [bib] [pdf] [abstract] Abstract: This paper introduces a scalable approach for efficient, low-cost multi-level Quality of Service (QoS) estimation in large overlay networks (ON). We modify an existing distributed partitioning algorithm [1], and use it to create ”QoS maps”. QoS maps empower applications to quickly predict several QoS metrics for any given route, and to obtain multiple alternative routes to any target node in the ON. We show that our modifications of the partitioning algorithm permit the aggregation of large hubs, but still preserve the sublinear runtime of the original heuristic. Simulations with large ONs are performed to evaluate the proposed approach and demonstrate its scalability. Finally, we outline our estimation algorithm that we use to predict QoS and perform QoS aware routing in any given ON.
[160]	Stefan Wieser, Laszlo Böszörmenyi, Decentralized topology aggregation for QoS estimation in large overlay networks, Technical report, Institute of Information Technology (ITEC), Klagenfurt University, no. TR/ITEC/01/2.11, Klagenfurt, Austria, pp. 26, 2011. [bib] [pdf]
[159]	Anita Sobe, Wilfried Elmenreich, Laszlo Böszörmenyi, Storage Balancing in Self-organizing Multimedia Delivery Systems, Technical report, Institute of Information Technology (ITEC), Klagenfurt University, no. TR/ITEC/01/2.13, arxiv e-print 1111.0242, Klagenfurt, Austria, pp. 16, 2011. [bib][url]
[158]	Anita Sobe, Wilfried Elmenreich, Laszlo Böszörmenyi, Replication for Bio-inspired Delivery in Unstructured Peer-to-Peer Networks, In Proceedings of the Ninth Workshop on intelligent solutions for embedded systems (Markus Kucera, Thomas Waas, eds.), IEEE, Los Alamitos, CA, USA, pp. 6, 2011. [bib]
[157]	Christoph Lagger, Mathias Lux, Oge Marques, Which video do you want to watch now?, In Workshop on Multimedia on the Web 2011, Proceedings of (Laszlo Böszörmenyi, Oge Marques, Mathia Lux, Ralf Klamma, eds.), IEE, Los Alamitos, California, USA, pp. 45-48, 2011, to appear in IEEE. [bib] [doi] [abstract] Abstract: Searching and retrieving videos in a meaningful way on the web is still an open problem. The integration of a user's context into search is one of the most promising approaches to enhance current search interfaces and algorithms. We performed two user studies investigating video search, video retrieval, and video sharing behavior. In this paper we present an overview on the findings most promising for enhancing user experience in video retrieval. We propose a software prototype that implements an adaptive video retrieval system, that utilizes the users' intentions to provide better search results in a user interface adapted to the intentions and needs of users.
[156]	Manfred Del Fabro, Non-Sequential Decomposition, Composition and Presentation of Multimedia Content, PhD thesis, Klagenfurt University, pp. 168, 2011. [bib] [abstract] Abstract: This thesis discusses three major issues that arise in the context of non-sequential usage of multimedia content, i.e. a usage, where users only access content that is interesting for them. These issues are (1) semantically meaningful segmentation of videos, (2) composition of new video streams with content from different sources and (3) non-sequential presentation of multimedia content. A semantically meaningful segmentation of videos can be achieved by partitioning a video into scenes. This thesis gives a comprehensive survey of scene segmentation approaches, which were published in the last decade. The presented approaches are categorized based on the underlying mechanisms used for the segmentation. The characteristics that are common for each category as well as the strengths and weaknesses of the presented algorithms are stated. Additionally, an own scene segmentation approach for sports videos with special properties is introduced. Scenes are extracted based on recurring patterns in the motion information of a video stream. Furthermore, different approaches in the context of real-life events are presented for the composition of new video streams based on content from multiple sources. Community-contributed photos and videos are used to generate video summaries of social events. The evaluation shows that by using content provided by a crowd of people a new and richer view of an event can be created. This thesis introduces a new concept for this emerging view, which is called ``The Vision of Crowds''. The presentation of such newly, composed video streams is described with a simple but powerful formalism. It provides a great flexibility in defining the temporal and spatial arrangement of content. Additionally, a video browsing application for the hierarchical, non-sequential exploration of video content is introduced. It is able to interpret the formal description of compositions and can be adapted for different purposes with plug-ins.
[155]	Manfred Del Fabro, Laszlo Böszörmenyi, The Vision of Crowds: Social Event Summarization Based on User- Generated Multimedia Content, In ACM CHI 2011 Workshop – Data Collection By The People For The People (Christine Robson, Sean Kandel, Jeff Heer, Jeff Pierce, eds.), published on workshop homepage, http://databythepeople.com/ (May 2011), pp. 1-5, 2011. [bib] [pdf] [abstract] Abstract: In this position paper we introduce the idea of generating a superior view of a large social event, based on user-generated -- crowdsourced -- content. Instead of just collecting and making them available in a raw form (as social platforms like YouTube), we automatically generate semantically coherent summarizations of the entire event. The individual consuming user gets thus a compact view generated by a large number of producing users. We call this idea the "Vision of Crowds". A case study has been conducted at a social event where we used user-generated content to automatically generate live reports about that event. Furthermore, we have implemented a GUI that allows users to interactively compose personalized video summaries, based on the user-generated data collected at the case study.
[154]	Laszlo Böszörmenyi, Manfred del Fabro, Marian Kogler, Mathias Lux, Oge Marques, Anita Sobe, Innovative directions in self-organized distributed multimedia systems, In Multimedia Tools and Applications, Springer Verlag, vol. 51, Berlin, Heidelberg, New York, pp. 525-553, 2011, 10.1007/s11042-010-0622-z. [bib] [doi] [abstract] Abstract: The way by which multimedia contents are produced, delivered across networks, and consumed by intended users have shifted significantly during the past 10 years. In this paper we postulate that, in the near future, flexible and self-organizing facilities will play a dominating role in distributed multimedia systems. We discuss how such systems can be designed, using a three-layer (sensor, distribution, and user layer) architecture, SOMA (Self Organizing Multimedia Architecture), as an example. We also present innovative directions in three main aspects of self-organized multimedia systems: (i) the self-organizing aspects of multimedia user communities, e.g., the wisdom, intentions, and needs of users; (ii) a fresh look at video streams that treat them as a collection of units that can be composed taking user and network aspects into account; and (iii) new delivery paradigms and how self-organization and multimedia delivery can be combined.
2010
[153]	Stefan Wieser, Laszlo Böszörmenyi, Flocks: Interest-Based Construction of Overlay Networks, In The Second International Conferences on Advances in Multimedia (MMEDIA 2010) (Huet Benoit, Smeaton Alan, Mayer-Patel Ketan, Avrithis Yannis, eds.), IEEE, Washington, USA, pp. 119-124, 2010. [bib] [doi] [abstract] Abstract: Self-organizing overlay networks have received a lot of attention in the recent years. However, despite the popularity of content-aware and topology-aware overlay networks, surprisingly little research has been done to combine both approaches. In this paper, we create robust and flexible overlay networks that we call “Flocks”, which can be content-aware, topology-aware, or a combination of both. We model affinity with interests and properties and show the resulting overlay networks work in a decentralized, self-organizing way, and stabilize quickly.
[152]	Anita Sobe, Wilfried Elmenreich, Laszlo Böszörmenyi, Towards a self-organizing replication model for non-sequential media access, In Proceedings of the 18th International Conference on Multimedea 2010 (Alberto Del Bimbo, Shih-Fu Chang, Arnold Smeulders, eds.), ACM, New York, pp. 3-8, 2010. [bib]
[151]	Anita Sobe, Laszlo Böszörmenyi, Mario Taschwer, Video Notation (ViNo): A Formalism for Describing and Evaluating Non-sequential Multimedia Access, In International Journal on Advances in Software, International Academy, Research and Industry Association (IARIA), vol. 3, no. 1 & 2, Valencia, Spain, pp. 19-30, 2010. [bib][url] [pdf] [abstract] Abstract: The contributions of this paper are threefold: (1) the extensive introduction of a formal Video Notation (ViNo) that allows for describing different multimedia transport techniques for specifying required QoS; (2) the application of this formal notation to analyzing different transport mechanisms without the need of detailed simulations; (3) further application of ViNo to caching techniques, leading to the introduction of two cache admission policies and one replacement policy supporting nonsequential multimedia access. The applicability of ViNo is shown by example and by analysis of an existing CDN simulation. We find that a pure LRU replacement yields significantly lower hit rates than our suggested popularity-based replacement. The evaluation of caches was done by simulation and by usage of ViNo.
[150]	Sawas Chatzchristofis, Yiannis Boutalis, Mathias Lux, Combining Color and Spatial Color Distribution Information in a Fuzzy Rule Based Compact Composite Descriptor, In Agents and Artificial Intelligence (Joaquim Filipe, Ana Fred, Bernadette Sharp, eds.), Springer, Heidelberg, Germany, pp. pp. 49-60, 2010. [bib] [doi] [abstract] Abstract: In this paper, a novel low level feature for content based image retrieval is presented. The proposed feature structure combines color and spatial color distribution information. The combination of these two features in one vector classifies the proposed descriptor to the family of Composite Descriptors. In order to extract the color information, a fuzzy system is being used, which is mapping the number of colors that are included in the image into a custom palette of 8 colors. The way by which the vector of the proposed descriptor is being formed, describes the color spatial information contained in images. To be applicable in the design of large image databases, the proposed descriptor is compact, requiring only 48 bytes per image. Experiments presented in this paper demonstrate the effectiveness of the proposed technique especially for Hand-Drawn Sketches.
[149]	Manfred Del Fabro, Klaus Schoeffmann, Laszlo Böszörmenyi, Instant Video Browsing: A Tool for Fast Non-sequential Hierarchical Video Browsing, In Proceedings of HCI in Work and Learning, Life and Leisure 6th Symposium of the Workgroup Human-Computer Interaction and Usability Engineering (Gerhard Leitner, Martin Hitz, Andreas Holzinger, eds.), Springer Verlag GmbH, Berlin, Heidelberg, New York, pp. 443-446, 2010. [bib] [doi] [pdf] [abstract] Abstract: We introduce an easy-to-use video browsing tool which assists users in getting a quick overview of videos as well as in finding segments of interest. It provides a parallel and a tree-based view for browsing the content of videos -- or even video collections -- in a hierarchical, non-sequential manner. The tool has a plug-in architecture and can be extended both by further presentation methods and by video analysis algorithms.
[148]	Manfred Del Fabro, Laszlo Böszörmenyi, Video Scene Detection Based on Recurring Motion Patterns, In Proceedings of the Second International Conference on Advances in Multimedia (MMEDIA 2010) (Laszlo Böszörmenyi, Dumitru Burdescu, Philip Davies, David Newell, eds.), IEEE, Washington (DC), pp. 113-118, 2010. [bib] [doi] [pdf] [abstract] Abstract: We present an algorithm for video scene detection based on the identification of recurring motion sequences within a video stream. The motion information is extracted in the compressed domain of H.264/AVC videos, no full decoding of the video stream is needed. Based on the motion information our algorithm identifies sequences of adjacent frames with similar motion. Throughout all identified motion sequences we are searching for recurring patterns of similar ones. The most recurring pattern is used for the segmentation of the video stream into scenes. The evaluation shows promising results.
2009
[147]	Anita Sobe, Laszlo Böszörmenyi, Non-sequential Multimedia Caching, In 2009 First International Conference on Advances in Multimedia (Dan Burdescu, Petre Dini, eds.), IEEE, Los Alamitos, CA, USA, pp. 158-161, 2009. [bib] [doi]
[146]	Klaus Schoeffmann, Mathias Lux, Mario Taschwer, Laszlo Böszörmenyi, Visualization of Video Motion in Context of Video Browsing, In ICME'09 Proceedings of the 2009 IEEE international Conference on Multimedia and Expo (CY Lin, I Cox, eds.), IEEE, Los Alamitos, CA, USA, pp. 658-661, 2009. [bib][url] [abstract] Abstract: We present a new approach for video browsing using visualization of motion direction and motion intensity statistics by color and brightness variations. Statistics are collected from motion vectors of H.264/AVC encoded video streams, so full video decoding is not required. By interpreting visualized motion patterns of video segments, users are able to quickly identify scenes similar to a prototype scene or identify potential scenes of interest. We give some examples of motion patterns with different semantic value, including camera zooms, hill jumps of ski-jumpers, and the repeated appearance of a news speaker. In a user study we show that certain scenes of interest can be found significantly faster using our video browsing tool than using a video player with VCR-like controls.
[145]	Klaus Schoeffmann, Mario Taschwer, Laszlo Böszörmenyi, Video Browsing Using Motion Visualization, In Proceedings oft the International Conference on Multimedia and Expo 2009 (CY Lin, I Cox, eds.), IEEE, Los Alamitos, CA, USA, pp. 1835-1836, 2009. [bib] [abstract] Abstract: We present a video browsing tool that uses a novel and powerful visualization technique of video motion. The tool provides an interactive navigation index that allows users to quickly and easily recognize content semantics like scenes with fast/slow motion (in general or according to a specific direction), scenes showing still/moving objects in front of a still/moving background, camera pans, or camera zooms. Moreover, the visualization facilitates identification of similar segments in a video. A first user study has shown encouraging results.
[144]	Klaus Schoeffmann, Laszlo Böszörmenyi, Video Browsing Using Interactive Navigation Summaries, In Content-Based Multimedia Indexing, 2009. CBMI '09 (Yannis Avrithis, Stefanos Kollias, eds.), IEEE, Los Alamitos, CA, USA, pp. 243-248, 2009. [bib] [doi] [abstract] Abstract: A new approach for interactive video browsing is described. The novelty of the proposed approach is the flexible concept of interactive navigation summaries. Similar to time sliders, commonly used with standard soft video players, navigation summaries allow random access to a video. In addition, they also provide abstract visualizations of the content at a user-defined level of detail and, thus, quickly communicate content characteristics to the user. Navigation summaries can provide visual information about both low-level features but even high-level features. The concept fully integrates the user, who knows best which navigation summary at which level of detail could be most beneficial for his/her current video browsing task, and provide him/her a flexible set of navigation means. A first user study has shown that our approach can significantly outperform standard soft video players - the state-of-the art “poor man’s” video browsing tool.
[143]	Klaus Schoeffmann, Laszlo Böszörmenyi, Interactive Video Browsing of H.264 Content Based on Just-in-Time Analysis, In Advance in Semantic Media Adaptation and Personalization (Marios C Angelides, Phivos Mylonas, eds.), Auerbach Publications, Boca Raton, FL, USA, pp. 159-179, 2009. [bib]
[142]	Klaus Schoeffmann, Mathias Lux, Laszlo Böszörmenyi, A Novel Approach for Fast and Accurate Commercial Detection in H.264/AVC Bit Streams Based on Logo Identification, In Advances in Multimedia Modeling (Benoit Huet, Alan Smeaton, Ketan Mayer-Patel, Yannis Avrithis, eds.), Springer, Berlin, Heidelberg, New York, pp. 119-127, 2009. [bib][url] [doi] [abstract] Abstract: Commercial blocks provide no extra value for video indexing, retrieval, archiving, or summarization of TV broadcasts. Therefore, automatic detection of commercial blocks is an important topic in the domain of multimedia information systems. We present a commercial detection approach which is based on logo detection performed in the compressed domain. The novelty of our approach is that by taking advantage of advanced features of the H.264/AVC coding, it is both significantly faster and more exact than existing approaches working directly on compressed data. Our approach enables removal of commercials in a fraction of real-time while achieving an average recall of 97.33% with an average precision of 99.31%. Moreover, due to its run-time performance, our approach can also be employed on low performance devices, for instance DVB recorders.
[141]	Mathias Lux, An Evaluation of Metrics for Retrieval of MPEG-7 Semantic Descriptions, In Multimedia, 2009. ISM '09. 11th IEEE International Symposium on (Jeffrey Tsai, Ramesh Jain, eds.), IEEE, Los Alamitos, CA, USA, pp. 546-551, 2009. [bib] [doi] [abstract] Abstract: MPEG-7 is an extensive multimedia metadata standard covering a huge number of aspects of metadata. However, as with most metadata standards details of usage and application of the standards are – at least partially – open to interpretation. In case of MPEG-7storage and transmission of high level metadata on concept level are defined but retrieval methods are not proposed. So if for instance a user annotates photos using the MPEG-7 semantic description scheme, there are no standardized ways to retrieve the photos based on the annotation. In this paper we propose metrics for retrieval based on the MPEG-7 semantic description scheme and evaluate them in a digital photo retrieval scenario.
[140]	Mathias Lux, Oge Marques, Klaus Schoeffmann, Laszlo Böszörmenyi, Georg Lajtai, A novel tool for summarization of arthroscopic videos, In Multimedia Tools and Applications, Springer, Berlin, Heidelberg, New York, pp. 521 - 544, 2009. [bib][url] [abstract] Abstract: Arthroscopic surgery is a minimally invasive procedure that uses a small camera to generate video streams, which are recorded and subsequently archived. In this paper we present a video summarization tool and demonstrate how it can be successfully used in the domain of arthroscopic videos. The proposed tool generates a keyframe-based summary, which clusters visually similar frames based on user-selected visual features and appropriate dissimilarity metrics. We discuss how this tool can be used for arthroscopic videos, taking advantage of several domain-specific aspects, without losing its ability to work on general-purpose videos. Experimental results confirm the feasibility of the proposed approach and encourage extending it to other application domains.
[139]	Mathias Lux, Klaus Schoeffmann, Oge Marques, Laszlo Böszörmenyi, A Novel Tool for Quick Video Summarization using Keyframe Extraction Techniques, In 9th Workshop on Multimedia Metadata (WMM'09) (Romulus Grigoras, Vincent Charvillat, Ralf Klamma, Harald Kosch, eds.), http://ceur-ws.org, Aachen, Germany, pp. 62-76, 2009. [bib][url] [abstract] Abstract: The increasing availability of short, unstructured video clips on the Web has generated an unprecedented need to organize, index, annotate and retrieve video contents to make them useful to potential viewers. This paper presents a novel, simple, and easy-to-use tool to benchmark different low level features for video summarization based on keyframe extraction. Moreover, it shows the usefulness of the benchmarking tool by developing hypothesis for a chosen domain through an exploratory study. It discusses the results of exploratory studies involving users and their judgment of what makes the summary generated by the tool a good one.