REFERENCE LINKING PLATFORM OF KOREA S&T JOURNALS
Journal of Broadcast Engineering
The Korean Institute of Broadcast and Media Engineers
Volume 11, Issue 1 - Mar 2006
A Study on Music Summarization
Kim Sung-Tak ; Kim Sang-Ho ; Kim Hoi-Rin ; Choi Ji-Hoon ; Lee Han-Kyu ; Hong Jin-Woo ;
Journal of Broadcast Engineering, volume 11, issue 1, 2006, Pages 3~14
Music summarization is a technique that automatically generates the most important and representative part or parts of music content. Music summarization techniques have been studied in two categories, according to the characteristics of the summary. The first provides a repeated part as the music summary; the second provides a combination of segments with different characteristics. In this paper, we propose and evaluate two kinds of music summarization techniques. The algorithm using multi-level vector quantization, which provides a repeated part as the music summary, is evaluated for fixed-length summaries by the overlapping ratio between hand-made repeated parts and the automatically generated summary. The overlapping ratios of conventional methods are 42.2% and 47.4%, whereas that of the proposed method with a fixed-length summary is 67.1%. The optimal-length music summary is evaluated by the portion of overlap between the summary and the repeated part, whose length differs according to the music content; the result shows that the automatically generated optimal-length summary expresses a more effective part than the fixed-length summary. The cluster-based algorithm using a 2-D similarity matrix and the k-means algorithm provides combined segments as the music summary. To evaluate this algorithm, we use a MOS test consisting of two questions (How many similar segments are in the summarized music? How many segments are included in the same structure?), and the results show good performance.
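The abstract does not give the exact definition of the overlapping ratio, so the following is only a minimal sketch of one plausible reading: the fraction of the generated summary interval that falls inside hand-labelled repeated parts. The function name `overlap_ratio`, the use of seconds, and the assumption that reference intervals do not overlap each other are all illustrative choices, not taken from the paper.

```python
def overlap_ratio(summary, reference_parts):
    """Fraction of the summary interval covered by reference intervals.

    summary: (start, end) of the generated summary, in seconds.
    reference_parts: list of (start, end) hand-labelled repeated parts,
        assumed here not to overlap each other.
    """
    s_start, s_end = summary
    length = s_end - s_start
    if length <= 0:
        raise ValueError("summary must have positive length")

    covered = 0.0
    for r_start, r_end in reference_parts:
        # Length of the intersection of the two intervals (0 if disjoint).
        covered += max(0.0, min(s_end, r_end) - max(s_start, r_start))
    return covered / length


# A 20-second summary, 15 seconds of which lie inside labelled repeats.
print(overlap_ratio((30, 50), [(20, 40), (45, 60)]))
```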
Exploiting Person-identity Features for Person-based Photo Indexing
Yang Seung-Ji ; Seo Kyong-Sok ; Ro Yong-Man ; Kim Sang-Kyun ;
Journal of Broadcast Engineering, volume 11, issue 1, 2006, Pages 15~27
In this paper, a novel approach is presented to facilitate the browsing of large collections of digital photos associated with specified person(s). The goal of the proposed method is to exploit additional person-identity features by incorporating facial regions and the peripheral clothes regions associated with them. For more effective incorporation of the clothes and facial features, situation-based photo clustering is also proposed. To evaluate the efficacy of the proposed method, an experiment was performed with 1120 generic home photos. The experimental results showed that the proposed method outperformed the conventional method using only face features, with an average performance of about 92% compared with about 70% for the conventional method.
A Method of Generating Table-of-Contents for Educational Video
Lee Gwang-Gook ; Kang Jung-Won ; Kim Jae-Gon ; Kim Whoi-Yul ;
Journal of Broadcast Engineering, volume 11, issue 1, 2006, Pages 28~41
Due to the rapid development of multimedia appliances, the increasing amount of multimedia data calls for automatic video analysis techniques. In this paper, a method of ToC generation is proposed for educational video content. The proposed method consists of two parts: scene segmentation followed by scene annotation. First, the video sequence is divided into scenes by the proposed scene segmentation algorithm, which utilizes the characteristics of educational video. Then each shot in a scene is annotated in terms of scene type, the existence of enclosed captions and the main speaker of the shot. The ToC generated by the proposed method represents the structure of a video as a hierarchy of scenes and shots and describes each scene and shot by the extracted features. Hence the generated ToC can help users to perceive the content of a video at a glance and to access a desired position in the video easily. Also, the ToC generated automatically by the system can be further edited manually for refinement, which effectively reduces the time required to achieve a more detailed description of the video content. The experimental results showed that the proposed method can generate ToCs for educational video with high accuracy.
Ontology for Semantic Retrieval of MPEG-7 MDS and TV-Anytime Multimedia Data
Song Chull-Hwan ; Yoo Seong-Joon ;
Journal of Broadcast Engineering, volume 11, issue 1, 2006, Pages 42~53
This paper describes how to compose a multimedia ontology for integrating and searching different types of multimedia databases. For this, we build an integrated ontology based on MPEG-7 Multimedia Description Schemes (MDS), which is a representative standard for specifying multimedia content, and on the concept of TV-Anytime, and re-express it using the Web Ontology Language (OWL). In addition, we explain the interoperability of the developed integrated ontology with other types of ontology based on different concepts. Lastly, this paper describes a method of semantic search and retrieval using the integrated multimedia ontology.
Multiple ROI Support in the Scalable Video Coding
Bae Tae-Meon ; Kim Duck-Yeon ; Thang Truong Cong ; Ro Yong-Man ; Kang Jung-Won ; Kim Jae-Gon ;
Journal of Broadcast Engineering, volume 11, issue 1, 2006, Pages 54~65
In this paper, we propose a new functionality for Scalable Video Coding (SVC): support of multiple ROIs for heterogeneous display resolutions. Scalable video coding is targeted at giving temporal, spatial, and quality scalability to the encoded bitstream. A region of interest (ROI) is an area that is semantically important to a particular user, especially users with heterogeneous display resolutions. The bitstream containing the ROIs can be extracted without any transcoding operations, which may be one way to satisfy QoS. To define multiple ROIs in SVC, we adapted FMO, a tool defined in H.264, and based on it we propose a way to encode and decode ROIs. The proposed method is implemented on JSVM 1.0, and the functionality is verified using it.
Implementation of Software Downloading and Installing for upgrading Digital TV Settop Box
Ryu Yll-Kwon ; Jung Moon-Ryul ; Kim Jung-Hwan ; Choi Jin-Su ; Bang Gun ;
Journal of Broadcast Engineering, volume 11, issue 1, 2006, Pages 66~79
With the constant development of digital broadcasting and data broadcasting systems, new technologies will be introduced and new broadcasting services will appear. Receivers must adapt to each of these services, so the receiver software needs to be upgraded. In general, however, digital broadcasting receivers are not updated for a long time once they have been delivered to the home; upgrading has to be done by hand, or the receivers have to be collected one by one, which is troublesome. This paper therefore suggests a way to overcome these difficulties via the broadcasting stream. It describes three modules: (1) the Downloader, which downloads new software from the data carousel stream; (2) the Update Loader, which installs the software received by the Downloader; and (3) the Recoverer, which restores the former version of the software if a serious problem occurs during downloading or installation. The implementation aims to let a terrestrial STB accommodate new techniques and services in accordance with ATSC A-97.
Gaze Tracking System Using Feature Points of Pupil and Glints Center
Park Jin-Woo ; Kwon Yong-Moo ; Sohn Kwang-Hoon ;
Journal of Broadcast Engineering, volume 11, issue 1, 2006, Pages 80~90
A simple 2D gaze tracking method using a single camera and Purkinje images is proposed. The method employs a single camera with an infrared filter to capture one eye and two infrared light sources to create reflection points, from which the corresponding gaze point on the screen is estimated. The camera, the infrared light sources and the user's head can be moved slightly. The system is therefore simple and flexible, requiring neither inconvenient fixed equipment nor a fixed head position. The system also includes a simple and accurate personal calibration procedure. Before using the system, each user only has to stare at two target points for a few seconds so that the system can initialize the user-specific factors of the estimation algorithm. The proposed system works in real time, providing over 10 frames per second at XGA resolution. Test results for nine objects with three subjects show that the system achieves an average estimation error of less than 1 degree.
Pre-processing of Depth map for Multi-view Stereo Image Synthesis
Seo Kwang-Wug ; Han Chung-Shin ; Yoo Ji-Sang ;
Journal of Broadcast Engineering, volume 11, issue 1, 2006, Pages 91~99
Pre-processing is an image processing technique used to enhance image quality or to convert a given image into another form suited to a specific purpose. An 8-bit depth map obtained by a depth camera usually contains many noise components caused by the characteristics of the depth camera, and its edges are more distorted by the quality of the source object and the illumination conditions than the edges in the RGB texture image. Noise-removing filters are used to reduce this distortion, but they can only reduce noise components, so the distorted edges of the depth map cannot be properly recovered. In this paper, we propose an algorithm that reduces noise components and also enhances the quality of the depth-map edges by using the edges in the RGB texture. Consequently, we can reduce errors in the multi-view stereo image synthesis process.
A Study of Spreading code for Watermarking of TxID of ATSC-DTV
Cha Jae-Sang ; Choi Seong-Jin ; Lee Seon-Hee ; Lee Kwang-Jik ; Lee Yong-Tae ; Park Goo-Man ;
Journal of Broadcast Engineering, volume 11, issue 1, 2006, Pages 100~106
In this paper, we propose a TxID (Transmitter Identification) system using ZCD (Zero Correlation Duration) codes as interference-cancelled spreading codes for single frequency networks of ATSC terrestrial DTV broadcasting. ZCD codes are simple to implement and have ZCD characteristics. The proposed system can therefore fundamentally solve the interference problems caused by ICI (Inter Code Interference) and MPI (Multi-Path Interference) that occur in TxID systems using spread coding. The codes can also cancel the truncation error that occurs in watermarking for TxID, since they can remove its period. We verify the feasibility of the TxID system applying interference-cancelled ZCD codes by analyzing the DER (Detection Error Rate), simulated in various interference environments such as AWGN, ICI, and MPI.
Development of Multi-applications for Data Broadcasting
Kim Hyun-Soon ; Kwon Jae-Kwang ; Kang Dae-Kap ;
Journal of Broadcast Engineering, volume 11, issue 1, 2006, Pages 107~115
In a single-application data broadcasting environment, only one application can be serviced at a specific time on one channel. To overcome this, we developed a structure and method of operation for multi-applications that are fully conformant to ACAP (Advanced Common Application Platform), and modified the data broadcasting system to support multi-applications. In a multi-application environment, broadcasters can service multiple applications simultaneously at a specific airtime on one channel, so users can enjoy services selectively according to their preferences. In this paper, we present an example of a multi-application service that was developed for experimentation before being serviced to users on the air. The core of the multi-application is a manager application that manages the other, ordinary applications, so we describe the function and structure of the manager application and then present experimental results showing that the proposed method is a proper model for multi-applications.
Multi-View Video System using Single Encoder and Decoder
Kim Hak-Soo ; Kim Yoon ; Kim Man-Bae ;
Journal of Broadcast Engineering, volume 11, issue 1, 2006, Pages 116~129
The progress of data transmission technology through the Internet has spread a variety of realistic content. One example is multi-view video, which is acquired from multiple camera sensors. In general, multi-view video processing requires as many encoders and decoders as there are cameras, and the resulting processing complexity makes practical implementation difficult. To solve this problem, this paper considers a simple multi-view system utilizing a single encoder and a single decoder. On the encoder side, the input multi-view YUV sequences are combined in GOP units by a video mixer. The mixed sequence is then compressed by a single H.264/AVC encoder. The decoding side is composed of a single decoder and a scheduler controlling the decoding process. The goal of the scheduler is to assign an approximately identical number of decoded frames to each view sequence by estimating the decoder utilization of a GOP and subsequently applying frame skip algorithms. Furthermore, for the frame skip, efficient frame selection algorithms are studied for the H.264/AVC baseline and main profiles, based on a cost function related to perceived video quality. The proposed method has been tested on various multi-view test sequences adopted by MPEG 3DAV. Experimental results show that approximately identical decoder utilization is achieved for each view sequence, so that each view sequence is fairly displayed. In addition, the performance of the proposed method is examined in terms of bit rate and PSNR using a rate-distortion curve.
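The combination of view sequences in GOP units can be pictured with a small sketch. The abstract does not specify the ordering used by the video mixer, so the round-robin order below, the function name `mix_views_by_gop`, and the representation of frames as list items are all assumptions made for illustration.

```python
def mix_views_by_gop(views, gop_size):
    """Interleave frames from several views, one GOP at a time.

    views: list of frame sequences (one list per camera view).
    gop_size: number of frames per GOP.
    Views are truncated to the shortest one so every view contributes
    the same number of frames (an assumption of this sketch).
    """
    if gop_size <= 0:
        raise ValueError("gop_size must be positive")
    n_frames = min(len(v) for v in views)

    mixed = []
    for start in range(0, n_frames, gop_size):
        end = min(start + gop_size, n_frames)
        # Round-robin over views: GOP k of view 0, GOP k of view 1, ...
        for view in views:
            mixed.extend(view[start:end])
    return mixed


view_a = ["a0", "a1", "a2", "a3"]
view_b = ["b0", "b1", "b2", "b3"]
print(mix_views_by_gop([view_a, view_b], gop_size=2))
```

The single mixed sequence is what would then be passed to one encoder; a decoder-side scheduler would need the same ordering to route decoded frames back to their views.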
A Study on Preference of the Consumer for Multimedia Contents Service in Communication and Broadcasting Convergent Environment
Kim Kwang-Yong ; Kim Jae-Gon ; Han Hyun-Soo ; Park Sun-Young ;
Journal of Broadcast Engineering, volume 11, issue 1, 2006, Pages 130~137
In this paper, we introduce technologies, which we call UCA (Ubiquitous Contents Access) for short, that deliver multimedia content to end consumers with various communication terminals in a communication and broadcasting convergence environment, and we propose a system to which these technologies are applied. We also analyze the relative advantage and playfulness of the proposed service technology from the consumer's point of view. The analysis suggests that the proposed communication and broadcasting convergence service model is a worthwhile service that consumers are likely to prefer.
Computational Efficiency of Resamplers in Multi-Stage Structure
Kim Rin-Chul ;
Journal of Broadcast Engineering, volume 11, issue 1, 2006, Pages 138~141
This paper evaluates the computational efficiency of multi-stage sample-rate converters with rational factors in terms of memory requirements and multiplications per second. We describe the resolution-preserving and mutually prime conditions, and then present a method for designing the converter that directly yields optimal rational-valued conversion factors for each stage. As an example, we show an implementation of a 44.1-to-48 kHz sample-rate converter in a 2-stage structure.
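To make the 44.1-to-48 kHz example concrete, the sketch below reduces the overall conversion factor to lowest terms and enumerates the ways it can be split into two rational stages. It does not implement the paper's optimality criterion or its resolution-preserving condition; the function names and the brute-force enumeration are assumptions for illustration only.

```python
from fractions import Fraction


def conversion_ratio(f_in, f_out):
    """Overall up/down factor L/M in lowest terms."""
    return Fraction(f_out, f_in)


def divisors(n):
    """All positive divisors of n."""
    return [d for d in range(1, n + 1) if n % d == 0]


def two_stage_splits(ratio):
    """All ordered pairs (stage1, stage2) of rational factors whose
    product equals `ratio`, excluding trivial stages equal to 1."""
    up, down = ratio.numerator, ratio.denominator
    splits = []
    for up1 in divisors(up):
        for down1 in divisors(down):
            stage1 = Fraction(up1, down1)
            stage2 = Fraction(up // up1, down // down1)
            if stage1 != 1 and stage2 != 1:
                splits.append((stage1, stage2))
    return splits


ratio = conversion_ratio(44100, 48000)
print(ratio)  # 160/147
splits = two_stage_splits(ratio)
print((Fraction(4, 3), Fraction(40, 49)) in splits)  # True
```

Because 160 and 147 share no common factor, every per-stage factor built from their divisors is already in lowest terms; choosing among the candidates by memory and multiplication cost is the part the paper addresses.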