Go to the main menu
Skip to content
Go to bottom
REFERENCE LINKING PLATFORM OF KOREA S&T JOURNALS
> Journal Vol & Issue
Journal of Broadcast Engineering
Journal Basic Information
Journal DOI :
The Korean Institute of Broadcast and Media Engineers
Editor in Chief :
Volume & Issues
Volume 20, Issue 6 - Nov 2015
Volume 20, Issue 5 - Sep 2015
Volume 20, Issue 4 - Jul 2015
Volume 20, Issue 3 - May 2015
Volume 20, Issue 2 - Mar 2015
Volume 20, Issue 1 - Jan 2015
Selecting the target year
Generation of Stereoscopic Image from 2D Image based on Saliency and Edge Modeling
Kim, Manbae ;
Journal of Broadcast Engineering, volume 20, issue 3, 2015, Pages 368~378
DOI : 10.5909/JBE.2015.20.3.368
3D conversion technology has been studied over past decades and integrated to commercial 3D displays and 3DTVs. The 3D conversion plays an important role in the augmented functionality of three-dimensional television (3DTV), because it can easily provide 3D contents. Generally, depth cues extracted from a static image is used for generating a depth map followed by DIBR (Depth Image Based Rendering) rendering for producing a stereoscopic image. However except some particular images, the existence of depth cues is rare so that the consistent quality of a depth map cannot be accordingly guaranteed. Therefore, it is imperative to make a 3D conversion method that produces satisfactory and consistent 3D for diverse video contents. From this viewpoint, this paper proposes a novel method with applicability to general types of image. For this, saliency as well as edge is utilized. To generate a depth map, geometric perspective, affinity model and binomic filter are used. In the experiments, the proposed method was performed on 24 video clips with a variety of contents. From a subjective test for 3D perception and visual fatigue, satisfactory and comfortable viewing of 3D contents was validated.
A Restoration Method for Geometric Distortions to Improve Scanned Books Identification
Kim, Doyoung ; Lee, Sang-Hoon ; Jadhav, Sagar ; Lee, Sanghoon ;
Journal of Broadcast Engineering, volume 20, issue 3, 2015, Pages 379~387
DOI : 10.5909/JBE.2015.20.3.379
In recent years, copyright violations from illegal copying and distribution of e-comic contents have become an important issue. Fingerprinting techniques have been emerged to provide a fast and reliable identification method of identifying e-comic contents. When illegally scanned or camera captured comic contents are distributed, they suffer from distortions. So the fingerprint differs from the original version. This paper presents a restoration framework for correcting geometric distortions in distorted comics to improve the comic content identification.
Tile-level and Frame-level Parallel Encoding for HEVC
Kim, Younhee ; Seok, Jinwuk ; Jung, Soon-heung ; Kim, Huiyong ; Choi, Jin Soo ;
Journal of Broadcast Engineering, volume 20, issue 3, 2015, Pages 388~397
DOI : 10.5909/JBE.2015.20.3.388
High Efficiency Video Coding (HEVC)/H.265 is a new video coding standard which is known as high compression ratio compared to the previous standard, Advanced Video Coding (AVC)/H.264. Due to achievement of high efficiency, HEVC sacrifices the time complexity. To apply HEVC to the market applications, one of the key requirements is the fast encoding. To achieve the fast encoding, exploiting thread-level parallelism is widely chosen mechanism since multi-threading is commonly supported based on the multi-core computer architecture. In this paper, we implement both the Tile-level parallelism and the Frame-level parallelism for HEVC encoding on multi-core platform. Based on the implementation, we present two approaches in combining the Tile-level parallelism with Frame-level parallelism. The first approach creates the fixed number of tile per frame while the second approach creates the number of tile per frame adaptively according to the number of frame in parallel and the number of available worker threads. Experimental results show that both improves the parallel scalability compared to the one that use only tile-level parallelism and the second approach achieves good trade-off between parallel scalability and coding efficiency for both Full-HD (1080 x 1920) and 4K UHD (3840 x 2160) sequences.
Rate Allocation for Block-based Compressive Sensing
Nguyen, Quang Hong ; Dinh, Khanh Quoc ; Nguyena, Viet Anh ; Trinh, Chien Van ; Park, Younghyeon ; Jeon, Byeungwoo ;
Journal of Broadcast Engineering, volume 20, issue 3, 2015, Pages 398~407
DOI : 10.5909/JBE.2015.20.3.398
Compressive sensing (CS) has drawn much interest as a novel sampling technique that enables sparse signal to be sampled under the Nyquitst/Shannon rate. By noting that the block-based CS can still keep spatial correlation in measurement domain, this paper proposes to adapt sampling rate of each block in frame according to its characteristic defined by edge information. Specifically, those blocks containing more edges are assigned more measurements utilizing block-wise correlation in measurement domain without knowledge about full sampling frame. For natural image, the proposed adaptive rate allocation shows considerable improvement compared with fixed subrate block-based CS in both terms of objective (up to 3.29 dB gain) and subjective qualities.
Vocal Separation Using Selective Frequency Subtraction Considering with Energies and Phases
Kim, Hyuntae ; Park, Jangsik ;
Journal of Broadcast Engineering, volume 20, issue 3, 2015, Pages 408~413
DOI : 10.5909/JBE.2015.20.3.408
Recently, According to increasing interest to original sound Karaoke instrument, MIDI type karaoke manufacturer attempt to make more cheap method instead of original recoding method. The specific method is to make the original sound accompaniment to remove only the voice of the singer in the singer music album. In this paper, a system to separate vocal components from music accompaniment for stereo recordings were proposed. Proposed system consists of two stages. The first stage is a vocal detection. This stage classifies an input into vocal and non vocal portions by using SVM with MFCC. In the second stage, selective frequency subtractions were performed at each frequency bin in vocal portions. In this case, it is determined in consideration not only the energies for each frequency bin but also the phase of the each frequency bin at each channel signal. Listening test with removed vocal music from proposed system show relatively high satisfactory level.
Temporally-Consistent High-Resolution Depth Video Generation in Background Region
Shin, Dong-Won ; Ho, Yo-Sung ;
Journal of Broadcast Engineering, volume 20, issue 3, 2015, Pages 414~420
DOI : 10.5909/JBE.2015.20.3.414
The quality of depth images is important in the 3D video system to represent complete 3D contents. However, the original depth image from a depth camera has a low resolution and a flickering problem which shows vibrating depth values in terms of temporal meaning. This problem causes an uncomfortable feeling when we look 3D contents. In order to solve a low resolution problem, we employ 3D warping and a depth weighted joint bilateral filter. A temporal mean filter can be applied to solve the flickering problem while we encounter a residual spectrum problem in the depth image. Thus, after classifying foreground andbackground regions, we use an upsampled depth image for a foreground region and temporal mean image for background region.Test results shows that the proposed method generates a time consistent depth video with a high resolution.
Implementation of Pedestrian Detection and Tracking with GPU at Night-time
Choi, Beom-Joon ; Yoon, Byung-Woo ; Song, Jong-Kwan ; Park, Jangsik ;
Journal of Broadcast Engineering, volume 20, issue 3, 2015, Pages 421~429
DOI : 10.5909/JBE.2015.20.3.421
This paper is about an approach for pedestrian detection and tracking with infrared imagery. We used the CUDA(Computer Unified Device Architecture) that is a parallel processing language in order to improve the speed of video-based pedestrian detection and tracking. The detection phase is performed by Adaboost algorithm based on Haar-like features. Adaboost classifier is trained with datasets generated from infrared images. After detecting the pedestrian with the Adaboost classifier, we proposed a particle filter tracking strategies on HSV histogram feature that exploit adaptively at the same time. The proposed approach is implemented on an NVIDIA Jetson TK1 developer board that is full-featured device ideal for software development within the Linux environment. In this paper, we presented the results of parallel processing with the NVIDIA GPU on the CUDA development environment for detection and tracking of pedestrians. We compared the object detection and tracking processing time for night-time images on both GPU and CPU. The result showed that the detection and tracking speed of the pedestrian with GPU is approximately 6 times faster than that for CPU.
Violent Behavior Detection using Motion Analysis in Surveillance Video
Kang, Joohyung ; Kwak, Sooyeong ;
Journal of Broadcast Engineering, volume 20, issue 3, 2015, Pages 430~439
DOI : 10.5909/JBE.2015.20.3.430
The demand of violence detection techniques using a video analysis to help prevent crimes is increasing recently. Many researchers have studied vision based behavior recognition but, violent behavior analysis techniques usually focus on violent scenes in television and movie content. Many methods previously published usually used both a color(e.g., skin and blood) and motion information for detecting violent scenes because violences usually involve blood scenes in movies. However, color information (e.g., blood scenes) may not be useful cues for violence detection in surveillance videos, because they are rarely taken in real world situations. In this paper, we propose a method of violent behavior detection in surveillance videos using motion vectors such as flow vector magnitudes and changes in direction except the color information. In order to evaluate the proposed algorithm, we test both USI dataset and various real world surveillance videos from YouTube.
A Pedestrian Collision Warning System using a Fuzzy Logic
Kim, Yang Ho ; Kim, Kwangsoo ; Kwak, Sooyeong ;
Journal of Broadcast Engineering, volume 20, issue 3, 2015, Pages 440~448
DOI : 10.5909/JBE.2015.20.3.440
A pedestrian collision warning system which makes a judgement of pedestrian`s intention to help avoiding hitting accidents is proposed. This system uses the image sequences obtained from a car black box as well as vehicle`s speed obtained from a GPS. It detects pedestrians, if any, based on the Histogram of Gradient method and extracts several information such as the pedestrian`s relative positions, the direction of motion vectors, and distance between vehicle and pedestrian . A fuzzy logic based on these extracted information is applied to analyze the pedestrian`s safety levels. When the safety level is determined to be danger, an alarm is triggered to the driver. The performance of the proposed algorithm is tested under various driving scenarios, which shows it works successfully in real-time.
Object Tracking in HEVC Bitstreams
Park, Dongmin ; Lee, Dongkyu ; Oh, Seoung-Jun ;
Journal of Broadcast Engineering, volume 20, issue 3, 2015, Pages 449~463
DOI : 10.5909/JBE.2015.20.3.44
Video object tracking is important for variety of applications, such as security, video indexing and retrieval, video surveillance, communication, and compression. This paper proposes an object tracking method in HEVC bitstreams. Without pixel reconstruction, motion vector (MV) and size of prediction unit in the bitstream are employed in an Spatio-Temporal Markov Random Fields (ST-MRF) model which represents the spatial and temporal aspects of the object`s motion. Coefficient-based object shape adjustment is proposed to solve the over-segmentation and the error propagation problems caused in other methods. In the experimental results, the proposed method provides on average precision of 86.4%, recall of 79.8% and F-measure of 81.1%. The proposed method achieves an F-measure improvement of up to 9% for over-segmented results in the other method even though it provides only average F-measure improvement of 0.2% with respect to the other method. The total processing time is 5.4ms per frame, allowing the algorithm to be applied in real-time applications.
A Cooperative Communication Technology for Rapid Emergency Alert Broadcast
Song, Mihwa ; Chang, Sekchin ; Lee, Yongtae ;
Journal of Broadcast Engineering, volume 20, issue 3, 2015, Pages 464~467
DOI : 10.5909/JBE.2015.20.3.464
The CBS functionality is defined for emergency alert service in cellular systems. Previously, a cooperative communication technique was proposed for more reliable CBS functionality. However, the relays for cooperative communications may fail in receiving emergency message, which may cause a critical latency. Therefore, we propose a novel cooperative communication technology in this letter, which incudes additional T-DMB functionality in relays. Analytical and empirical evaluations confirm that the proposed method is very suitable for rapid emergency alert broadcast.