• Title, Summary, Keyword: Parallel Computing

Search Result 721, Processing Time 0.033 seconds

Development of Virtual Parallel Processing System for Flexible Task Allocation on the Web (웹 환경에서 유연성 있는 작업 할당을 위한 가상 병렬 처리 시스템 개발)

  • 정권호;송은하;정영식
    • Journal of Korea Multimedia Society
    • /
    • v.3 no.3
    • /
    • pp.320-332
    • /
    • 2000
  • Web consists of the grand virtual system which is made of all connected computers network. We can solve the huge problem which requires high quality in cost performance and powerful computing power to use a numerous idle state system on internet as process it parallel. However, we have to consider heterogeneous computing resources, accessibility, and reliability to carry out parallel system on global environment, not network but whole Internet. In this paper, We the WebImg system which has the power of web computing, and show the flexible task allocation strategy in heterogeneous hosts. Also, we evaluate its performance, moreover the proposed task allocation strategy supplies fault tolerance by controlling host situation at any time.

  • PDF

Design and Analysis of User's Libraries for Parallel Computing based on the Internet (인터넷 기반의 병렬 컴퓨팅을 위한 사용자 라이브러리 설계 및 성능 분석)

  • Sin, Pil-Seop;Jeong, Jun-Mok;Maeng, Hye-Seon;Hong, Won-Gi;Kim, Sin-Deok
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.11
    • /
    • pp.2932-2945
    • /
    • 1999
  • As the Internet and Java technology have been growing up, parallel processing approach to utilize those idle resources connected to the Internet has become quite attractive. In this paper, JICE(Java Internet Computing Environment) was implemented as a parallel computing platform based on the Internet using multithreading and RMI mechanisms provided by Java. The basic model of JICE is constructed as three components, such as a client, a set of workers, and a broker. A worker communicates with other workers via a globally shared memory system. It provides users with master-slave programming model and a collection of library functions. The basic model of JICE is also extended as a multimanaging system. This multimanaging system is evaluated by analysis to show its effectiveness. According to numerical analysis and experiments with several benchmarks, it is shown that the performance of basic model depends on the shared memory reference ratio and user's library is a quite promising.

  • PDF

Nonlinear Shell Finite Element and Parallel Computing Algorithm for Aircraft Wing-box Structural Analysis (항공기 Wing-box 구조해석을 위한 비선형 쉘 유한요소 및 병렬계산 기법 개발)

  • Kim, Hyejin;Kim, Seonghwan;Hong, Jiwoo;Cho, Haeseong
    • Journal of the Korean Society for Aeronautical & Space Sciences
    • /
    • v.48 no.8
    • /
    • pp.565-571
    • /
    • 2020
  • In this paper, precision and efficient nonlinear structural analysis for the aircraft wing-box model is developed. Herein, nonlinear shell element based on the co-rotational (CR) formulation is implemented. Then, parallel computing algorithm, the element-based partitioning technique is developed to accelerate the computational efficiency of the nonlinear structural analysis. Finally, computational performance, i.e., accuracy and efficiency, of the proposed analysis is evaluated by comparing with that of the existing commercial software.

Research of accelerating method of video quality measurement program using GPGPU (GPGPU를 이용한 영상 품질 측정 프로그램의 가속화 연구)

  • Lee, Seonguk;Byeon, Gibeom;Kim, Kisu;Hong, Jiman
    • Smart Media Journal
    • /
    • v.5 no.4
    • /
    • pp.69-74
    • /
    • 2016
  • Recently, parallel computing using GPGPU(General-Purpose computing on Graphics Processing Units) according to the development of the graphics processing unit is expanding. This can be achieved through the processing speeds faster than traditional computing environments across many fields, including science, medicine, engineering, and analysis. However, in using the GPU technology to implement the a parallel program there are many constraints. In this paper, we port a CPU-based program(Video Quality Measurement Program) to use technology. The program ported to GPU-based show about 1.83 times the execution speed than CPU-based program. We study on the acceleration of the GPU-based program. Also we discuss the technical constraints and problems that occur when you modify the CPU to the GPU-based programs.

A Distributed Electrical Impedance Tomography Algorithm for Real-Time Image Reconstruction (실시간 영상 복원을 위한 분산 전기단층촬영 알고리즘)

  • Junghoon Lee;Gyunglin Park
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.10 no.1
    • /
    • pp.25-36
    • /
    • 2004
  • This paper proposes and measures the performance of a distributed EIT (Electrical Impedance Tomography) image reconstruction algorithm which has a master-slave structure. The image construction is a computation based application of which the execute time is proportional to the cube of the unknowns. After receiving a specific frame from the master, each computing node extracts the basic elements by executing the first iteration of Kalman Filter in parallel. Then the master merges the basic element lists into one group and then performs the sequential iterations with the reduced number of unknowns. Every computing node has MATLAB functions as well as extended library implemented for the exchange of MATLAB data structure. The master implements another libraries such as threaded multiplication, partitioned inverse, and fast Jacobian to improve the speed of the serial execution part. The parallel library reduces the reconstruction time of image visualization about by half, while the distributed grouping scheme further reduces by about 12 times for the given target object when there are 4 computing nodes.

Thread-Level Parallelism using Java Thread and Network Resources (자바 스레드와 네트워크 자원을 이용한 병렬처리)

  • Kim, Tae-Yong
    • Journal of Advanced Navigation Technology
    • /
    • v.14 no.6
    • /
    • pp.984-989
    • /
    • 2010
  • In this paper, parallel programming technique by using Java Thread is introduced so as to develop parallel design tool to analyze the small micro flow sensor. To estimate computing time for Thread-level parallelism, the performances of two experimental models for potential problem subject to Thermal transfer equation are examined. As a result, if the number of network PC is increase, computing time for parallelism on network environment is enhanced to be almost n times. The micro sensor design tool based on distributed computing can be utilized to analyze a large scale problem.

A PRICING METHOD OF HYBRID DLS WITH GPGPU

  • YOON, YEOCHANG;KIM, YONSIK;BAE, HYEONG-OHK
    • Journal of the Korean Society for Industrial and Applied Mathematics
    • /
    • v.20 no.4
    • /
    • pp.277-293
    • /
    • 2016
  • We develop an efficient numerical method for pricing the Derivative Linked Securities (DLS). The payoff structure of the hybrid DLS consists with a standard 2-Star step-down type ELS and the range accrual product which depends on the number of days in the coupon period that the index stay within the pre-determined range. We assume that the 2-dimensional Geometric Brownian Motion (GBM) as the model of two equities and a no-arbitrage interest model (One-factor Hull and White interest rate model) as a model for the interest rate. In this study, we employ the Monte Carlo simulation method with the Compute Unified Device Architecture (CUDA) parallel computing as the General Purpose computing on Graphic Processing Unit (GPGPU) technology for fast and efficient numerical valuation of DLS. Comparing the Monte Carlo method with single CPU computation or MPI implementation, the result of Monte Carlo simulation with CUDA parallel computing produces higher performance.

Analytical fragility curves of a structure subject to tsunami waves using smooth particle hydrodynamics

  • Sihombing, Fritz;Torbol, Marco
    • Smart Structures and Systems
    • /
    • v.18 no.6
    • /
    • pp.1145-1167
    • /
    • 2016
  • This study presents a new method to computes analytical fragility curves of a structure subject to tsunami waves. The method uses dynamic analysis at each stage of the computation. First, the smooth particle hydrodynamics (SPH) model simulates the propagation of the tsunami waves from shallow water to their impact on the target structure. The advantage of SPH over mesh based methods is its capability to model wave surface interaction when large deformations are involved, such as the impact of water on a structure. Although SPH is computationally more expensive than mesh based method, nowadays the advent of parallel computing on general purpose graphic processing unit overcome this limitation. Then, the impact force is applied to a finite element model of the structure and its dynamic non-linear response is computed. When a data-set of tsunami waves is used analytical fragility curves can be computed. This study proves it is possible to obtain the response of a structure to a tsunami wave using state of the art dynamic models in every stage of the computation at an affordable cost.

CMOS-Memristor Hybrid 4-bit Multiplier Circuit for Energy-Efficient Computing

  • Vo, Huan Minh;Truong, Son Ngoc;Shin, Sanghak;Min, Kyeong-Sik
    • Journal of IKEEE
    • /
    • v.18 no.2
    • /
    • pp.228-233
    • /
    • 2014
  • In this paper, we propose a CMOS-memristor hybrid circuit that can perform 4-bit multiplication for future energy-efficient computing in nano-scale digital systems. The proposed CMOS-memristor hybrid circuit is based on the parallel architecture with AND and OR planes. This parallel architecture can be very useful in improving the power-delay product of the proposed circuit compared to the conventional CMOS array multiplier. Particularly, from the SPECTRE simulation of the proposed hybrid circuit with 0.13-mm CMOS devices and memristors, this proposed multiplier is estimated to have better power-delay product by 48% compared to the conventional CMOS array multiplier. In addition to this improvement in energy efficiency, this 4-bit multiplier circuit can occupy smaller area than the conventional array multiplier, because each cross-point memristor can be made only as small as $4F^2$.

Optimal Design of Permanent Magnet Wind Generator for Maximum Annual Energy Production (최대 연간 에너지 생산을 위한 영구자석형 풍력발전기의 최적설계)

  • Jung, Ho-Chang;Jung, Sang-Yong;Hahn, Sung-Chin;Lee, Cheol-Gyun
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.56 no.12
    • /
    • pp.2109-2115
    • /
    • 2007
  • The wind generators have been installed with high output power to increase the energy production and efficiency. Hence, Optimal design of the direct-driven PM wind generator, coupled with F.E.M(Finite Element Method) and Genetic Algorithm(GA), has been performed to maximize the Annual Energy Production(AEP) over the whole wind speed characterized by the statistical model of wind speed distribution. Particularly, the parallel computing via internet web service has been applied to loose excessive computing times for optimization. The results of the optimal design of Surface-Mounted Permanent Magnet Synchronous Generator(SPMSG) are compared with each other candidates to verify the usefulness of the maximizing AEP model.