• Title/Summary/Keyword: Music Score Recognition

Search Result 27, Processing Time 0.029 seconds

Optical Music Score Recognition System for Smart Mobile Devices

  • Han, SeJin;Lee, GueeSang
    • International Journal of Contents
    • /
    • v.10 no.4
    • /
    • pp.63-68
    • /
    • 2014
  • In this paper, we propose a smart system that can optically recognize a music score within a document and can play the music after recognition. Many historic handwritten documents have now been digitalized. Converting images of a music score within documents into digital files is particularly difficult and requires considerable resources because a music score consists of a 2D structure with both staff lines and symbols. The proposed system takes an input image using a mobile device equipped with a camera module, and the image is optimized via preprocessing. Binarization, music sheet correction, staff line recognition, vertical line detection, note recognition, and symbol recognition processing are then applied, and a music file is generated in an XML format. The Music XML file is recorded as digital information, and based on that file, we can modify the result, logically correct errors, and finally generate a MIDI file. Our system reduces misrecognition, and a wider range of music score can be recognized because we have implemented distortion correction and vertical line detection. We show that the proposed method is practical, and that is has potential for wide application through an experiment with a variety of music scores.

The recognition of Printed Music Score and Performance Using Computer Vision system (컴퓨터 비젼 시스템에 의한 인쇄악보의 인식과 연주)

  • 이명우;최종수
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.22 no.5
    • /
    • pp.10-16
    • /
    • 1985
  • In this paper, a computer vision system, which catches printed music score image using CCTV camera and microcomputer, and then recognizes the image and performs tar music with speaker, is discussed. Integral projection method is adopted for feature detection and recognition of the music score image. The range of recognition is con(ined to staffs, perpen-dicular lines and musical notes including chord notes among the various kinds of elements of music score. The practical recognition algorithm considering noises, the preprocessing processes getting rid of noises are also showed, and simple hardware system playing chord is made, In the results, good recognition ratio and performance are obtained.

  • PDF

Design of Music Learning Assistant Based on Audio Music and Music Score Recognition

  • Mulyadi, Ahmad Wisnu;Machbub, Carmadi;Prihatmanto, Ary S.;Sin, Bong-Kee
    • Journal of Korea Multimedia Society
    • /
    • v.19 no.5
    • /
    • pp.826-836
    • /
    • 2016
  • Mastering a musical instrument for an unskilled beginning learner is not an easy task. It requires playing every note correctly and maintaining the tempo accurately. Any music comes in two forms, a music score and it rendition into an audio music. The proposed method of assisting beginning music players in both aspects employs two popular pattern recognition methods for audio-visual analysis; they are support vector machine (SVM) for music score recognition and hidden Markov model (HMM) for audio music performance tracking. With proper synchronization of the two results, the proposed music learning assistant system can give useful feedback to self-training beginners.

Camera-based Music Score Recognition Using Inverse Filter

  • Nguyen, Tam;Kim, SooHyung;Yang, HyungJeong;Lee, GueeSang
    • International Journal of Contents
    • /
    • v.10 no.4
    • /
    • pp.11-17
    • /
    • 2014
  • The influence of acquisition environment on music score images captured by a camera has not yet been seriously examined. All existing Optical Music Recognition (OMR) systems attempt to recognize music score images captured by a scanner under ideal conditions. Therefore, when such systems process images under the influence of distortion, different viewpoints or suboptimal illumination effects, the performance, in terms of recognition accuracy and processing time, is unacceptable for deployment in practice. In this paper, a novel, lightweight but effective approach for dealing with the issues caused by camera based music scores is proposed. Based on the staff line information, musical rules, run length code, and projection, all regions of interest are determined. Templates created from inverse filter are then used to recognize the music symbols. Therefore, all fragmentation and deformation problems, as well as missed recognition, can be overcome using the developed method. The system was evaluated on a dataset consisting of real images captured by a smartphone. The achieved recognition rate and processing time were relatively competitive with state of the art works. In addition, the system was designed to be lightweight compared with the other approaches, which mostly adopted machine learning algorithms, to allow further deployment on portable devices with limited computing resources.

Score Image Retrieval to Inaccurate OMR performance

  • Kim, Haekwang
    • Journal of Broadcast Engineering
    • /
    • v.26 no.7
    • /
    • pp.838-843
    • /
    • 2021
  • This paper presents an algorithm for effective retrieval of score information to an input score image. The originality of the proposed algorithm is that it is designed to be robust to recognition errors by an OMR (Optical Music Recognition), while existing methods such as pitch histogram requires error induced OMR result be corrected before retrieval process. This approach helps people to retrieve score without training on music score for error correction. OMR takes a score image as input, recognizes musical symbols, and produces structural symbolic notation of the score as output, for example, in MusicXML format. Among the musical symbols on a score, it is observed that filled noteheads are rarely detected with errors with its simple black filled round shape for OMR processing. Barlines that separate measures also strong to OMR errors with its long uniform length vertical line characteristic. The proposed algorithm consists of a descriptor for a score and a similarity measure between a query score and a reference score. The descriptor is based on note-count, the number of filled noteheads in a measure. Each part of a score is represented by a sequence of note-count numbers. The descriptor is an n-gram sequence of the note-count sequence. Simulation results show that the proposed algorithm works successfully to a certain degree in score image-based retrieval for an erroneous OMR output.

A Lightweight and Effective Music Score Recognition on Mobile Phones

  • Nguyen, Tam;Lee, Gueesang
    • Journal of Information Processing Systems
    • /
    • v.11 no.3
    • /
    • pp.438-449
    • /
    • 2015
  • Recognition systems for scanned or printed music scores that have been implemented on personal computers have received attention from numerous scientists and have achieved significant results over many years. A modern trend with music scores being captured and played directly on mobile devices has become more interesting to researchers. The limitation of resources and the effects of illumination, distortion, and inclination on input images are still challenges to these recognition systems. In this paper, we introduce a novel approach for recognizing music scores captured by mobile cameras. To reduce the complexity, as well as the computational time of the system, we grouped all of the symbols extracted from music scores into ten main classes. We then applied each major class to SVM to classify the musical symbols separately. The experimental results showed that our proposed method could be applied to real time applications and that its performance is competitive with other methods.

Decision-Tree Algorithm for Recognition of Music Score Images Obtained by Mobile Phone Camera (휴대폰 카메라로 촬영한 악보 영상 인식을 위한 의사트리 알고리즘)

  • Park, Keon-Hee;Oh, Sung-Ryul;Son, Hwa-Jeong;Yoo, Jae-Myeong;Kim, Soo-Hyung;Lee, Guee-Sang
    • The Journal of the Korea Contents Association
    • /
    • v.8 no.6
    • /
    • pp.16-25
    • /
    • 2008
  • Today, mobile phone is a necessity of modern life. For that reason, we suggest a particular system of a mobile phone which take a picture of music score image and automatically play it without any technical knowledges about the music score information. This experiment makes midi, acknowleging separate symbols via preprocessing to music score image taken. This paper utilizes 11 sorts of the score image taken by a mobile phone camera for this experiment. Through this method we suggest, as much as 98% on average takes place, which is very high recognizing ratio. Also, as we introduce this system in a mobile phone by porting, it takes 8.63 seconds on average to create midi following input of images.

Conversion Program of Music Score Chord using OpenCV and Deep Learning (영상 처리와 딥러닝을 이용한 악보 코드 변환 프로그램)

  • Moon, Ji-su;Kim, Min-ji;Lim, Young-kyu;Kong, Ki-sok
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.21 no.1
    • /
    • pp.69-77
    • /
    • 2021
  • This paper deals with the development of an application that converts the PDF music score entered by the user into a MIDI file of the chord the user wants. This application converts the PDF file into a PNG file for chord conversion when the user enters the PDF music score file and the chord which the user wants to change. After recognizing the melody of sheet music through image processing algorithm and recognizing the tempo of sheet music notes through deep learning, then the MIDI file of chord for existing sheet music is produced. The OpenCV algorithm and deep learning can recognize minim note, quarter note, eighth note, semi-quaver note, half rest, eighth rest, quarter rest, semi-quaver rest, successive notes and chord notes. The experiment shows that the note recognition rate of the music score was 100% and the tempo recognition rate was 90% or more.

Super-resolution in Music Score Images by Instance Normalization

  • Tran, Minh-Trieu;Lee, Guee-Sang
    • Smart Media Journal
    • /
    • v.8 no.4
    • /
    • pp.64-71
    • /
    • 2019
  • The performance of an OMR (Optical Music Recognition) system is usually determined by the characterizing features of the input music score images. Low resolution is one of the main factors leading to degraded image quality. In this paper, we handle the low-resolution problem using the super-resolution technique. We propose the use of a deep neural network with instance normalization to improve the quality of music score images. We apply instance normalization which has proven to be beneficial in single image enhancement. It works better than batch normalization, which shows the effectiveness of shifting the mean and variance of deep features at the instance level. The proposed method provides an end-to-end mapping technique between the high and low-resolution images respectively. New images are then created, in which the resolution is four times higher than the resolution of the original images. Our model has been evaluated with the dataset "DeepScores" and shows that it outperforms other existing methods.

Design and Implementation of the Effective Staff-Line Recognition Using Tilt-Correction Through Preview Analysis (프리뷰 분석에 기반한 악보 기울기 보정을 통한 효과적인 오선 인식 기법의 설계 및 구현)

  • Kim, Seongryong;Kim, Taehee;Kim, Misun;Lee, Boram;Kim, Geunjeoung;Lee, Sangjun
    • Journal of IKEEE
    • /
    • v.18 no.3
    • /
    • pp.362-367
    • /
    • 2014
  • Music score recognition applications running on a smartphone, which is one of the necessities of modern people, have already been released on the market. These applications have the several limitations, especially the recognition rate of printed music scores is low so that many errors occur when the score is played. The major factor to decrease the recognition rate comes from poor tilt-correction of the captured staff-line. In this paper, we propose a efficient method that can automatically shoot the printed music score through preview analysis, which increases the recognition rate via tilt-correction.