• Title/Summary/Keyword: Patent Data

Search Result 560, Processing Time 0.209 seconds

Big Data Smoothing and Outlier Removal for Patent Big Data Analysis

  • Choi, JunHyeog;Jun, Sunghae
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.8
    • /
    • pp.77-84
    • /
    • 2016
  • In general statistical analysis, we need to make a normal assumption. If this assumption is not satisfied, we cannot expect a good result of statistical data analysis. Most of statistical methods processing the outlier and noise also need to the assumption. But the assumption is not satisfied in big data because of its large volume and heterogeneity. So we propose a methodology based on box-plot and data smoothing for controling outlier and noise in big data analysis. The proposed methodology is not dependent upon the normal assumption. In addition, we select patent documents as target domain of big data because patent big data analysis is a important issue in management of technology. We analyze patent documents using big data learning methods for technology analysis. The collected patent data from patent databases on the world are preprocessed and analyzed by text mining and statistics. But the most researches about patent big data analysis did not consider the outlier and noise problem. This problem decreases the accuracy of prediction and increases the variance of parameter estimation. In this paper, we check the existence of the outlier and noise in patent big data. To know whether the outlier is or not in the patent big data, we use box-plot and smoothing visualization. We use the patent documents related to three dimensional printing technology to illustrate how the proposed methodology can be used for finding the existence of noise in the searched patent big data.

A Novel Classification Model for Efficient Patent Information Research (효율적인 특허정보 조사를 위한 분류 모형)

  • Kim, Youngho;Park, Sangsung;Jang, Dongsik
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.15 no.4
    • /
    • pp.103-110
    • /
    • 2019
  • A patent contains detailed information of the developed technology and is published to the public. Thus, patents can be used to overcome the limitations of traditional technology trend research and prediction techniques. Recently, due to the advantages of patented analytical methodology, IP R&D is carried out worldwide. The patent is big data and has a huge amount, various domains, and structured and unstructured data characteristics. For this reason, there are many difficulties in collecting and researching patent information. Patent research generally writes the Search formula to collect patent documents from DB. The collected patent documents contain some noise patents that are irrelevant to the purpose of analysis, so they are removed. However, eliminating noise patents is a manual task of reading and classifying technology, which is time consuming and expensive. In this study, we propose a model that automatically classifies The Noise patent for efficient patent information research. The proposed method performs Patent Embedding using Word2Vec and generates Noise seed label. In addition, noise patent classification is performed using the Random forest. The experimental data is published and registered with the USPTO among the patents related to Ocean Surveillance & Tracking Network technology. As a result of experimenting with the proposed model, it showed 73% accuracy with the label actually given by experts.

A Study on the Prediction for the OCR Technology Development Trajectory based on the Patent and Article Information (특허와 논문정보를 활용한 OCR 기술발전 동향예측에 관한 연구)

  • Won Jun, Kim;Sang Kon, Lee;Sung Kuk, Pyo
    • Journal of Information Technology Services
    • /
    • v.21 no.6
    • /
    • pp.39-51
    • /
    • 2022
  • As the 4th Industrial Revolution emerged as a key to improving national competitiveness, OCR technology, one of the major technologies in the 4th industry is in the spotlight. Since characters in various images contain a lot of information, OCR technology for recognizing these characters has evolved into technology used in many industries. In this paper, trends in OCR technology were identified and predicted using thesis data published in 'RISS' and patent data by International patent classification (IPC) under the theme of Optical character recognition (OCR). For patent data 20,000 patents related to OCR technology from 2002 to 2020 were used as data, and 432 papers from 2012 to 2022 were used as data. Through time-series analysis, each patent data and thesis data were investigated since when OCR technology has developed, and various keyword analysis predicted which technology will be used in the future. Finally, the direction of future OCR technology development was presented through network association analysis with patent data and thesis data.

An Empirical Analysis about the Effect on Performance of Firm's Patent Competency : Focusing on the High Performance Venture Firms in Korea (기업의 특허 역량이 성과에 미치는 영향에 관한 실증 분석 : 우수 벤처기업을 중심으로)

  • Ahn, Yeon S.
    • Knowledge Management Research
    • /
    • v.11 no.1
    • /
    • pp.83-96
    • /
    • 2010
  • In this study, the effect of firm's patent competency on the their management performance was analysed. The number of patents granted to Korean firms, patent grade score as of the firm's patent competence were considered in the perspectives of patent volume and patent value respectively. Specially the analysis were implemented focusing on the high performance venture ranked 200th in Korea. The patent source data were from the Korean Intellectual Property Office, Korean Credit Evaluation Information Company, and the Patent Evaluation System of KIPO and KIPA. And the year sales and net profit volume as of the firm's management performance data from the KIS. Management performance data are consisted of the mean sales, net profit and ROI during the 4 years from FY2005 to FY2008. Major results are as follows. The regression model were proved significantly that the year sales volume and net profit are effected by the number of patents and patent grade score. But the model including the ROI were shown not significantly. So it can be concluded that patent volume and patent value are the important factors on firm's financial performance as of the year sales volume and net profit. Also the regression model including the control variables, firm's number of employee and business year, the number of patents and patent grade score are the significant factors on firms performance. And regression coefficients of patent value model were higher than these of patent volume model. So it can be recognized that patent value of firms' patent competency are more important factor than the patent volume.

  • PDF

Technology Forecasting using Bayesian Discrete Model (베이지안 이산모형을 이용한 기술예측)

  • Jun, Sunghae
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.27 no.2
    • /
    • pp.179-186
    • /
    • 2017
  • Technology forecasting is predict future trend and state of technology by analyzing the results so far of developing technology. In general, a patent has novel information about the result of developed technology, because the exclusive right of technology included in patent is protected for a time period by patent law. So many studies on the technology forecasting using patent data analysis has been performed. The patent keyword data widely used in patent analysis consist of occurred frequency of the keyword. In most previous researches, the continuous data analyses such as regression or Box-Jenkins Models were applied to the patent keyword data. But, we have to apply the analytical methods of discrete data for patent keyword analysis because the keyword data is discrete. To solve this problem, we propose a patent analysis methodology using Bayesian Poisson discrete model. To verify the performance of our research, we carry out a case study by analyzing the patent documents applied by Apple until now.

Analysis of Causal Relationship between Patent Indicators and Firm Performance (특허지표와 기업 성과의 인과관계에 대한 분석)

  • Lim, Ji-Youn;Kim, Chul-Young;Gu, Ja-Chul
    • Korean Management Science Review
    • /
    • v.28 no.2
    • /
    • pp.63-74
    • /
    • 2011
  • As business environment has become more competitive, the R&D strategies of firms have been regarded more important. Patent has information about technology which affects a firm's profit and it is considered as resources which have provided appropriate data for research of innovations and trends in technology. And patent indicators are known as qualitative representation of technology quality in an objective view. Also, they are available for the continuous and systematic analysis. However, most previous studies have focused on developing patent indicators to investigate patent value and characteristics. Furthermore they have limitations that most results is not significant that patent indicators have effect on firm performance-Tobin's q, Intangible assets based on balance sheet, sales and etc. Thus, the purpose of this paper is to propose proper a factor to represent a firm performance and to analyze causal relationship between patent indicators and firm performance. Intangible assets based on market value are employed as one of most significant firm performance indicator. The results indicate that intangible assets are appropriate for analyzing causal relation between patent and a firm performance with 7 significant indicators among 10 patent indicators. Considering firm's exogenous factors, regression analysis of each data for five years is performed. This result is similar to regression analysis of full data for all years.

Design of Consolidated Patent Index for Effective Utilization of Patent Information (특허정보의 효율적 활용을 위한 통합형 특허지표 설계)

  • Shin, Han-Seop
    • Korean Management Science Review
    • /
    • v.24 no.2
    • /
    • pp.1-18
    • /
    • 2007
  • This paper presents a consolidated patent index to measure national technology innovation and science technology activation, as well as index for the main constituent such as corporation, research organization by comprehensive analysis of existing patent index. It is classified by macroscopic index and analytical index in the consolidated patent index, in which macroscopic index is to present a degree of innovation in national scientific innovation and is divided into the Consolidated Patent Index and Index for comparison between countries. The analytical index basically designed to measure R&D activity by the main constituent is divided to present by quantitative index utilizing bibliographical data in patent and other technical publication related therein, and qualitative index for analysis of bibliographical data. In this paper, the Consolidated Patent Index is presented by adding Creation Index representing for patent by developing excellent technology, Evaluation Index representing valuable technology thereof, and Utility Index representing applicability diffused.

A Study on Efficient Noise Filtering of Patent Data Analysis and Level Assessment of Patent Technology which improve reliability (특허 데이터 분석시 효율적인 노이즈 제거와 신뢰도가 향상된 특허 기술수준 평가에 관한 연구)

  • Kang, Hee-Seop;Lee, Seung-Ho
    • Journal of Korea Technology Innovation Society
    • /
    • v.15 no.1
    • /
    • pp.105-128
    • /
    • 2012
  • This paper proposes the technological level assessment which improved reliability and the efficient noise elimination methods in the process of establishing patent map analysis data. In order to eliminate efficiently noise (removed by the manual process in the past), the paper applies the Logical Operator 'AND', makes it a program in excel VBA(Visual Basic Application), and obtains the valid data. For the improved reliability technological level assessment of the patents, the study calculates average number of claims, Patent Family Size(PFS), Cites Per Patent (CPP), Triad Patent Families, Standardization Patent Diversification Index (stdPCPI), and haF-index(Hirsch a Family index). The result which applied noise exclusion work showed less than 10% of acquired patent data ratio and confirmed high reliability. The result that apply proposed technological level assessment index makes sure that balanced technological level assessment which improved reliability by producing synthetic technological level assessment.

  • PDF

A study on the systematic operation of the innovative patent strategy framework and the application plan of patent big data to secure competitive advantage (혁신특허전략 프레임워크의 체계적 운영 및 경쟁우위확보를 위한 특허빅테이터 활용방안에 관한 연구)

  • Kim, Hyun Ah;Cha, Wan Kyu
    • The Journal of the Convergence on Culture Technology
    • /
    • v.7 no.2
    • /
    • pp.351-357
    • /
    • 2021
  • At the time when interest in the use of big data is rising in the face of the technological paradigm shift of the 4th industrial revolution, interest in the use of patented big data is increasing, especially as the proportion of intangible assets of companies increases. In addition to quantitative information, patent data contains various information such as unstructured text such as title, abstract, claim, citation and citation relations, drawings, and technology classification. It is judged that the use of treatment is important. Therefore, in this study, in order to systematically operate the innovative patent strategy framework and to secure a competitive advantage by strengthening the fundamental technological competitiveness of the company, we propose a method of using patent big data centering on the case of Company A, and verify its validity. I would like to suggest some implications. Through this, it is intended to raise awareness of the use of patent big data, and to suggest ways to use patent big data in connection with the company's company-wide strategy, business strategy, and functional strategy.

LED Knowledge Map through Competition Analysis based on Intellectual Property (지식재산권 기반 경쟁력 분석을 통한 LED 지식 맵)

  • Koo, Young-Duk;Kwon, Young-Il;Jeong, Dae-Hyun
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.8 no.1
    • /
    • pp.7-12
    • /
    • 2013
  • In this paper, we provide a basic data to constitute knowledge map through analysis of competition situation such as analysis of patent activity for each nationality, analysis of patent activity for each applicant for a patent, analysis of patent activity for each technical area and analysis of competition status for power of security for market which consider qualitative level. In order to analysis LED data, we choose patent data of LED.