• Title/Summary/Keyword: Document Analysis


A study on the electronic document delivery systems (전자식 문헌전송 시스팀에 관한 고찰)

  • 박준식;김정현
    • Journal of Korean Library and Information Science Society / v.16 / pp.191-220 / 1989
  • This study attempts to furnish helpful data for the design and implementation of electronic document delivery systems, based on an analysis of existing cases. The concepts and basic models of the electronic document delivery system (Fig. 1) are reviewed in the second chapter, and on that basis concrete cases are introduced in the third chapter: the ADONIS, ARTEMIS, HERMES, APOLLO, UNIVERSE, and DOCDEL projects, among others. As electronic communication technology develops rapidly, electronic document delivery systems have considerable room to evolve.


Analysis of Document Clustering Varing Cluster Centroid Decisions (클러스터 중심 결정 방법에 따른 문서 클러스터링 성능 분석)

  • 오형진;변동률;이신원;박순철;정성종;안동언
    • Proceedings of the IEEK Conference / 2002.06c / pp.99-102 / 2002
  • The K-means clustering algorithm is a very popular clustering technique used in the field of information retrieval. In this paper, we address a weakness of K-means in how centroids are created, and we suggest a method that reflects document features and considers the context of each document when determining the new centroids. For the experiment, we used an automatic document summarizer to summarize the Reuters-21578 newswire test dataset and achieved a 20% improvement in recall.

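For reference, the baseline that the abstract above sets out to improve can be sketched as plain K-means over term-count vectors. This is a minimal sketch, not the authors' centroid method: it assumes cosine similarity for assignment, the component-wise mean for the centroid update, and seeding from the first k documents. The vocabulary and counts are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def mean_vector(vectors):
    """Component-wise mean: the plain K-means centroid."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def kmeans(docs, k, iterations=10):
    """Cluster term-count vectors; the first k documents seed the centroids."""
    centroids = [list(d) for d in docs[:k]]
    labels = [0] * len(docs)
    for _ in range(iterations):
        # Assignment step: each document joins its most similar centroid.
        labels = [max(range(k), key=lambda c: cosine(d, centroids[c])) for d in docs]
        # Update step: recompute each centroid from its current members.
        for c in range(k):
            members = [d for d, lab in zip(docs, labels) if lab == c]
            if members:
                centroids[c] = mean_vector(members)
    return labels, centroids


# Hypothetical term counts over the vocabulary [oil, price, match, goal].
docs = [
    [3, 2, 0, 0],
    [0, 0, 4, 1],
    [2, 3, 0, 1],
    [0, 1, 2, 3],
]
labels, centroids = kmeans(docs, k=2)
print(labels)
```

The update step is the part a centroid-decision method would replace, for example by weighting members rather than averaging them equally.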

A Study on the Establishing Document Control System in Quality Management (품질경영 체제에서의 문서관리 시스템 확보 방안)

  • 박상필;김영세;박건우
    • Journal of Korean Society of Industrial and Systems Engineering / v.18 no.36 / pp.307-313 / 1995
  • Documents are widely recognized as useful for obtaining and transferring information. Establishing a good document control system is important but difficult. In this respect, document control is the foundation of the quality system. This paper presents possible implementation methods and ways to achieve good document control through an analysis of code requirements. The best method is to give people freedom.


A Trend Analysis and Strategy for Document Delivery Service in the Changing Digital Information Environment and Copyright Law (디지털 정보환경과 저작권법 변화에 따른 원문제공서비스 동향분석 및 대응전략)

  • Lee, Seon-Hee;Kim, Ji-Young;Kim, Hye-Sun
    • Journal of Information Management / v.43 no.3 / pp.139-160 / 2012
  • The document delivery service has been influenced by the changing digital information environment and by copyright law. This paper proposes key points for improving document delivery services through a trend analysis of BL (Great Britain), NLA (Australia), subito (German-speaking countries), JST (Japan), KISTI (Korea), and KERIS (Korea), together with the copyright law of each country. The purpose of this study is to provide clues for developing strategies that satisfy users' needs for document delivery services in libraries and information centers.

A Study on the Collaboration Network Analysis of Document Delivery Service in Science and Technology (과학기술분야 원문제공서비스의 협력 네트워크 분석)

  • Kim, Ji-Young;Lee, Seon-Hee
    • Journal of Korean Library and Information Science Society / v.44 no.4 / pp.443-463 / 2013
  • The Korea Institute of Science and Technology Information (KISTI) provides domestic researchers with science and technology information through the NDSL Information Document Service (NIDS) network to improve research productivity in Korea. University libraries and the information centers of research institutes play a major role in the NIDS collaboration network. In this study, we examined the relationships among the organizations participating in the document delivery service using social network analysis (SNA). The centrality of each organization in the NIDS network was analyzed with indexes such as degree centrality, closeness centrality, betweenness centrality, and eigenvector centrality. The results show that KISTI, KAIST, POSTECH, and FRIC are located at the center of the NIDS network. Based on these results, this paper suggests several directions for improving the document delivery service.
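Two of the centrality indexes named in the abstract above can be illustrated on a toy network. This is a minimal sketch using the standard textbook definitions on an undirected, unweighted, connected graph; the node names and edges are hypothetical and are not taken from the NIDS data.

```python
from collections import deque


def shortest_path_lengths(graph, source):
    """Breadth-first search hop counts from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for neighbor in graph[node]:
            if neighbor not in dist:
                dist[neighbor] = dist[node] + 1
                queue.append(neighbor)
    return dist


def degree_centrality(graph):
    """Number of neighbors divided by the maximum possible (n - 1)."""
    n = len(graph)
    return {node: len(neighbors) / (n - 1) for node, neighbors in graph.items()}


def closeness_centrality(graph):
    """(n - 1) divided by the sum of distances; assumes a connected graph."""
    n = len(graph)
    result = {}
    for node in graph:
        total = sum(shortest_path_lengths(graph, node).values())
        result[node] = (n - 1) / total if total else 0.0
    return result


# Hypothetical star network: one hub exchanging documents with three libraries.
graph = {
    "hub": {"lib_a", "lib_b", "lib_c"},
    "lib_a": {"hub"},
    "lib_b": {"hub"},
    "lib_c": {"hub"},
}
degree = degree_centrality(graph)
closeness = closeness_centrality(graph)
print(max(degree, key=degree.get))
```

In this star-shaped example the hub scores highest on both indexes, which is the kind of pattern that identifies an organization as central to a network.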

Document Clustering Method using PCA and Fuzzy Association (주성분 분석과 퍼지 연관을 이용한 문서군집 방법)

  • Park, Sun;An, Dong-Un
    • The KIPS Transactions:PartB / v.17B no.2 / pp.177-182 / 2010
  • This paper proposes a new document clustering method using PCA and fuzzy association. The proposed method can represent the inherent structure of document clusters better because it selects the cluster labels and the terms representing each cluster using semantic features based on PCA. It can also improve the quality of document clustering because the fuzzy association values used to cluster documents distinguish dissimilar documents within clusters well. The experimental results demonstrate that the proposed method achieves better performance than other document clustering methods.

The Engineering Change Document Management using SGML in PDM (SGML을 활용한 PDM에서의 설계변경문서관리)

  • Kim, Joon-Oh;Kim, Sunn-Ho
    • IE interfaces / v.10 no.2 / pp.79-90 / 1997
  • Documents in a traditional PDM (Product Data Management) system have been managed as scanned document files or as electronic documents developed with specific tools. Although each tool manages documents with its own systematic methods, this approach has drawbacks in data search, data integration, and data interchange. For this reason, we propose an efficient document management system for PDM using SGML (Standard Generalized Markup Language), one of the CALS and ISO standards for document interchange. Among the documents to be managed in PDM, the engineering change notification (ECN) is considered. The DTD (Document Type Definition) has been constructed based on a logical analysis of the document format. In addition, based on the DTD, DB classes have been designed following the object-oriented paradigm, and a prototype for document input/output and search has been developed using the UniSQL ORDBMS (Object-Relational DBMS) and PowerBuilder in a client/server environment.


Analysis on Current Issues and Cases of Electronic Document Delivery Service for Sharing of Knowledge Information (지식정보 공유를 위한 전자원문서비스의 주요 이슈와 사례 분석)

  • Yoo, Su-Hyeon;Choi, Hee-Yoon
    • Journal of the Korean Society for information Management / v.23 no.2 / pp.81-96 / 2006
  • Changes in the document delivery service environment, such as the spread of web-based research communication and direct communication between users and information providers, have had considerable effects on document delivery service institutes. Swift advances in information technology allow users to receive information on their desktops via the web. Web-based document delivery makes reproduction and distribution on a massive scale possible, so the rights of copyright holders need to be protected. This study identifies current trends and issues in the document delivery service environment and reviews electronic document delivery services in foreign countries. It also introduces the domestic electronic document delivery service, e-DDS, and evaluates the copyright issues for the service.

Local Similarity based Document Layout Analysis using Improved ARLSA

  • Kim, Gwangbok;Kim, SooHyung;Na, InSeop
    • International Journal of Contents / v.11 no.2 / pp.15-19 / 2015
  • In this paper, we propose an efficient document layout analysis algorithm that includes table detection. Typical methods of document layout analysis use the height of, and the gaps between, words or columns. To handle documents of various styles and sizes, we propose an algorithm that uses the mean value of the distance transform, which represents stroke thickness, and compares it with the components in the local area. We combine this algorithm with a table detection algorithm that uses the same feature as the text classifier. Table candidates, separators, and big components are isolated from the image using Connected Component Analysis (CCA) and the distance transform. The key idea of the text classification is that text components share a similar thickness and height. To estimate local similarity, we detect a text region using an adaptive search window size. An improved adaptive run-length smoothing algorithm (ARLSA) is proposed to create a proper boundary between text zones and non-text zones. Results from experiments on the ICDAR2009 page segmentation competition test set and on our own dataset demonstrate the superiority of our method through an f-measure comparison with other algorithms.
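The run-length smoothing idea that the abstract above builds on can be sketched in its classic, non-adaptive form. This is a minimal sketch and not the paper's improved ARLSA: it assumes a binary image stored as lists of 0 (background) and 1 (foreground), a single fixed threshold, and horizontal smoothing only. The function names and the sample image are hypothetical.

```python
def smooth_row(row, threshold):
    """Fill background runs of length <= threshold lying between two foreground pixels."""
    out = list(row)
    last_fg = None  # index of the most recent foreground pixel seen
    for i, pixel in enumerate(row):
        if pixel == 1:
            if last_fg is not None:
                gap = i - last_fg - 1
                if 0 < gap <= threshold:
                    # Short gap: merge the two foreground pixels into one run.
                    for j in range(last_fg + 1, i):
                        out[j] = 1
            last_fg = i
    return out


def rlsa_horizontal(image, threshold):
    """Apply run-length smoothing independently to every row of the image."""
    return [smooth_row(row, threshold) for row in image]


image = [
    [1, 0, 0, 1, 0, 0, 0, 0, 1],
    [0, 1, 0, 1, 0, 0, 0, 0, 0],
]
smoothed = rlsa_horizontal(image, threshold=2)
for row in smoothed:
    print(row)
```

With a fixed threshold, nearby characters merge into word or line blobs while wider gaps stay open. An adaptive variant would choose the threshold per region, which is where local measures such as component thickness and height come in.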

A Text Detection Method Using Wavelet Packet Analysis and Unsupervised Classifier

  • Lee, Geum-Boon;Odoyo Wilfred O.;Kim, Kuk-Se;Cho, Beom-Joon
    • Journal of information and communication convergence engineering / v.4 no.4 / pp.174-179 / 2006
  • In this paper, we present a text detection method based on wavelet packet analysis and an improved fuzzy clustering algorithm (IAFC). This approach assumes that text and non-text regions are two different texture regions. Text detection is achieved by using wavelet packet analysis for feature analysis. Wavelet packet analysis is a method of wavelet decomposition that offers a richer range of possibilities for document images. To these multi-scale features, we apply the improved fuzzy clustering algorithm based on an unsupervised learning rule. The results show that our text detection method is effective for document images scanned from newspapers and journals.