Failure Recovery in the Linux Cluster File System SANique$^{TM}$

  • Kyu-Woong Lee (School of Computer and Information Engineering, Sangji University)
  • Published: 2001.12.01

Abstract

This paper gives an overview of the design of SANique$^{TM}$, a shared file system for Linux clusters in a SAN environment. SANique$^{TM}$ can transfer user data from network-attached SAN disks directly to client applications without the control of a centralized file server. The paper also presents the characteristics of each SANique$^{TM}$ subsystem: CFM (Cluster File Manager), CVM (Cluster Volume Manager), CLM (Cluster Lock Manager), CBM (Cluster Buffer Manager), and CRM (Cluster Recovery Manager). Within this design, the "split-brain" syndrome in shared file system environments is then described and defined. The work first generalizes and illustrates the situations in which a shared file system environment may split into two or more separate brains. Finally, it describes the SANique$^{TM}$ approach to the "split-brain" problem, which uses a SAN disk named "split-brain", and develops the overall recovery procedure for shared file systems.
