• 제목/요약/키워드: Criterion-referenced Test

검색결과 16건 처리시간 0.028초

성과중심교육 측면에서 우리나라 의과대학 학생평가의 현실과 과제 (Current and Future Challenges of Student Assessment in Medical Education from an Outcome-based Education Perspective)

  • 박장희
    • 의학교육논단
    • /
    • 제15권3호
    • /
    • pp.112-119
    • /
    • 2013
  • Most medical colleges in Korea have been shifting from traditional education to outcome-based education, which is the general trend in medical education. The purpose of this study was to make some suggestions in light of the reality and challenges of student assessment in medical education from the perspective of outcome- based education. First, those who are responsible for student assessment should be diversified to include faculty, residents, students, and evaluation committee members. They need separate roles in educational evaluation, so evaluation competencies are required for them. Second, various methods for evaluation and score interpretation can be used for effective evaluation. We can adopt diagnostic, formative, and summative evaluation functionally, and the norm-referenced, criterion-referenced, growth-referenced, and ability-referenced evaluation based on criteria for score interpretation. Finally, various evaluation domains and test forms can be administered together in the common lectures in the medical school. We can test not only knowledge but also skills and attitudes, with diverse test forms such as supply and performance types.

예비수학교사의 교직 적성·인성 검사에서 분할점수 변화에 따른 다양한 신뢰도 탐색 (Investigation of Various Reliability Indices of Pre-service Mathematics Teachers' Teaching Aptitude and Personality Test based on Setting Cut Scores)

  • 김성연
    • 한국수학교육학회지시리즈A:수학교육
    • /
    • 제57권1호
    • /
    • pp.55-74
    • /
    • 2018
  • The purpose of this study is first to examine the relative influence of each error source and to investigate the optimal measurement conditions to ensure satisfactory multiple reliability coefficients based on the teaching aptitude and personality test for pre-service teachers. Participants were 33 students enrolled in mathematics education in a graduate school of education located in the Seoul metropolitan area from 2013 to 2017. The main results were as follows. First, the estimated variance due to residual was highest, followed by nesting of items within domains, graduate students, interactions of graduate students with domains, and domains. Second, total 96 items, with 12 domains containing 8 items in each domain, with cut score of 598, and original 210 items, with 14 domains containing 15 items in each domain, with cut scores of 615 or 716 were optimal measurement conditions to reach acceptable reliability levels based on the joint consideration of dependability coefficients, cut score dependability coefficients, adjusted dependability coefficients, and standard errors of measurement. Third, larger deviations between the arithmetic mean and the cut score indicated higher reliability coefficients of the test results. Finally, this study suggests ways for practitioners to consider how to apply generalizability theory for criterion-referenced tests and how to develop future research based on limitations.

교구를 활용한 수학적 과정의 평가모델 개발에 관한 연구 -중학교 수학을 중심으로- (A Study on the Development of the Model for the Process-focused Assessment Using Manipulatives -Focused on Middle School Mathematics-)

  • 고상숙;한혜숙;이창연
    • 한국수학교육학회지시리즈E:수학교육논문집
    • /
    • 제27권4호
    • /
    • pp.581-609
    • /
    • 2013
  • 제 7차 교육과정이후 학생들의 수학 학습을 돕기 위해서 다양한 평가 방법으로 학습 과정 및 수학 수준 등에 대하여 진단해야하는 평가의 중요성이 꾸준히 논의되었다. 본 연구는 2009 개정교육과정에서 강조하는 수학적 과정인 문제해결, 추론, 의사소통 능력을 향상시키기 위해서 교구를 활용한 평가모델을 개발하는 것이다. 우선 교구를 활용한 평가에 적합한 평가원리를 설정하여 각 영역별로 문항과 채점기준표를 개발하고 예비 연구를 실시한 결과 관찰체크리스트의 필요성이 제기되었다. 나아가 예비 연구를 바탕으로 수정된 평가모델을 사용하여 평가를 실시한 결과, 수학적 과정인 각 영역별 특성에 따른 채점기준표에 의해 학생들의 수학적 사고과정을 구체적으로 파악할 수 있어서 목표지향평가가 용이해짐을 알 수 있었다.

초등학교 수학과 학생평가 실태 분석 (A Study on the Student Assessment of Elementary School Mathematics)

  • 이종욱
    • 한국수학교육학회지시리즈A:수학교육
    • /
    • 제48권1호
    • /
    • pp.21-32
    • /
    • 2009
  • The purpose of this study is to diagnose the current states and the problems of student assessment of Elementary School Mathematics. For that purpose, this study conducted a survey and had the individual interviews. The surrey items consisted of the six main parts: questions about the development of assessment tools, the method to assess, the grading, the special supplementary courses, the opening of learning effect, and the follow-up guidances. The results of this study are as the follow First, elementary teachers depended heavily on internet sites for developing assessment problems. Second, elementary teachers made use of a performance assessment, a unit assessment, and a term examination at ordinary times. Third, unit assessment was largely referred for grading by elementary teachers. Fourth, in selecting the students for the special supplementary courses, both criterion-referenced assessment and norm-referenced assessment were considered. After finishing the special supplementary courses, additional tests were usually taken. Fifth, elementary teachers took a negative attitude in opening of learning effect. specialty opening of test paper to parents of students was done under 30%. Sixth, fellow-up guidances were the most through the classroom guidances. but consulting with parents of students was not frequently conducted by teachers.

  • PDF

Student Responses to Smart Device-Based Test on Competency Evaluation in Dental Education

  • Kim, Jooah;Kim, Soo-Yoon
    • Journal of Korean Dental Science
    • /
    • 제12권2호
    • /
    • pp.58-65
    • /
    • 2019
  • Purpose: This study was aimed to investigate the possibility of utilizing smart device-based test (SBT) for competency evaluation in dental education and to analyze the student responses on overall competency evaluation using SBT method, in comparison to ubiquitous-based test (UBT). Materials and Methods: Questionnaire surveys have been conducted at Yonsei University College of Dentistry from 2015 to 2018 to obtain students' feedback on the application of SBT to competency evaluation. In addition, in order to supplement the competency evaluation procedure, considerations were explored by comparing the expected and actual difficulty of each item when preparing items for competency evaluation with SBT. Result: According to the survey results, student responses between the initial two years (2015 and 2016) differed from those in next two years (2017 and 2018). Students in 2017 and 2018 had more positive responses on competency evaluation with SBT. To determine the test validity, criterion-referenced evaluation was adopted to compare the data in 2017 and 2018 and slight differences in test difficulty in 2018 between the expected and actual difficulty of items were found. Conclusion: The results indicated that SBT was more appropriate for competency evaluation than UBT, based on four-year period of competency evaluation. The SBT was not affected by either the file size or the number of test-takers. Interestingly, students were not sensitive to test version of competency evaluation (paper-based test and SBT). This study suggests that the quality of the test items should be measured by continuous monitoring of the expected and actual difficulty of items for determining test validity. More detailed results and discussions of the findings are given for the development of test procedure and further potential research directions in dental education.

위.장관계 수술 환자간호의 질평가를 위한 도구개발 (Development of an evaluation tool of quality of nursing care for gastrointestinal surgery patient)

  • 이병숙;박정호;조현
    • 한국의료질향상학회지
    • /
    • 제4권2호
    • /
    • pp.260-278
    • /
    • 1997
  • Background : Quality of professional nursing care is the most essential factor for survival and growth of nursing profession. Then, nursing professionals have responsibility for the evaluation of quality of professional nursing care. The purpose of this study was to develope an evaluation tool of nursing care for patients received gastrointestinal surgery with general anesthesia. This study was a primary work for the developement of a computer program for the evaluation of nursing care. Methods : This study was done through some consecutive steps. They were (1) Developement of items for the tool (2) Developement of an evaluation tool of nursing care quality for the G-I surgery patient (3) Test of reliability and validity of the tool. Two groups of experts and expert pannels who had much experience of the QA and the care of G-I surgery patients participated for developement of the items. 85 nursing records were used for the test of reliability and validity of the developed tool. The evaluation tools were developed with two types of scoring, norm-referenced tool and criterion-referenced tool. Results The system of items for tool was evaluation area evaluation item-indicator. There were 7evaluation areas which contained 32evaluation items which contained 7lindicators. Evaluation areas 1, 2, 3, 4 were for the evaluation of process and 5, 6, 7 were for the evaluation of outcome of nursing care for G-I surgery patient. For the test of interrator reliability, correlation coefficients of each scores of items and intragroup correlation coefficients were calculated. The average correlation coefficients between two rators were 0.65, 0.54 and the intragroup correlation coefficient were 0.99 and 1.00 by the types of scoring. The Cronbach alpha coefficients of the tools were 0.54 and 0.46 by the types of scoring. The average content validity index of the items was 0.95 from 4 pairs of experts. Because there were significant differences between some scores of quality of nursing care of 3 general hospitals regardless of the types of scoring, the tools could be thought to have some construct validity. And also, there were significant correlations between some scores of quality of nursing care and admission days and admission days after surgery regardless of the types of scoring, the tools could be thought to have predictive validity. Conclusion In this study, the evaluation tool of nursing care was developed for the very specified group of patient, G-I surgery patient. And the items were developed and tested by the experts of nursing practice. Because of these reasons, it was supposed that the tool could be used effectively in nursing pratice. And the procedures for the development and the test of the evaluation tool of nursing care in this study were supposed to be used for the developement of other tools.

  • PDF

학령전 아동을 위한 호흡기전염병 예방 프로그램의 개발 및 효과에 관한 연구 (A Study on Health Education Program Development of Respiratory Communicable Disease Prevention for Preschool Children and the Measurement of It's Effects)

  • 김일옥
    • Child Health Nursing Research
    • /
    • 제10권1호
    • /
    • pp.66-79
    • /
    • 2004
  • Purpose: The purpose of this study were to develop a respiratory communicable disease prevention program for preschoolers and measure it's effects. Method: The respiratory communicable disease prevention program for preschoolers consisted of texts, cartoons, photographs, discussions, demonstrations, puzzle games, die games, compensation/reinforcement, and token economy which were directed under the systematic design of instruction by Dick %amp; Carey. This study was a quasi experimental study under the nonequivalent control group with pretest-posttest design. The subjects of this study were 45 preschool children who are attending 3 different district nursery schools and they were matched by the age, pretest knowledge, and pretest behavior. The instrument used in this study was criterion referenced test items that were developed by a researcher for evaluating the subject's knowledge, attitude, and behavior about respiratory communicable disease prevention. A pretest was administered a week before treatment. Experimental group Ⅰ was administered by the treatment of respiratory communicable disease prevention program. Experimental group Ⅱ was administered by above program with token economy program. The posttest was conducted on the eighth day. The third test for behavior was completed 15th day. To determine the effect of the program, the data were analyzed by the SAS 6.12 program with Kruskal Wallis test, ANCOVA, ANOVA, Duncan's test and paired t-test. Result: 1) There was a significant difference in knowledge between the experimental groups and control group(F=5.89, P=0.0197). 2) There was a significant difference in attitude between the experimental groups and control group(F=3.29, P=0.0469). 3) There was a non-significant difference in behavior between the experimental groups and control group(F=0.00, P=0.9512). 4) In the experimental groupⅡ, there was highly significant increase in behavior after token economy(t=4.5252, P=0.0005). Conclusion: It was found that the respiratory communicable disease prevention program for preschool children was effective in changing the preschoolers' knowledge and attitude on the respiratory communicable disease prevention, but not enough for changing the preschoolers' behavior. Token economy was improved as an effective and strong method for inducing desirable changes of preschoolers' behavior.

  • PDF

중학생 약물오남용 프로그램의 효과 (Effectiveness of a Drug Misuse and Abuse Preventive Program for Middle School Students)

  • 이윤영;한숙정
    • 한국학교보건학회지
    • /
    • 제19권2호
    • /
    • pp.89-104
    • /
    • 2006
  • Purpose: This study was to develop and verify the effects of drug misuse and abuse preventive program for middle school students. Methods:This research was a quasi experimental study under the nonequivalent control group with pretest-post test design which tried to protect children from the detrimental effect of drugs and develop a drug abuse prevention program for middle school students. Data was collected from October 10th to 21th, 2005. Subject consisted of 145 middle school students in Kyeonggi, experimental group-72, control group-73. Dick & Carey's(1996) educational system was applied, based on documents and materials online related to drug abuse in order to develop drug abuse prevention program. It's composed of 4 parts, 45 minute each. The evaluation instrument testing for the knowledge about drugs was a criterion of referenced test items modeled by Dick & Carey. The instrument for attitudes about drugs was modeled by Kim, Soyaja. A pre-test was taken on the knowledge and attitudes to drugs. The experimental students were given four sessions of drug abuse prevention education. A post-test similar to the pre-test questionnaire was given in 1 week, 4 weeks following the last session. Collected data was analyzed by using SAS 9.1 program. Results:Followings are the summarized result of study 1. The experimental group, that attended the drug abuse prevention program will have more knowledgable about drugs than the control group (F=27.31, p<.0001). 2. The experimental group, that attended the drug abuse prevention program displayed greater negativism attitude than the control group (F=0.58, p=0.4477). Conclusion:The results conclude that drug abuse prevention programs increase the knowledge of middle school students but doesn't change their attitude toward drugs. Therefore we need to offer them more systematic education to increase their knowledge so it will also improve their attitudes as well.

제7차 교육과정에 근거한 준거지향적 수행평가 문항의 개발과 평가 -고등학교 과학 "생식"과 "생물 농축" 단원을 중심으로- (Development and Evaluation of Criterion-Referenced Performance Assessment Items Based on the 7th National Science Curriculum -Subject Unit of Reproduction and Biological Accumulation-)

  • 정영란;박진주
    • 한국과학교육학회지
    • /
    • 제24권3호
    • /
    • pp.519-531
    • /
    • 2004
  • 최근 제7차 교육과정이 실시되면서 준거 지향적인 수행평가에 대한 요구가 커지고 있으며, 이러한 상황에서 평가의 절대적인 기준을 마련할 필요성 또한 절실해지고 있다. 본 연구는 제7차 교육과정에 근거하여 고등학교 과학 중 '생식' 단원과 '생물 농축' 단원에 대해서 필수 학습 요소, 성취 기준, 평가 기준 및 평가 문항을 개발하고, 이를 바탕으로 개발한 평가 문항을 실제 현장에 적용하여 그 타당성을 검증하는 것을 목적으로 한다. 본 연구에서는 제7차 교육과정과 고등학교 과학의 7종 교과서를 바탕으로 하여, '생식' 과 '생물 농축' 단원의 필수 학습 요소를 추출하고, 준거 지향 평가의 기초가 되는 객관적이고 타당한 성취 기준 및 평가 기준을 마련하였다. 추출된 필수 학습 요소는 '생식' 단원에서 12개, '생물 농축' 단원에서 4개이었으며, 개발된 성취 기준은 '생식' 단원에서 총 26개, '생물 농축' 단원에서 총 9개로 각각은 지식(K), 탐구(P), 태도(A)의 세 개 영역으로 구분하여 개발하였다. 이상과 같이 개발된 성취 기준을 바탕으로 평가 기준을 개발하였다. 개발된 평가 기준의 수는 '생식' 단원에서 총 25개, '생물 농축' 단원에서 9개였다. 성취 기준과 평가 기준에 따라 '생식' 단원과 '생물 농축' 단원에서 개발된 평가 문항의 수는 서술형 문항 17개, 논술형 문항이 13개, 포트폴리오가 2개로 총 22문항이었다. 각 평가 문항은 구체적인 채점 기준을 제시하였으므로, 교육과정에 근거한 객관적인 평가에 사용될 수 있다. 개발된 평가 문항 중, 서술형 문항 8개를 경기도 소재 고등학교 1학년 학생 240명을 대상으로 적용 분석하였다. 서술형 문항에 대한 학생들의 응답을 두 가지 검사 이론을 적용하여 분석하였는데, 평가 문항의 기본적인 양호도 검증을 실시한 결과, 본 연구에서 개발된 서술형 문항들은 난이도와 변별도 면에서 전반적으로 적절한 지수를 보여 수행 평가로 사용하기에 적절한 문항들이었다. 개발한 서술형 수행 평가 문항들은 미리 개발된 성취 기준과 평가 기준을 바탕으로 하였으므로 교수 학습의 목표와 내용에도 적합하였다. 고전 검사 이론과 다분 문항 반응 이론에 의해 분석된 문항의 변별도에는 언어적인 해석의 차이가 없었으나, 문항 난이도의 경우는 고전 검사 이론을 사용하였을 때 집단의 특성이 많이 나타나 다분 문항 반응 이론을 사용한 결과와 비교하여 해석의 차이가 다양하게 나타났다.

국가 교육과정에 근거한 공통과학 평가 기준 및 평가 도구 개발 연구 (Development of National Curriculum-Based Assessment Standards and Instruments for High School Common Science)

  • 이양락;이선경;홍미영;홍재식
    • 한국과학교육학회지
    • /
    • 제19권1호
    • /
    • pp.159-172
    • /
    • 1999
  • 본 연구는 '97년에 수행된 "공통과학 국가공통 절대평가 기준 개발 연구"의 후속 연구로서 국가 교육 과정에 근거한 고등학교 공통과학의 평가 기준 및 도구를 개발하였다. 기준 개발의 각 과정에서 과학교육 전공 교수, 고등학교 교사 교육부 관계관들과 협의회, 워크샵, 집중작업 등을 통해 의견을 수렴하고, 합의된 안을 도출하였으며, 구체적인 과정은 다음과 같다. -성취기준 검토 및 수정: 선행 연구에서 개발된 성취기준을 현장 교사 대학 교수, 교육부 관계관 등의 검토 의견을 반영하여 수정하였다. -평가기준 개발: 공통과학의 37개 중영역에 대하여 학생들의 성취 정도를 상/중/하로 판단할 수 있는 준거를 개발하였다. 평가도구 개발, 각 중단원의 성취 정도를 평가할 수 있는 도구를 중단원별로 2조 이상 개발하였다. 선택형이나 단답형 평가도구 보다는 타당도를 중시한 수행평가(서술형, 관찰, 보고서, 포트폴리오 평가 등) 위주로 개발하였다.

  • PDF