|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Education Special Series |
RR Utzman, PT, MPH, PhD, is Associate Professor and Academic Coordinator of Clinical Education, Division of Physical Therapy, West Virginia University School of Medicine, PO Box 9226, Morgantown, WV 26506-9226 (USA)
DL Riddle, PT, PhD, FAPTA, is Otto D Payton Professor, Department of Physical Therapy, Medical College of Virginia Campus, Virginia Commonwealth University, Richmond, Va
DV Jewell, PT, PhD, CCS, FAACVPR, is Assistant Professor, Department of Physical Therapy, Medical College of Virginia Campus, Virginia Commonwealth University
Address all correspondence to Dr Utzman at: rutzman{at}hsc.wvu.edu
Submitted August 8, 2006;
Accepted April 18, 2007
| Abstract |
|---|
Subjects: A nationally representative sample of 20 physical therapist education programs provided data on 3,365 students.
Methods: Programs provided data regarding demographic characteristics, undergraduate grade point average (uGPA), and quantitative and verbal Graduate Record Examination scores (qGRE, vGRE). The Federation of State Boards of Physical Therapy provided NPTE data. Data were analyzed using hierarchical logistic regression.
Results: A prediction rule that included uGPA, vGRE, qGRE, and race or ethnicity was developed from the entire sample. Prediction rules for individual programs showed large variation.
Discussion and Conclusion: Undergraduate grade point average, GRE scores, and race or ethnicity can be useful for estimating student risk for failing the NPTE. Programs should use GPA and GRE scores along with other data to calculate their own estimates of student risk.
| Introduction |
|---|
|
|
|---|
The NPTE is a 200-question, standardized examination that is administered by computer.1 The examination questions are in multiple-choice format and require the student to respond to clinical scenarios that are commonly encountered by new physical therapist graduates. The NPTE is scored on a scale of 200 to 800, and the minimum passing scaled score is 600 in all 50 states.1 The examination is based on an analysis of professional (entry-level) physical therapist practice that is updated every 5 years by the Federation of State Boards of Physical Therapy (FSBPT). A new practice analysis was completed and a new passing standard was established as of November 15, 2002.1 As a result of the practice analysis, the content and overall difficulty of the examination changed.2 Mean first-time pass rates on the NPTE were lower for students who graduated from all accredited programs in 2003 and 2004 (77% and 71%, respectively) than for 2000, 2001, and 2002 graduates (90%, 91%, and 85%, respectively).2 The updated test contents and standard are the likely reasons why the pass rates for 2003 and 2004 were lower than the pass rates for 2000 to 2002.
The Commission on Accreditation in Physical Therapy Education (CAPTE) requires all programs to report pass rates as part of ongoing curriculum review.3 Students considering a career in physical therapy also may review programs NPTE pass rates when selecting a physical therapist education program. The financial stakes for a student who fails the NPTE are high. The FSBPT charges $350 to take the test, in addition to a $50 fee charged by the testing center and application fees paid to the state where the student seeks a license.1 In many jurisdictions, the student cannot practice until the examination is passed, meaning that a student who fails the examination loses potential wages in addition to the testing and application fees.
Despite the importance of the NPTE as a gateway to clinical practice, few researchers have investigated whether data obtained prior to admission to a professional physical therapist program can be used to predict performance on the licensing examination. Two early studies4,5 examined students enrolled in baccalaureate programs. Because all programs are currently educating students at the post-baccalaureate level, these data likely are no longer generalizable. More recently, Thieman et al6 used data from 121 students in a single master's-level program to determine whether grade point average (GPA) in prerequisite courses and Graduate Record Examination (GRE) scores could be used to predict raw scores on the NPTE. They found that prerequisite course GPA and GRE scores, along with age, accounted for 11% of the variance in raw NPTE score.6 In a similar study using data from 107 students in a single program, Dockter7 found that prerequisite course grades were weakly correlated (r=.34) with raw NPTE score.
Although these studies6,7 indicate that past academic performance may be associated with performance on the NPTE, neither study provides information that could be directly applied to current physical therapist education programs or students. The purpose of our study was to investigate the utility of demographic and quantitative admissions data in predicting whether students failed the NPTE at least one time. We examined only admissions data and not post-matriculation variables such as professional program GPA. We were interested only in the variables that academic program admissions committees use in making admissions decisions and the extent to which these types of variables predict NPTE performance. By matching data from a nationally representative sample of professional Master of Physical Therapy degree (MPT) and Doctor of Physical Therapy degree (DPT) programs with NPTE data provided by the FSBPT, we examined the following questions: (1) can preadmission demographic data, GPA, and verbal and quantitative Graduate Record Examination scores (vGRE, qGRE), either in isolation or in combination, be used to predict physical therapist student risk for failing the NPTE, and (2) does the extent of prediction of NPTE failure vary by program?
| Method |
|---|
|
|
|---|
Subjects
Participating programs provided data regarding all students admitted to the classes scheduled to graduate in the years 2000 through 2004. During the data collection period, 3 of the 23 participating programs withdrew from the study. These 3 programs cited lack of time and difficulty compiling the required data as reasons for not participating. The final sample included data on 3,585 students admitted to 20 physical therapist education programs.
Data Collection
For each student in the sample, the FSBPT indicated student performance on the examination using the coding outlined in Table 1. This variable was recoded into a dichotomous pass (category 1) or fail (categories 2, 3, and 4) score, which was used as the dependent variable in the study. Physical therapist education programs provided data on the independent variables, which included undergraduate GPA (uGPA), vGRE, and qGRE. Undergraduate GPA was collected as a continuous variable, ranging from 0 to 4.0. The vGRE and qGRE are each scored on a scale of 200 to 800 in 10-point increments.9 In cases where students attended more than one undergraduate institution or took the GRE multiple times, programs were asked to record uGPA and vGRE and qGRE scores that were used during the admissions process. Analytic GRE (aGRE) scores were not included in the study because the aGRE was changed to an essay-based assessment in 2002.10 Most students enrolled in physical therapist education programs during the years 2000 through 2004 would have applied for admission prior to 2002 and would have taken the older, multiple-choice version of the aGRE. Data regarding student demographics and program characteristics also were collected. These independent variables are described in Table 3 in our companion article8 in this issue.
|
Data Analysis
Prior to analyzing the data, the uGPA was transformed by multiplying by 10. By doing so, we were able to describe how the odds of NPTE failure change when the uGPA changes by 0.10 (eg, from 3.1 to 3.2). Similarly, vGRE and qGRE were transformed by dividing by 10. This transformation allowed discussion of how odds of NPTE failure change for every 10-point change (eg, from 400 to 410) in GRE score.
Between-program analysis.
Because 2 different versions of the NPTE were administered during the study period, data were analyzed separately for each version of the examination. Throughout the remainder of this article, the examination that was used prior to November 2002 (and completed by the majority of the graduates from 2000 to 2002) will be referred to as version 1, and the version used after November 2002 (completed by graduates from years 2003 and 2004) will be referred to as version 2.
Two separate hierarchical logistic regression models were generated for the entire sample of students, one for those taking version 1 and one for those taking version 2 of the NPTE. The dependent variable was dichotomized into students who passed the NPTE on their first attempt and those who did not. Race or ethnicity was not included in the models generated for each version of the NPTE examination because cell sizes were too small for some racial or ethnic categories. Tabachnick and Fidell11 noted that small cell sizes can cause errors in logistic regression analyses and recommended expected cell frequencies of at least 5.0. Cross-tabulation analyses of all combinations of the nominal independent variables (program, school ownership, degree offered, school classification, school region, sex, and cohort) and NPTE failure revealed 3.5% of cells with expected frequencies less than 5.0. This is well within Tabachnick and Fidell's recommendation that fewer than 5% of cells should have frequencies less than 5.0. After dichotomizing NPTE score to pass/fail and eliminating the race or ethnicity variables, our data set exceeded these recommendations.
We used hierarchical regression models because we wanted to enter variables into the model in a prespecified order based on evidence or theoretical arguments.12 First, we entered the program-level variables of year graduated, academic program, and degree level. We wanted to adjust for these variables to account for differences among programs and among pools of applicants to physical therapist education programs over different years. We suspected that these differences would affect the prediction of NPTE failure and, therefore, wanted to control for this variation in the analysis. Second, we entered the individual level variables of age and sex. We included individual-level variables in our analysis because research has suggested that performance on standardized tests may vary with respect to demographic characteristics.13–17 For the final step in model, we entered uGPA, qGRE, and vGRE.
We were unable to enter geographic region and Carnegie classification into the models because the academic program variable led to cell sizes that were too small. We conducted 2 simple logistic regressions with NPTE performance as the dependent variable and Carnegie classification (as listed in the online directory of accredited programs18) or geographic region as the independent variable. Doctoral institution category and the Northeast region, respectively, served as the referent categories. Carnegie classification was not predictive of NPTE failure (
2=5.097, P=.078). Geographic region was predictive of NPTE failure (
2=33.716, P>.001), with students at institutions in the south having 1.8 times the odds of failure compared with students from northeastern institutions (95% confidence interval [CI]=1.370–2.464).
We used the –2 Log-Likelihood test (P<.05) to evaluate whether the final hierarchical models for the 2 NPTE test versions represented improvement over the constant-only model.11 We used the Hosmer and Lemeshow test (P>.05) to evaluate how well the model fit the data.11,19,20 To determine which independent variables contributed to the prediction models, Wald statistics and adjusted odds ratios were examined.20
We found that the hierarchical logistic regression analysis for version 1 of the NPTE indicated that uGPA, vGRE, qGRE, and age contributed to prediction of NPTE failure after controlling for other potential confounding variables in the model. For version 2 of the examination, the analysis indicated that uGPA, vGRE, and qGRE contributed to the prediction of NPTE failure after controlling for confounding variables. Age did not contribute to prediction in the model using data from version 2 of the NPTE. The logistic regression models for version 1 and version 2 were similar in that the ß coefficients and odds ratios for uGPA, vGRE, and qGRE were nearly identical; the odds ratios varied by 0.01 to 0.02. This indicated to us that uGPA and GRE scores were equally good at predicting NPTE failure for both versions of the NPTE. Therefore, a logistic regression model was developed using data from all students in the study that included test version as a control variable. All of the variables used in the version-specific models were included in the combined model. Because our sample size was effectively doubled over that for either version 1 or version 2, we also were able to include the race or ethnicity variable with 5 levels (African American, Asian/Pacific Islander, Hispanic, white/non-Hispanic, and "other").
For continuous independent variables (ie, uGPA, vGRE, and qGRE) identified as significant predictors of NPTE failure in the hierarchical model for the entire sample, we plotted receiver operating characteristic (ROC) curves to determine how well a variable discriminated between students who passed the NPTE on the first attempt and students who did not.21 The curves also were used to identify threshold scores for the independent variables for predicting NPTE failure.22 To identify the best cut-point thresholds for uGPA, vGRE, and qGRE, the upper left point on the ROC curve was selected. We encourage readers to examine the article by Deyo and Centor23 for a thorough description of ROC curve analysis. These thresholds were used to recode uGPA, vGRE, and qGRE into discrete variables. A final hierarchical logistic regression model was constructed using these newly coded independent variables, along with the other significant variables from the hierarchical logistic models described earlier.
First, the uGPA and GRE variables were divided into tertiles, and a logistic regression model was fit using the recoded variables. The model was evaluated for goodness of fit and contribution of the recoded variables. If dividing a variable into tertiles did not result in significant differences among the 3 score ranges, the variable was dichotomized using the best cut-point from the ROC curve analysis, and the logistic regression model was tested again. We began with the tertile method of recoding because we wanted to examine the predictive ability for reasonably small ranges of scores for each variable. We also believed that presenting the data in this manner would allow easier interpretation of the results.
Using this process, uGPA was recoded as a dichotomous variable: 3.48 or lower versus 3.49 or higher. The vGRE was recoded as a discrete variable with 3 levels: 400 or lower, 410 to 480, and 490 or higher. The qGRE score was recoded as a dichotomous variable: 530 or lower versus 540 or higher. Race or ethnicity was recoded with 2 levels: students who were white or Hispanic were categorized into one group, and all other students were placed in the second group. Test version, program, and cohort were used as control variables. After examining this final model for goodness of fit, variables contributing to the model were used to develop a prediction rule for NPTE failure.
Clinicians use clinical prediction rules to assist in making diagnoses or predicting disease or treatment outcomes.24,25 Clinical prediction rule development involves systematic collection of data from a large sample. These data are analyzed to identify the best combination of variables to predict the outcome of interest. In our study, we used the term "prediction rule" because the rule we developed is intended for use by academicians rather than clinicians. We used logistic regression to identify variables that predict NPTE failure, then used the ß coefficients to weight the importance of those variables. This approach is commonly used to develop clinical prediction rules.26–28 Finally, we tested the prediction rule using the data from all students in the data set. Data were analyzed using the Statistical Package for the Social Sciences (SPSS), version 12.0.*
Within-program analysis.
Physical therapist education programs differ with respect to curricular design, applicant pools, grading scales, and academic standards. We therefore hypothesized that the prediction of NPTE failure using uGPA and GRE scores also may vary by program. As a result, we conducted individual hierarchical regression analyses for each program. After controlling for NPTE version, individual-level variables (age, sex, and ethnicity, when permitted by sample size) were entered, followed by the key independent variables of interest (ie, uGPA, vGRE, and qGRE). Between-program variables were not applicable to individual programs.
| Results |
|---|
|
|
|---|
Of the 3,585 students in the original sample, data regarding NPTE failure were available on 3,365 students (Tab. 2). Of these, 1,965 students took version 1 and 1,389 students took version 2. Only 7 students took both versions of the NPTE. Because these 7 students first took version 1 and encountered difficulty, they were counted as taking version 1 for the between-program analysis. Ninety-three percent of the students who took version 1 of the NPTE passed on the first attempt, compared with 79% of those who took version 2.
|
|
|
|
|
|
Using ROC curves, threshold cut-points were developed for each variable contributing to prediction of NPTE failure in each academic program. The results of the logistic regression and the corresponding prediction rules for 3 selected programs are presented in Supplemental Tables 1, 2, and 3. These 3 programs were chosen to illustrate the variation in prediction across the entire sample of participating programs.
| Discussion |
|---|
|
|
|---|
The prediction rule we developed may be used to estimate students risk of failing the NPTE. A logistic regression model that accounted for differences among programs, cohorts, and test versions was used to develop the prediction rule. Thirty-nine percent of students in the sample with a prediction rule score of 1.5 or higher failed the examination. The NPTE version was the strongest predictor of NPTE failure. When data from the 2 versions were examined separately, the contributions of uGPA, vGRE, and qGRE to the prediction of NPTE failure were almost identical. This finding suggests that the newer version of the examination is more difficult, but uGPA, vGRE, and qGRE are predictive of difficulty regardless of NPTE version. "Program" was an important predictor in the model; therefore, programs should consider developing their own prediction rules using data from their own students.
Influence of Covariates on Prediction of NPTE Failure
After controlling for program, geographic region and Carnegie classification could not be included in the hierarchical logistic regression model. Results of a simple logistic regression analysis using region as the sole independent variable suggest that NPTE failure may vary by region. The results of a study by Mohr and colleagues30 suggest that other program characteristics may be predictive of student NPTE performance. They found that accreditation status, number of faculty with PhD or EdD degrees, and total program length (years of preprofessional and professional course work) were related to program pass rates on the NPTE.30 Future research should consider these and other program-level variables.
Race or ethnicity was the only student demographic characteristic that contributed to the final logistic regression model. Previous studies in physical therapist education4–7 did not include this variable in their analyses. Koenig et al16 studied records of 11,279 medical students to investigate differences in prediction of United States Medical Licensing Examination (USMLE) performance based on race or ethnicity. They developed a model using Medical College Admission Test (MCAT) scores and GPA, which over-predicted UMSLE performance for most students. However, the model under-predicted success (ie, the model predicted that the students would not pass when they actually did) for 30 African-American students.16 These findings suggest that interaction exists among MCAT scores, GPA, and race or ethnicity in predicting USMLE performance in medical students. In our study, no evidence of interaction between race or ethnicity and uGPA, vGRE, or qGRE was found. However, the proportions of students in our sample from some ethnic groups (African American=2.5%, Hispanic=2.5%) were very small, which may have reduced the statistical power to detect interactions.
Further study regarding the relationships among race or ethnicity, quantitative admissions data, and NPTE performance is recommended. From the data collected in this study, it is not possible to infer why students in some ethnic categories had increased odds of difficulty passing the NPTE. The increased odds of NPTE failure for some students in these groups may be related to socioeconomic or educational factors that we were unable to measure.
Within-Program Analysis
A variety of models emerged for predicting NPTE failure within individual programs. Some programs had not filled their classes to capacity for some cohorts, and not all programs admitted students took the NPTE. Samples from those programs included data on fewer than 150 students. Thus, the variation in program-specific models may be due to small sample sizes. Swanson et al31 reported a similar phenomenon in their study on predicting medical student performance on the USMLE. The programs also likely differ with respect to student applicant pools, admissions policies, curriculum design, methods for student testing and evaluation, and academic rigor. Further research is needed to identify factors that may contribute to the variation in prediction within schools.
Score on the vGRE was predictive of NPTE failure in 11 of the 20 programs. Scores on the qGRE were predictive in 4 programs, and uGPA was predictive in 3 programs. For each program, odds ratios, ROC curve areas, and prediction rules varied widely. This finding suggests that discrimination between students who had difficulty passing the NPTE and students who did not varied widely within programs. Program directors should analyze data from their own students whenever possible to generate program-specific thresholds and rules for prediction of NPTE failure.
Implications for Academic Programs and Students
This study provides strong evidence that uGPA, vGRE, and qGRE are predictive of failing the NPTE at least once. Undergraduate GPA reflects the student's previous academic performance, and in our other study,8 uGPA was the most consistent predictor of student academic performance in professional physical therapist programs. In this study, the most consistent predictor of NPTE failure was vGRE score. According to the Educational Testing Service, the vGRE tests the "ability to analyze and evaluate written material and synthesize information obtained from it, to analyze relationships among component parts of sentences, and to recognize relationships between words and concepts."9(p4) The questions on the NPTE require students to analyze and evaluate written material regarding clinical cases, so the skills tested by the vGRE would theoretically be helpful for a student taking the NPTE. Our study appears to support this hypothesis.
Although the GRE was not designed specifically for physical therapist students, many programs use GRE scores as part of the admissions process. An admissions test with content more relevant to physical therapy may improve the ability of the test to predict which students might have difficulty passing the NPTE. Until a better alternative is identified, the results of this study and those of the study described in our companion article8 in this issue suggest that vGRE and qGRE scores should be used by academic programs to assist in making admissions and academic training decisions. Because there was so much unexplained variance in the final logistic regression model, we are not suggesting that the prediction rule developed in this study be the sole factor in deciding which students should or should not be offered admission to an academic program. We recommend that physical therapist education programs use the prediction rule to identify students who are at risk for failing the NPTE. At-risk students then can be encouraged to utilize additional resources while completing their academic training and while preparing to take the NPTE. This additional effort may increase at-risk students likelihood of success, although additional research in this area is clearly needed.
Other authors have suggested that performance during the first year of the professional physical therapist program7 and professional program GPA6 may be useful predictors of NPTE performance. We were primarily interested in the utility of admissions data, rather than data collected during the professional physical therapist education program, for predicting NPTE performance. Our results indicate that quantitative admissions data alone account for a small amount of variance in odds of failing the NPTE. Future research that includes data about student performance in the professional program is needed to provide a more complete picture of contributors to NPTE performance.
Limitations
The between-program logistic regression model explained 28% of the variance in odds of NPTE failure. A variety of factors may account for the unexplained variance. First, we were unable to control for differences in the rigor of students undergraduate education. Studies of medical students16,31 have included variables to control for rigor of undergraduate education. Second, we did not control for how programs recorded uGPA for students who attended multiple undergraduate institutions or who took the GRE multiple times. Third, we did not include any measures of students academic performance during the professional physical therapist program. We were primarily interested in identifying predictors of NPTE performance that were measured prior to a student's training in a professional program. Dockter7 found that GPA during the first year of professional education was the strongest predictor of raw NPTE scores (r2=.420) in her sample of 107 students. Future work with larger sample sizes should examine the extent to which academic performance during professional physical therapist education predicts NPTE performance.
Research in other fields suggests that socioeconomic, educational, and psychosocial factors are likely to contribute to performance on standardized tests such as the NPTE. For example, Fadem et al32 found that parental income was related to performance on a medical student licensing examination. Gallagher et al13 found that performance on computerized versions of the GRE varies based on ethnicity, sex, and age. Sedlacek17,33 developed a noncognitive questionnaire to assess student psychosocial traits such as self-appraisal, positive self-concept, and leadership. Work by Sedlacek and colleagues17,33,34 suggests that students from different ethnic groups perform differently on standardized tests and that questionnaires such as the Noncognitive Questionnaire (NCQ) may be helpful in predicting student performance. Because students at different institutions often differ in socioeconomic and cultural backgrounds, Webb et al34 suggested that programs should develop admissions policies that take these factors into account. Because our study showed that NPTE failure varies with race or ethnicity, we concur with this recommendation.
Other study limitations are related to potential sampling and historical biases. Our study included only larger programs that use the GRE in their admissions processes. Therefore, comparisons with smaller programs may not be valid. Because the 95 eligible programs represent only half of the accredited physical therapist education programs in the United States, future research should investigate whether preadmission grades or other measures of intellectual ability predict NPTE failure for students in programs with smaller class sizes or that use preadmission tests other than the GRE.
With respect to historical bias, the NPTE was changed during the study period. We first analyzed data from the 2 versions separately, and we used test version as a control variable in all combined analyses and in the development of the prediction rule. It is important to note that academic program was predictive of NPTE failure and that test version itself was the strongest predictor. We adjusted for these variables in our prediction rule so that academic programs could use the prediction rule shown in Table 7 in lieu of calculating a program-specific prediction rule.
It is possible that the results were influenced by rapid transition of professional physical therapist programs to the DPT degree. However, degree level (MPT versus DPT) did not contribute to the prediction of NPTE failure in our analysis. Changes in the pool of students applying to physical therapist education programs also may have introduced bias. The number of students applying to physical therapist education programs declined during the years of study, while the mean uGPA declined slightly.29 During the same years, the number of students graduating from accredited programs also declined.30 This was addressed statistically by controlling for cohort in the between-program analyses and, when possible, in the within-program analyses.
The results of our analyses explained a small proportion of the variance in odds of NPTE failure. Future research should investigate other program and student characteristics that may contribute to NPTE performance. A valid method for adjusting for academic rigor would be an important contribution to the program-level analysis. In addition, qualitative approaches may be useful in identifying variables for further research and clarifying reasons why students from some demographic groups are at higher risk for failing the NPTE than others.
| Conclusion |
|---|
|
|
|---|
| Footnotes |
|---|
This work was completed in partial fulfillment of requirements for Dr Utzman's PhD degree in Health Related Sciences, Medical College of Virginia Campus, Virginia Commonwealth University.
This research was presented at the annual meeting of the Federation of State Boards of Physical Therapy; September 10, 2006; Portland, Ore.
This work was funded, in part, by the Federation of State Boards of Physical Therapy.
* SPSS Inc, 233 S Wacker Dr, Chicago, IL 60606. ![]()
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
D. L. Riddle, R. R. Utzman, D. V. Jewell, S. Pearson, and X. Kong Academic Difficulty and Program-Level Variables Predict Performance on the National Physical Therapy Examination for Licensure: A Population-Based Cohort Study Physical Therapy, November 1, 2009; 89(11): 1182 - 1191. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Sandstrom Commentary on "Use of Demographic and Quantitative Admissions Data ..." Physical Therapy, September 1, 2007; 87(9): 1194 - 1196. [Full Text] [PDF] |
||||
![]() |
R. R Utzman, D. L Riddle, and D. V Jewell Author Response Physical Therapy, September 1, 2007; 87(9): 1197 - 1198. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |