Book Chapters
- Jiao, H., Xu, S., & Liao, M. (2024). Exploration of the Stacking Ensemble Learning Algorithm for Automated Scoring of Constructed-Response Items in Reading Assessment. In Shermis, W., Wilson, J. (Eds.) The Routledge International Handbook of Automated Essay Evaluation. Routledge.
- Qiao, X., Liao, M., & Jiao, H. (2021). Nonlinear Latent Effects in Diagnostic Classification Modeling Incorporating Response Times. In Wiberg, M., Molenaar, D., González, J., Böckenholt, U., Kim, JS. (Eds) Quantitative Psychology. Springer Proceedings in Mathematics & Statistics, vol 353. Springer, Cham.
- He, Q., Liao, D., Ling, H. K., & Jiao, H. (2021). Evaluating consistency of behavioral patterns across multiple tasks using process data: A case study in PIAAC. In L. Khorramdel, M. von Davier, K. Yamamoto. (Eds.) Innovative Computer-based International Large-Scale Assessments – Foundations, Methodologies and Quality Assurance Procedures.
- Jiao, H., Liao*, D., & Zhan*, P. (2019). Utilizing process data for cognitive diagnosis. In M. von Davier & Y. Lee (Eds.), Handbook of Diagnostic Classification Models.
- He, Q., Liao*, D., & Jiao, H. (2019). Clustering behavioral patterns using process data in PIAAC problem-solving items. In B. P. Veldkamp & C. Sluijter (Eds.). Theoretical and Practical Advances in Computer-Based Educational Measurement. Springer. Methodology of Educational Measurement and Assessment (book series), Springer.
- Jiao, H., & Li, C. (2018). Progress in International Reading Literacy Study (PIRLS) data. In The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation. Thousand Oaks, CA: Sage.
- Jiao, H., & Liao, D. (2018). Testlet response theory. In The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation. Thousand Oaks, CA: Sage.
- Qiao, X., & Jiao, H. (2018). Review of the book Bayesian Psychometric Modeling, by Levy. R & Mislevy. R. J. Measurement: Interdisciplinary Research and Perspectives, 16, 135-13
- Li, Y., Xiao, T., Liao, D., & Lee, M.-L. (2017). Using threshold regression to analyze survival data from complex surveys: with application to mortality linked NHANES III Phase II genetic data. Statistics in Medicine.
- Jiao, H., Lissitz, R. W., & Zhan*, P. (2017). Calibrating innovative items embedded in multiple contexts. In H. Jiao & R.W. Lissitz (Eds.), Technology-enhanced innovative assessment: Development, modeling, scoring from an interdisciplinary perspective. Charlotte, NC: Information Age Publishing.
- Jiao, H., Kamata, A., & Xie, C. (2015). A multilevel cross-classified testlet model for complex item and person clustering in item response modeling. In J. Harring, L. Stapleton, & S. Beretvas (Eds.), Advances in multilevel modeling for educational research: Addressing practical issues found in real-world applications (pp.139-161). Charlotte, NC: Information Age Publishing.
- Luo, Y., Jiao, H., & Lissitz, R. W. (2015). An empirical study of the impact of the choice of persistence model in value-added modeling upon teacher effect estimates. In L. A. van der Ark, D. Bolt, W.-C. Wang, J. A. Douglas & S.-M. Chow (Eds.), Quantitative psychology research (pp.133-143). Springer, Switzerland.
- Jiao, H., & Chen*, Y.-F. (2014). Differential item and testlet functioning. In A. Kunnan (Ed.),The Companion to Language Assessments (pp.1282-1300). John Wiley & Sons, Inc.
- Chen*, Y.-F., & Jiao, H. (2014). Does model misspecification lead to spurious latent classes? An evaluation of model comparison indices. In R. E. Millsap et al. (Eds.), New development in quantitative psychology, Springer Proceedings in Mathematics & Statistics, 66, DOI 10.1007/978-1-4614-9348-8_22, Springer Science +Business Media, New York.
- Jiao, H., & Lissitz, R. W. (2014). Direct modeling of student growth with multilevel and mixture extensions. In R. W. Lissitz & H. Jiao (Eds.), Value added modeling and growth modeling with particular application to teacher and school effectiveness. Charlotte: Information Age Publishing Inc.
- Chen, Y.-F., & Jiao, H. (2014). Does model misspecification lead to spurious latent classes? An evaluation of model comparison indices. In R. E. Millsap et al. (Eds.), New development in quantitative psychology, Springer Proceedings in Mathematics & Statistics, 66., Springer Science +Business Media, New York
- Jiao, H., & Lissitz, R. W. (2012). Computer-based testing in K-12 state assessments: An Introduction. In R. W. Lissitz & H. Jiao (Ed.), Computers and their impact on state assessment: Recent history and predictions for the future (pp. 1-21). Charlotte, NC: Information Age Publisher.
- Lissitz, R. W., & Caliço, T. (2012). Validity is an action verb: Commentary on: Clarifying the consensus definition of validity. Journal of Measurement: Interdisciplinary Research and Perspectives, 10, 75-79.
- Schafer, W. D., Lissitz, R. W., Zhu, X., Zhang, Y., Hou, X., & Li, Y. (2012). Evaluating teachers and schools using student growth models. Practical Assessment, Research & Evaluation, 17(17), 2.
- Templin, J. & Jiao, H. (2011). Applying model-based approaches to identify performance categories. In G. Cizek (Ed.), Setting performance standards: foundations, methods, and innovations (pp. 379-397). New York, NY: Routlege.
- Jiao, H., Lissitz, R. W., Macready, G., Wang, S. & Liang, S. (2011) Comparing the use of mixture Rasch modeling and judgmental procedures for standard setting. Psychological Test and Assessment Modeling, 53, 499-522.
- Lissitz, R. W., & Li, F. F. (2011). Standard setting in complex performance assessments: An approach aligned with cognitive diagnostic models. Psychological Test and Assessment Modeling, 53, 461-485.
- Fan, W., & Lissitz, R. W. (2010). A multilevel analysis of students and schools on high school graduation exam: A case of Maryland. International Journal of Applied Educational Studies, 9, 1-18.
- Jiao, H., & Wang, S. (2010). A multifaceted approach to investigating the equivalence between computer-based and paper-and-pencil assessments: An example of Reading Diagnostics. International Journal of Learning Technology, 5, 264-288.
- Lissitz, R. W., & Wei, Hua (2008). Consistency of Standard Setting in an Augmented State Testing System. Educational Measurement: Issues and Practice, 27, 46-56.
- Jiao, H., Wang, S., Kamata, A. (2007). Modeling local item dependence with the hierarchical generalized linear model. In E. V. Smith & R. M. Smith (Eds.), Rasch Measurement: Advanced and Specialized Applications. JAM press.
- Schafer, W. D., Liu, M., & Wang, H. (2007). Content and Grade Trends in State Assessments and NAEP. Practical Assessment Research & Evaluation , 12.
- Lissitz, R., Doran, H., Schafer, W., & Wilhoft, J.(2006). Growth modeling, value added modeling, and linking: An introduction. In Lissitz, R. W. (Ed.), Longitudinal and value-added models of student performance (pp. 1-46). Maple Grove, MN: JAM Press.
- Schafer, W. (2006). Growth Scales as Alternative to Vertical Scale. Practical Assessment Research & Evaluation , 11.
- Schafer, W., & Twing, J. (2006). Growth scales and pathways. In Lissitz, R. W. (Ed.), Longitudinal and value-added models of student performance (pp. 321-345). Maple Grove, MN: JAM Press.
- Walston, J., Lissitz, R. W., & Rudner, L. (2006). The Influence of Web-based Questionnaire Presentation Variations on Survey Cooperation and Perceptions of Survey Quality. The Journal of Official Statistics, 22, 271-291.
- Li, Y., & Schafer, W. (2005). Increasing the homogeneity of CAT's item-exposure rates by minimizing or maximizing varied target functions while assembling shadow tests. Journal of Educational Measurement , 42, 245-269.
- Schafer, W. (2005). Technical documentation for alternate assessments (2005). Practical Assessment Research & Evaluation , 10.
- Shafer, W., Gagne, P. & Lissitz, R. (2005). Resistance to Confounding Style and Content in Scoring Constructed Response Items. Educational Measurement: Issues and Practice, 24, 22-28.
- Jiao, H. (2004). Evaluating the Dimensionality of the Michigan English Language Assessment Battery. Spaan Fellow Working Papers in Second or Foreign Language Assessment: Volume 2 (pp. 27-52). University of Michigan, Ann Arbor, MI.