Abedi, J., & Ewers, N. (2013). Accommodations for English language learners and students with disabilities: A research-based decision algorithm. Retrieved from
American Educational Research Association, American Psychological Association, & National Council on Measurement in Education. (2014). Standards for educational and psychological testing. American Educational Research Association.
Bhola, D. S., Impara, J. C., & Buckendahl, C. W. (2003). Aligning tests with states’ content standards: Methods and issues. Educational Measurement: Issues and Practice, 22(3), 21–29.
Campbell, D. T., & Fiske, D. W. (1959). Convergent and discriminant validation by the multitrait-multimethod matrix. Psychological Bulletin, 56(2), 81–105.
Cole, N. S., & Zieky, M. J. (2001). The new faces of fairness. Journal of Educational Measurement, 38(4), 369–382.
Crocker, L. M., Miller, M. D., & Franks, E. A. (1989). Quantitative methods for assessing the fit between test and curriculum. Applied Measurement in Education, 2(2), 179–194.
Cronbach, L. J. (1971). Test validation. In R. L. Thorndike (Ed.), Educational measurement, 2nd ed. American Council on Education.
Darling-Hammond, L., & Pecheone, R. (2010). Developing an internationally comparable balanced assessment system that supports high-quality learning. ETS Center for K-12 Assessment & Performance Management.
Doorey, N., & Polikoff, M. (2016). Evaluating the content and quality of next generation assessments. Thomas B. Fordham Institute. Retrieved from
Fedorchak, G. (2012). Access by design—implications for equity and excellence in education. Draft paper prepared for the Smarter Balanced Assessment Consortium.
Haertel, E. H. (1999). Validity arguments for high-stakes testing: In search of the evidence. Educational Measurement: Issues and Practice, 18(4), 5–9.
HumRRO. (2016a). Evaluating the Content and Quality of Next Generation High School Assessments. Final Report. Retrieved from
HumRRO. (2016b). Smarter Balanced Assessment Consortium: Alignment Study Report. Retrieved from
Kane, M. T. (2006). Validation. In R. L. Brennan (Ed.), Educational measurement, 4th ed. American Council on Education/Praeger.
Kane, M. T. (2013). Validating the interpretations and uses of test scores. Journal of Educational Measurement, 50(1), 1–73.
Linn, R. L. (2006). The standards for educational and psychological testing: Guidance in test development. In S. M. Downing & T. M. Haladyna (Eds.), Handbook of test development (pp. 27–38). Mahwah, NJ: Lawrence Erlbaum.
Martineau, G. (2016a). Options for measuring achievement growth using Smarter Balanced summative test scores. A presentation delivered at a Smarter Balanced webinar on growth measures. Retrieved from
Martineau, G. (2016b). A guide to understanding and selecting measures of growth for Smarter Balanced members. The National Center for the Improvement of Educational Assessment. Dover, NH. Retrieved from
Martone, A., & Sireci, S. G. (2009). Evaluating alignment between curriculum, assessment, and instruction. Review of Educational Research, 79(4), 1332–1361.
Messick, S. (1989). Validity. In R. L. Linn (Ed.), Educational measurement, 3rd ed. American Council on Education.
Mislevy, R. J. (2009). Validity from the Perspective of Model-Based Reasoning. CRESST Report 752. In National Center for Research on Evaluation, Standards, and Student Testing (CRESST). ERIC.
Rothman, R., Slattery, J. B., Vranek, J. L., & Resnick, L. B. (2002). Benchmarking and alignment of standards and testing [CSE Technical Report]. National Center for Research on Evaluation, Standards, and Student Testing.
Russell, M. (2011). Digital test delivery: Empowering accessible test design to increase test validity for all students. Paper prepared for Arabella Advisors.
Schmeiser, C. B., & Welch, C. J. (2006). Test development. In R. L. Brennan (Ed.), Educational measurement, 4th ed. American Council on Education/Praeger.
Shafer Willner, L., & Rivera, C. (2011). Are EL needs being defined appropriately for the next generation of computer-based tests? AccELLerate!, 3(2), 12–14.
Sireci, S. G. (1998). Gathering and analyzing content validity data. Educational Assessment, 5(4), 299–321.
Sireci, S. G. (2012). Smarter Balanced Assessment Consortium: Comprehensive research agenda.
Sireci, S. G. (2013). Agreeing on validity arguments. Journal of Educational Measurement, 50(1), 99–104.
Smarter Balanced. (2014). Accessibility and accommodations framework. Retrieved from
Smarter Balanced. (2016). Item and task specifications bibliography. Retrieved from
Smarter Balanced. (2017a). English Language Arts/Literacy Content Specifications. Retrieved from
Smarter Balanced. (2017b). Mathematics content specifications. Retrieved from
Smarter Balanced. (2017c). Reporting achievement level descriptors. Retrieved from
Smarter Balanced. (2019a). Draft ELA/Literacy Achievement Level Descriptors (ALDs): Grades 9 and 10. Version 1.0. Retrievable from
Smarter Balanced. (2019b). Draft Mathematics Achievement Level Descriptors (ALDs): Grades 9 and 10. Version 1.2. Retrievable from
Smarter Balanced. (2022). 2020-21 summative technical report. Retrieved from
Smarter Balanced. (2023a). Online summative test administration manual. Retrieved from
Smarter Balanced. (2023b). Smarter Balanced scoring specifications for summative and interim assessments. Retrieved from
Smarter Balanced. (2023c). Usability, Accessibility, and Accommodations Guide. Retrieved from
Thompson, S. J., Johnstone, C. J., & Thurlow, M. L. (2002). Universal design applied to large scale assessments. Synthesis report.
Thurlow, M. L., Quenemoen, R. F., & Lazarus, S. (2011). Meeting the needs of special education students: Recommendations for the Race to the Top consortia and states. Paper prepared for Arabella Advisors.
WestEd Standards, Assessment, and Accountability Services Program. (2017). Evaluation of the alignment between the Common Core State Standards and the Smarter Balanced Assessment Consortium summative assessments for grades 3, 4, 6, and 7 in English language arts/literacy and mathematics. Retrieved from
Whitely, S. E. (1983). Construct validity: Construct representation versus nomothetic span. Psychological Bulletin, 93(1), 179–197.