References

Abedi, J. (2006). Language issues in item development. In S. M. Downing & T. M. Haladyna (Eds.), Handbook of test development (pp. 377–398). Lawrence Erlbaum Associates Publishers.
Abedi, J., & Ewers, N. (2013). Accommodations for english language learners and students with disabilities: A research-based decision algorithm. Retrieved from https://portal.smarterbalanced.org/library/accommodations-for-english-language-learners-and-students-with-disabilities-a-research-based-decision-algorithm/.
Abedi, J., & Lord, C. (2001). The language factor in mathematics tests. Applied Measurement in Education, 14(3), 219–234.
Abedi, J., Lord, C., & Plummer, J. (1995). Language background as a variable in NAEP mathematics performance [CSE Technical Report 429]. University of California, National Center for Research on Evaluation, Standards,; Student Testing.
American Educational Research Association, American Psychological Association, & National Council on Measurement in Education. (2014). Standards for educational and psychological testing. American Educational Research Association.
American Institutes for Research. (2013). Cognitive laboratories technical report.
American Institutes for Research. (2015). Hawaii smarter balanced assessments 2014–15 technical report. Addendum to the smarter balanced technical report. Submitted to Hawaii Department of Education.
August, D., Carlo, M., Dressler, C., & Snow, C. (2005). The critical role of vocabulary development for english language learners. Learning Disabilities Research & Practice, 20(1), 50–57.
Bailey, A. L., Huang, B. H., Shin, H. W., Farnsworth, T., & Butler, F. A. (2007). Developing academic english language proficiency prototypes for 5th grade reading: Psychometric and linguistic profiles of tasks [CSE Technical Report 727]. University of California, National Center for Research on Evaluation, Standards,; Student Testing.
Bernhardt, E. (2005). Progress and procrastination in second language reading. Annual Review of Applied Linguistics, 25, 133–150.
Bhola, D. S., Impara, J. C., & Buckendahl, C. W. (2003). Aligning tests with states’ content standards: Methods and issues. Educational Measurement: Issues and Practice, 22(3), 21–29.
Birnbaum, A. (1968). Some latent trait models and their use in inferring an examinee’s ability. In F. M. Lord & M. R. Novick (Eds.), Statistical theories of mental test scores (pp. 395–479). Addison-Wesley.
Borgioli, G. M. (2008). Equity for english language learners in mathematics classrooms. Teaching Children Mathematics, 15, 185–191.
Campbell, D. T., & Fiske, D. W. (1959). Convergent and discriminant validation by the multitrait-multimethod matrix. Psychological Bulletin, 56(2), 81–105.
Cizek, G. J., & Bunch, M. B. (2007). Standard setting: A guide to establishing and evaluating performance standards on tests. Sage.
Cohen, J., & Albright, L. (2014). Smarter balanced adaptive item selection algorithm design report.
Conley, D. T., Drummond, K. V., De Gonzalez, A., Rooseboom, J., & Stout, O. (2011). Reaching the goal: The applicability and importance of the common core state standards to college and career readiness.
Connecticut State Department of Education. (2016). Developing Connecticut’s Growth Model for the Smarter Balanced Summative Assessments in English Language Arts (ELA) and Mathematics.
Connecticut State Department of Education. (2017). The Relationship Between the Smarter Balanced Grade 8 Assessments and the PSAT 8/9 Assessments.
Crocker, L. M., Miller, M. D., & Franks, E. A. (1989). Quantitative methods for assessing the fit between test and curriculum. Applied Measurement in Education, 2(2), 179–194.
Cronbach, L. J. (1971). Test validation. In R. L. Thorndike (Ed.), Educational measurement, 2nd ed. American Council on Education.
CTB/McGraw-Hill. (2013). Smarter Balanced Assessment Consortium: Technical Report Initial Achievement Level Descriptors. Retrieved from https://portal.smarterbalanced.org/library/en/technical-report-initial-achievement-level-descriptors.pdf.
Cummins, D. D., Kintsch, W., Reusser, K., & Weimer, R. (1988). The role of understanding in solving word problems. Cognitive Psychology, 20(4), 405–438.
Dana, T. M., & Tippins, D. J. (1993). Considering alternative assessments for middle level learners. Middle School Journal, 25(2), 3–5.
Darling-Hammond, L., & Pecheone, R. (2010). Developing an internationally comparable balanced assessment system that supports high-quality learning. ETS Center for K-12 Assessment; Performance Management.
Doorey, N., & Polikoff, M. (2016). Evaluating the content and quality of next generation assessments. In Thomas B. Fordham Institute. Retrieved from https://eric.ed.gov/?id=ED565742.
Dorans, N. J., & Kulick, E. (1983). Assessing unexpected differential item performance of female candidates on SAT and TSWE forms administered in December 1977: An application of the standardization approach (ETS Research Report RR-83-09).
Dorans, N. J., & Kulick, E. (1986). Demonstrating the utility of the standardization approach to assessing unexpected differential item performance on the Scholastic Aptitude Test. Journal of Educational Measurement, 23(4), 355–368.
Educational Testing Service. (2014). Pilot Test Data Analysis Results: Dimensionality Study and IRT Model Comparison [ETS Research Report].
Educational Testing Service. (2015a). Linking study between smarter balanced mathematics field test and CSU entry level math test. A memorandum prepared for california state university [ETS Research Report].
Educational Testing Service. (2015b). Study of the relationship between the early assessment program and the smarter balanced field tests [ETS Research Report].
Fedorchak, G. (2012). Access by design—implications for equity and excellence in education. Draft paper prepared for the Smarter Balanced Assessment Consortium.
Forster, K. I., & Olbrei, I. (1973). Semantic heuristics and syntactic analysis. Cognition, 2(3), 319–347.
Gaffney, T. (2015). Dimensionality of the SBAC: An argument for its validity. Presentation for the CAASPP-CAHSEE Technical Advisory Group; California Department of Education.
Haertel, E. H. (1999). Validity arguments for high-stakes testing: In search of the evidence. Educational Measurement: Issues and Practice, 18(4), 5–9.
Hansen, E. G., & Mislevy, R. J. (2008). Design patterns for improving accessibility for test takers with disabilities (pp. i–32) [ETS Research Report]. https://doi.org/10.1002/j.2333-8504.2008.tb02135.x
Holland, P. W., & Thayer, D. T. (1988). Differential item performance and the mantel-haenszel procedure. In H. Wainer & H. I. Braun (Eds.), Test validity. Lawrence Erlbaum Associates, Inc.
Houts, C., & Cai, L. (2016). flexMIRT: Flexible multilevel item factor analysis and test scoring user’s manual. Seattle, WA: Vector Psychometric Group.
HumRRO. (2016). Smarter Balanced Assessment Consortium: Alignment Study Report. Retrieved from https://portal.smarterbalanced.org/library/smarter-balanced-assessment-consortium-alignment-study-report/.
Kane, M. T. (2006). Validation. In R. L. Brennan (Ed.), Educational measurement, 4th ed. American Council on Education/Praeger.
Kane, M. T. (2013). Validating the interpretations and uses of test scores. Journal of Educational Measurement, 50(1), 1–73.
Kopriva, R. (2010). Building on student strengths or how to test ELs against challenging math (and science) standards when they don’t have the english yet. Paper presented at the Common Core State Standards Implementation Conference. Arlington, VA.
Kurlaender, M., Kramer, K. A., & Jackson, E. (2018, March). Predicting college success: How do different high school assessments measure up? Stanford Graduate School of Education; Policy Analysis for California Education.
Lewis, D. M., Mitzel, H. C., Mercado, R. L., & Schulz, E. M. (2012). The bookmark standard setting procedure. In G. J. Cizek (Ed.), Setting performance standards: Foundations, methods, and innovations (pp. 245–273). Routledge.
Mantel, N. (1963). Chi-square tests with one degree of freedom; extensions of the mantel-haenszel procedure. Journal of the American Statistical Association, 58(303), 690–700.
Mantel, N., & Haenszel, W. (1959). Statistical aspects of the analysis of data from retrospective studies of disease. Journal of the National Cancer Institute, 22(4), 719–748.
Martone, A., & Sireci, S. G. (2009). Evaluating alignment between curriculum, assessment, and instruction. Review of Educational Research, 79(4), 1332–1361.
Messick, S. (1989). Validity. In R. L. Linn (Ed.), Educational measurement, 3rd ed. American Council on Education.
MetaMetrics. (2016a). Linking the Smarter Balanced English Language Arts/Literacy Summative Assessment with the Lexile Framework for Reading.
MetaMetrics. (2016b). Linking the Smarter Balanced Mathematics Summative Assessment with the Quantile Framework for Mathematics.
Michaelides, M. P. (2008). An illustration of a mantel-haenszel procedure to flag misbehaving common items in test equating. Practical Assessment, Research, and Evaluation, 13(7).
Mislevy, R. J., & Haertel, G. D. (2006). Implications of evidence-centered design for educational testing. Educational Measurement: Issues and Practice, 25(4), 6–20.
Mislevy, R. J., Steinberg, L. S., & Almond, R. (2003). On the structure of educational assessments. Measurement: Interdisciplinary Research and Perspectives, 1(1), 3–67.
Muraki, E. (1992). A generalized partial credit model: Application of an EM algorithm. ETS Research Report Series, 1992(1), i–30.
Muraki, E., & Bock, R. D. (1997). PARSCALE 4.1: IRT-Based Item Analysis and Test Scoring for Rating-Scale Data. Scientific Software International, Inc.
National Center for Research on Evaluation, Standards, and Student Testing. (2016). External validity: Analysis of existing external measures.
National Center for Research on Evaluation, Standards, and Student Testing. (2018). Longitudinal Analysis of SBAC Achievement Data (2015 and 2016). Retrievable from Smarter Balanced website upon approval for posting.
National Governors Association Center for Best Practices & Council of Chief State School Officers. (2016). Development process. Washington, DC. Retrieved from http://www.corestandards.org/about-the-standards/development-process/.
Pitoniak, M. J., Young, J. W., Martiniello, M., King, T. C., Buteux, A., & Ginsburgh, M. (2009). Guidelines for the assessment of english language learners.
Rose, D., & Meyer, A. (2000). Universal design for learning. Journal of Special Education Technology, 15, 67–70.
Rothman, R., Slattery, J. B., Vranek, J. L., & Resnick, L. B. (2002). Benchmarking and alignment of standards and testing [CSE Technical Report]. National Center for Research on Evaluation, Standards,; Student Testing.
Russell, M. (2011). Digital test delivery: Empowering accessible test design to increase test validity for all students. Paper Prepared for Arabella Advisors.
Schachter, P. (1983). On syntactic categories. Indiana University Linguistics Club.
Schmeiser, C. B., & Welch, C. J. (2006). Test development. In R. L. Brennan (Ed.), Educational measurement, 4th ed. American Council on Education/Praeger.
Schultz, S. R., Michaels, H. R., Dvorak, R. N., & Wiley, C. R. H. (2016). Evaluating the content and quality of next generation high school assessments. Final Report.
Shafer Willner, L., & Rivera, C. (2011). Are EL needs being defined appropriately for the next generation of computer-based tests ? AccELLerate!, 3(2), 12–14.
Sireci, S. G. (1998). Gathering and analyzing content validity data. Educational Assessment, 5(4), 299–321.
Sireci, S. G. (2012). Smarter Balanced Assessment Consortium: Comprehensive research agenda.
Sireci, S. G. (2013). Agreeing on validity arguments. Journal of Educational Measurement, 50(1), 99–104.
Smarter Balanced. (2010). Race to the top assessment program application for new grants: Comprehensive assessment systems.
Smarter Balanced. (2012a). General accessibility guidelines. Retrieved from https://portal.smarterbalanced.org/library/general-accessibility-guidelines/.
Smarter Balanced. (2012b). Guidelines for accessibility for english language learners. Retrieved from https://portal.smarterbalanced.org/library/guidelines-for-accessibility-for-english-language-learners/.
Smarter Balanced. (2013a). ELA/Literacy ALDs and College Content-Readiness Policy. Retrieved from https://portal.smarterbalanced.org/library/elaliteracy-alds-and-college-content-readiness-policy/.
Smarter Balanced. (2013b). Mathematics ALDs and College Content-Readiness Policy. Retrieved from https://portal.smarterbalanced.org/library/mathematics-alds-and-college-content-readiness-policy/.
Smarter Balanced. (2013c). Technical report: Initial achievement level descriptors. Retrieved from https://portal.smarterbalanced.org/library/technical-report-initial-achievement-level-descriptors/.
Smarter Balanced. (2015a). End of grant report. Retrieved from https://portal.smarterbalanced.org/library/end-of-grant-report/.
Smarter Balanced. (2015b). Item and task specifications. Retrieved from http://www.smarterbalanced.org/assessments/development/.
Smarter Balanced. (2015c). Style guide. Retrieved from https://portal.smarterbalanced.org/library/style-guide-for-smarter-balanced-assessments/.
Smarter Balanced. (2016a). 2013-2014 technical report. Retrieved from https://portal.smarterbalanced.org/library/2013-14-technical-report/.
Smarter Balanced. (2016b). Accessibility and accommodations framework. Retrieved from https://portal.smarterbalanced.org/library/accessibility-and-accommodations-framework/.
Smarter Balanced. (2016c). Administration and Registration Tools (ART) User’s Guide. Version 1.3. Retrieved from https://portal.smarterbalanced.org/library/administration-and-registration-tools-art-user-guide/.
Smarter Balanced. (2016d). Test administrator user guide. Version 1.0. Retrieved from https://portal.smarterbalanced.org/library/en/v1.0/test-administrator-user-guide.docx.
Smarter Balanced. (2017a). Achievement level setting final report. Retrieved from http://www.smarterbalanced.org/assessments/development/additional-technical-documentation/.
Smarter Balanced. (2017b). English Language Arts/Literacy Content Specifications. Retrieved from https://portal.smarterbalanced.org/library/english-language-artsliteracy-content-specifications/.
Smarter Balanced. (2017c). Interpretation and use of scores and achievement levels. Retrieved from https://portal.smarterbalanced.org/library/interpretation-and-use-of-scores-and-achievement-levels/[Available to members only].
Smarter Balanced. (2017d). Mathematics content specifications. Retrieved from https://portal.smarterbalanced.org/library/mathematics-content-specifications/.
Smarter Balanced. (2018a). 2018-19 Paper-Pencil Test Administration Manual: ELA Form 4, Non-secure (Primary). Version 1.0. Retrieved from https://portal.smarterbalanced.org/library/p-p-tam-ela-form-3-non-secure/.
Smarter Balanced. (2018b). 2018-19 Paper-Pencil Test Administration Manual: Math Form 4 (Primary). Version 1.0. Retrieved from https://portal.smarterbalanced.org/library/en/p-p-tam-math-form-3.docx.
Smarter Balanced. (2021a). Interpretative Guide for English Language Arts/Literacy and Mathematics Assessments. Retrieved from https://portal.smarterbalanced.org/library/en/reporting-system-interpretive-guide.pdf[Available to members only].
Smarter Balanced. (2021b). Online summative test administration manual. Retrieved from https://portal.smarterbalanced.org/library/en/v3.0/online-summative-test-administration-manual.docx.
Smarter Balanced. (2021c). Smarter balanced reporting system user guide. Retrieved from https://portal.smarterbalanced.org/library/reporting-system-user-guide/[Available to members only].
Smarter Balanced. (2022a). Bias and sensitivity guidelines. Retrieved from https://portal.smarterbalanced.org/library/bias-and-sensitivity-guidelines/.
Smarter Balanced. (2022b). Member Procedures Manual. Retrieved from https://portal.smarterbalanced.org/library/member-procedures-manual/.
Smarter Balanced. (2022c). Smarter balanced scoring specifications for summative and interim assessments. Retrieved from https://technicalreports.smarterbalanced.org/scoring_specs/_book/scoringspecs.html.
Smarter Balanced. (2022d). Usability, accessibility, and accommodations guidelines. Version 5.2. Retrieved from change log at https://portal.smarterbalanced.org/library/usability-accessibility-and-accommodations-guidelines/.
Smarter Balanced. (2022e). Usability, Accessibility, and Accommodations Implementation Guide (UAAG). Retrieved from https://portal.smarterbalanced.org/library/usability-accessibility-and-accommodations-implementation-guide/.
South Dakota Board of Regents. (2017). Revisions to BOR Policy 2:3: System Undergraduate Admissions (Second Reading). Retrieved from https://www.sdbor.edu/the-board/agendaitems/2014AgendaItems/2017%20Agenda%20Items/August1017/6_E_BOR0817.pdf.
Thompson, S. J., Johnstone, C. J., & Thurlow, M. L. (2002). Universal design applied to large scale assessments. Synthesis report.
Thurlow, M. L., Quenemoen, R. F., & Lazarus, S. (2011). Meeting the needs of special education students: Recommendations for the race to the top consortia and states. Paper prepared for Arabella Advisors.
Washington Office of Superintendent of Public Instruction. (2016). The Relationship between Smarter Balanced Test Scores and Grades in ELA and Mathematics Courses.
WestEd Standards, Assessment, and Accountability Services Program. (2017). Evaluation of the alignment between the common core state standards and the smarter balanced assessment consortium summative assessments for grades 3, 4, 6, and 7 in english language arts/literacy and mathematics. Retrieved from https://portal.smarterbalanced.org/library/wested-alignment-evaluation/.
Young, J. W. (2008). Ensuring valid content tests for english language learners. R&D Connections, No. 8.
Zhang, T., Haertel, G., Javitz, H., Mislevy, R. J., & Wasson, J. (2009). A design pattern for a spelling assessment for students with disabilities. Paper presented at the annual conference of the American Psychological Association, Montreal, Canada.