How Massachusetts Scores Curriculum Evidence: Inside the CURATE Bid's 50/50 Split

Half the Score, None of the Roadmap: How Massachusetts Weighs Curriculum Evidence — LXD Research

Massachusetts has built one of the country’s most evidence-forward state procurement processes for curricular materials — and most publishers haven’t fully reckoned with what it’s asking for. The state’s CURATE bid splits its scored evaluation evenly between alignment to Massachusetts standards and whether a curriculum is “efficacious in implementation.” Each is worth half the score. The structural challenge — and the competitive opening — lives in that second half.

The Scorecard

How Is the Massachusetts CURATE Bid Actually Scored?

The Massachusetts scorecard (Bid #23MACISGW2, currently on Amendment 6) is refreshingly simple in its scoring architecture. After clearing required qualifications, bidders are evaluated on just two scored criteria, each on a 0–5 scale multiplied by 10 for a maximum of 50 points apiece.

Criterion 1

50 pts

Alignment to Massachusetts learning standards and WIDA 2020 English language development standards.

Criterion 2

50 pts

Efficacy in implementation and adherence to submission requirements.

Total Available

100 pts

A publisher scoring “Outstanding” on both lands at 100. Outstanding on one and Below Average on the other lands near 70.

The efficacy criterion is not a tiebreaker or a bonus. It is, structurally, half the available score. And it’s where Massachusetts diverges sharply from states like Michigan that have built more incremental, transparent scoring around evidence.

How Does Massachusetts’s Approach Differ From Michigan’s?

Michigan’s framework, which we covered earlier, is scored and incremental. Higher ESSA tiers earn more points, lower tiers earn fewer, and no evidence earns zero. The pathway is visible: publishers can look at the rubric and see exactly how Tier I evidence outperforms Tier III evidence in scoring.

Massachusetts uses a different model. The state has clearly articulated three pathways for the alignment half of the score — an existing CURATE report for “enhanced” versions, an EdReports review, or an independent third-party review evaluated against gateway evaluator criteria. Six amendments to the bid have refined this side of the process in detail, including a framework defining Minor, Moderate, and Substantive product changes.

For the efficacy half, the bid documents define no comparable pathway. The term “efficacious in implementation” is not defined. No tiered evidence framework is referenced.

Element	Michigan	Massachusetts
Scoring model	Incremental, tiered points by ESSA level	50/50 split between alignment and efficacy
Alignment pathway	Defined via Science of Reading + HQIM criteria	Three explicit pathways, refined across six amendments
Evidence pathway	ESSA Tier I–IV explicitly scored on a worksheet	Undefined; the term “efficacious in implementation” is not specified

The contrast matters. Michigan tells publishers exactly how evidence will be scored. Massachusetts tells publishers that evidence is half the score, and leaves the how largely to interpretation.

The Core Concept

What Does “Efficacious in Implementation” Actually Mean?

The phrasing itself does most of the work. “Efficacious in implementation” points away from theoretical or design-based arguments and toward evidence from real classroom use — students, outcomes, comparison conditions. That maps cleanly onto ESSA’s evidence framework, where Tiers I–III (Strong, Moderate, and Promising) all require studies measuring student outcomes against a counterfactual.

EdReports and CURATE reviews, which dominate the alignment half of the score, are not efficacy studies. They are expert-panel reviews of instructional materials against design and standards criteria. They evaluate whether a curriculum looks rigorous on paper. They cannot, by design, tell you whether students who used it outperformed students who didn’t.

Massachusetts seems to recognize this distinction in its own language. The bid documents repeatedly refer to the gateway hurdle as the “quality threshold” — not the evidence threshold. Quality and efficacy are scored as two different things, and gateway reviews satisfy only the first.

Half 1 — 50 pts

Quality Reviews

Evaluate curriculum design, standards alignment, and content rigor. EdReports, CURATE, and gateway third-party reviews. Expert panel methodology. Satisfy the first 50 points.

Half 2 — 50 pts

Efficacy Studies

Measure student outcomes against a comparison condition. ESSA Tiers I–III. Require real classroom data, matched groups, statistical analysis. Address the second 50 points.

The Opening

What Are Publishers Asking About in the Bid Q&A — And What Aren’t They Asking?

The publisher Q&A document attached to the bid is revealing. Publishers have asked DESE dozens of detailed questions: which third-party evaluators count as comparable to EdReports, what qualifies as a “minor” versus “substantive” enhancement, how to handle K–5 foundational skills attestations, whether state-level reviews can substitute for gateway reports. The alignment side has been pressure-tested from every angle.

Across all of those exchanges, not a single question asks what “efficacious in implementation” means or how to demonstrate it.

That silence is the opening. If most publishers in the bidding pool are organizing their submissions around the questions they’ve already asked — alignment pathways, gateway logistics, submission formatting — then publishers who arrive with actual ESSA-aligned efficacy studies are competing for the second 50 points largely uncontested. A submission with rigorous research evidence isn’t just stronger on its own merits. It’s stronger relative to a field that hasn’t yet engaged with the criterion at all.

What Does a Strong Efficacy Submission Actually Look Like?

With no defined pathway in the bid documents, publishers have latitude — and responsibility — to demonstrate efficacy on their own terms. A submission that takes the criterion seriously borrows from the standards already used by ESSA, by Evidence for ESSA, and by states like Michigan that have made their requirements explicit. In practice, that means a package built around five core elements.

Anatomy of a Strong Efficacy Submission

An ESSA tier classification with clear justification. Tier II (Moderate) is the most achievable bar for most curriculum publishers — it requires a quasi-experimental study comparing students who used the product to a matched group who didn’t, without a randomized trial. Name the tier explicitly, and show your work.

A study design narrative that names the comparison group. Reviewers want to know whether treatment and comparison groups were similar at baseline, how matching was done, and what the analysis controlled for. A study without a credible comparison condition can’t make a causal claim — and shouldn’t be presented as if it can.

Results documented with effect sizes, statistical significance, and sample size. A statistically significant positive result on a meaningful outcome measure is the core claim. The number of students, schools, and districts in the study tells reviewers how much weight that result carries.

A population match to Massachusetts. A study run in districts demographically and instructionally similar to Massachusetts students will carry more weight than one run in an unrepresentative sample. This is the factor most often overlooked in submissions and most often probed by careful reviewers.

An open, accessible link to the full study report. Studies behind paywalls or proprietary firewalls put reviewers in the position of taking claims on trust. Submissions that link directly to published reports — ideally with independent or peer-reviewed credibility — are easier to evaluate and harder to dismiss.

A submission organized this way isn’t an academic paper, and it isn’t a marketing one-pager. It’s a structured argument that the curriculum produced measurable outcomes for students like those Massachusetts serves.

What Does This Mean for Publishers Bidding in Massachusetts?

The Massachusetts cycle runs on rolling submissions through August 2027, with priority deadlines tied to each school year’s content priorities. Publishers preparing submissions for the October 1, 2026 priority deadline — for Spring 2027 review — are facing this scorecard right now, across ELA/Literacy K–12, Mathematics K–8, Science and Technology/Engineering K–8, and 9–12 History and Social Science.

For a publisher with strong EdReports green ratings but no efficacy study, the practical ceiling is roughly half the available scored points. Strong alignment evidence can deliver the first 50. The second 50 require something structurally different: actual classroom evidence of student outcomes.

The publishers who recognize this — and who arrive with rigorous research alongside their alignment documentation — are positioned to pull ahead of a field that hasn’t yet caught up to what the scorecard is actually asking for.

Where Does Your Evidence Stand Against the Massachusetts Criteria?

The time to contact us is today. Fill out our contact form to schedule a meeting with Dr. Rachel and assess where your research stands against the Massachusetts criteria — before you submit.

Meet With Dr. Rachel View Our Services

How Massachusetts Scores Curriculum Evidence: Inside the CURATE Bid’s 50/50 Split

How Is the Massachusetts CURATE Bid Actually Scored?

How Does Massachusetts’s Approach Differ From Michigan’s?

What Does “Efficacious in Implementation” Actually Mean?

What Are Publishers Asking About in the Bid Q&A — And What Aren’t They Asking?

What Does a Strong Efficacy Submission Actually Look Like?

What Does This Mean for Publishers Bidding in Massachusetts?

Where Does Your Evidence Stand Against the Massachusetts Criteria?