Most edtech companies assume the goal is always to climb as high as possible on the ESSA evidence ladder — that Tier I is the destination and everything else is a stepping stone. That framing misses something important. For the majority of district procurement decisions, ESSA Tier III is not the consolation prize. It’s what buyers are actually looking for. And a well-constructed portfolio of Tier III studies can be more persuasive than a single higher-tier study conducted in one context with one population.
What Does ESSA Tier III — Promising Evidence — Actually Require?
ESSA’s Tier III designation — officially called “Promising Evidence” — requires a study that measures student outcomes before and after using the product, uses statistical controls to account for pre-existing differences and sources of bias in the student population, and demonstrates a statistically significant positive effect. What it does not require is a large effect size or a separate comparison group of students who didn’t use the product.
That last point is worth sitting with. The effect doesn’t need to be dramatic — it needs to be real, positive, and statistically distinguishable from chance. A modest but consistent and well-controlled result clears the bar. What matters is that the analysis accounts for bias, the direction of the effect is positive, and the finding is statistically significant. Reviewers are not grading on magnitude.
That distinction matters enormously for logistics. A Tier III study works with the students and classrooms you already have. There’s no need to identify a parallel population, negotiate a matched comparison group across schools, or ask any district to withhold your product from anyone. If your product has been implemented in a school and that school collects pre- and post-assessment data — which virtually every school does — the raw material for a Tier III study may already exist.
The three things a Tier III study must demonstrate: The effect is statistically significant. The direction is positive. The analysis controls for sources of bias — pre-existing differences in achievement, demographics, or other factors that could explain the result. Effect size is not a criterion. A modest, well-controlled, positive result is exactly what the standard asks for.
Why Is Tier III the Evidence Standard Most EdTech Buyers Actually Want?
Walk through a realistic procurement scenario. A curriculum director is evaluating three literacy platforms for a supplemental intervention purchase. She has a limited budget, a board that wants evidence the money is well spent, and a timeline measured in weeks, not months. She is not convening a research committee. She is not cross-referencing study designs against the What Works Clearinghouse methodology standards. She is asking one question: has this product been studied, and did students improve?
Tier III answers that question cleanly and credibly. It says: yes, this product has been studied with real students in real schools, the data was analyzed rigorously, and the results were statistically significant. For the vast majority of purchasing decisions at the district level — supplemental products, professional learning tools, assessment platforms, intervention programs — that is a sufficient evidence bar.
The districts that specifically require Tier II or Tier I evidence tend to be operating under formal improvement plans, large federal grants with strict procurement rules, or state-level mandates that specify evidence tiers by name. Those contexts exist and matter. But they represent a fraction of the overall market. Most buyers want to know it works. Tier III tells them it works.
Why Does a Portfolio of Tier III Studies Outperform a Single Study?
A single Tier III study conducted in one district with one grade level and one student population tells you something. It tells you the product showed promising results in that context. What it can’t tell a skeptical buyer is whether those results travel — whether they hold for different demographics, different implementation models, different regions of the country.
A portfolio of Tier III studies conducted across multiple contexts answers that question before it gets asked. Three studies showing significant gains in urban Title I schools, suburban mid-sized districts, and rural implementation contexts, across different grade bands and different years, is a fundamentally different evidence story than one study showing the same thing once. The pattern of consistent results across varied conditions is itself a form of evidence — one that speaks directly to the generalizability question every informed buyer carries into a procurement decision.
This is also where the compounding value of building an evidence portfolio over time becomes clear. Each study you add doesn’t just add incrementally — it multiplies the credibility of everything that came before it. And because Tier III studies are logistically achievable on a realistic timeline and budget, the portfolio approach is within reach for companies that are serious about evidence without being research institutions.
How Can Low-Usage Students Serve as a Comparison Group Within a Tier III Study?
This is one of the most practical and underused methodological moves in edtech research, and it’s worth understanding in some depth. Most products, when deployed at scale, have natural variation in how much individual students or classrooms actually use them. Some students engage deeply; others use the product minimally or inconsistently. That variation — which exists in nearly every real-world implementation — can be leveraged analytically.
By separating students into high-usage and low-usage groups based on their actual engagement data, a researcher can construct a within-product comparison that approximates the structure of a quasi-experimental design. The high-usage group effectively becomes the treatment condition; the low-usage group becomes the comparison. If high-usage students show significantly greater gains than low-usage students — after controlling for pre-existing differences in achievement and demographics — that is a dose-response relationship, and it is meaningful causal evidence.
This design has a particular strength: both groups come from the same implementation context, which eliminates many of the confounding factors that make comparison group studies difficult. They had the same teachers, the same school environment, the same calendar, the same district-level influences. The only meaningful difference is how much they used the product. That’s a clean signal, and reviewers recognize it as such.
Depending on how it’s designed and documented, a study using a low-usage comparison group may qualify for review at the Tier II level — or it may be reviewed as a particularly strong Tier III study with internal comparison evidence. Either way, it strengthens the portfolio considerably compared to a straightforward pre-post design without any comparison component.
We walk through how to use your own usage data to build internal comparison groups — and what the analysis actually looks like. Watch on the LXD Research YouTube channel →
Which Organizations Review and Validate ESSA Tier III Studies?
Conducting a rigorous study is one thing. Having it independently reviewed and validated is what converts internal research into a credible market credential. Two organizations are particularly relevant for Tier III evidence validation in the edtech space, and understanding what each one offers helps companies decide where to invest their review efforts.
Reviews edtech products through its Research-Practice Partnership framework. Certification signals that a product has been studied with students and that the research meets defined quality standards. Widely recognized by districts and increasingly referenced in state procurement guidance.
Minimum sample size for Tier III: 50 students. This is a practical planning threshold — studies need to be designed with sufficient enrollment from the outset to clear this bar.
Reviews educational programs and products across the full ESSA evidence spectrum. Applies a structured methodology standards review and publishes findings publicly. A credential from EduEvidence signals independent third-party verification that the study meets the claimed evidence tier.
What independent review actually does for you: It converts “we conducted a study” into “our study was independently reviewed and verified.” That’s a meaningful distinction in a procurement conversation, particularly when a district’s legal or compliance team is involved in the purchase. The reviewer’s methodology standards function as a third-party audit — and the credential that comes out of it is something you can put in a proposal, a sales deck, and a grant application.
How Should EdTech Companies Think About Building an Evidence Portfolio Over Time?
The most practical way to think about it is as a sequence, not a single event. A company that is serious about evidence typically moves through something like the following progression — not necessarily in strict order, but with each layer adding to the overall picture.
Companies that follow something like this sequence arrive at a position where they can say, credibly and specifically: our product has been studied in multiple contexts, the results have been consistent, and the research has been independently reviewed. That is a compelling evidence story — and it is achievable without a randomized controlled trial.
Ready to Start Building Your Evidence Portfolio?
LXD Research designs and conducts Tier III studies built for independent review — and we help companies think through the full portfolio sequence from the first study onward. Let’s talk about where you are and what makes sense next.
Schedule a Free Consultation View Our Services