Sign up for Chalkbeat’s free weekly newsletter to keep up with how education is changing across the U.S.
A group of Stanford University researchers started with one question: Could a human tutor providing motivation and support get students to spend more time working with an AI literacy tutor?
The answer turned out be yes — but only between one and four minutes more per week. Many students never logged on at all.
That left the researchers with a different set of questions.
“A key finding that we weren’t even meaning to test is that having access to this AI tutor isn’t the same as using it,” said Carly Robinson, the lead author on the study released Wednesday and the director of research for the SCALE Initiative at the Stanford Accelerator for Learning.
Robinson said this doesn’t necessarily mean AI tutoring doesn’t work. “We never really got close enough to the dosage needed to find out.”
The study adds to a very limited research base for AI tools in education as many school districts are looking for ways to maintain or increase tutoring that costs less than hiring people to do the work. It also aligns with the experience of Khanmigo founder Sal Khan, who recently acknowledged that students didn’t engage much with the AI tutor he initially hoped would revolutionize education.
Robinson and her colleagues worked with two school districts serving high-poverty populations using the same AI learning platform in different tutoring settings. Neither the districts nor the learning platform are named in the study.
In one school district, elementary children were supposed to use the literacy tutor during homework time in an afterschool program. In the other, early elementary students were supposed to use the AI tutor during class time.
The learning platform provider said that students would show improvement in their reading skills with at least 30 minutes of use a week. The goal was for students to complete two 30-minute sessions each week.
Students were randomly assigned to either work independently with the AI tutor or with guidance from a specially trained person — afterschool program staff in one district and middle school students who were strong readers in the other — who could provide motivation and tech support.
Other research has found that the relationship between tutors and students can be a key factor in tutoring effectiveness. Researchers wanted to know if having human support would increase engagement with the tool and raise test scores over the course of the school year. They also thought they might learn that students working independently made more progress because they spent more time on the platform and less time chatting.
But it turned out both groups of students barely used the platform, and having a human tutor didn’t increase the likelihood that students would log on.
Students who did use the platform spent about 13 minutes in a session in the afterschool setting and almost 26 minutes in a session when they used it in class, enough time to probably derive some benefit, Robinson said.
But the low rate of participation meant students in the afterschool setting averaged only about two minutes a week on the platform when they worked independently and three minutes when working with a tutor. In the other district, younger students working independently in class averaged a little more than five minutes a week on the platform, which increased to almost 10 minutes a week when they worked with a tutor.
There was no meaningful difference in the reading scores of the two groups.
The bigger finding, Robinson said, was that many students didn’t log on most weeks. And those that did log on were more likely to already be higher performing and not identified for special education.
Researchers don’t know why students didn’t use the platform. Perhaps some didn’t find it engaging, or perhaps teachers directed students to spend their time in other ways. Those other activities may have been beneficial, Robinson said. The study didn’t compare outcomes for students who used the platform and who didn’t use the platform.
Robinson said she still sees enormous potential for digital tools to provide personalized instruction at scale. But before districts pay for a license, administrators need to look at more than whether there’s evidence a particular AI tool improves student learning.
“The challenge isn’t just building good AI tools,” Robinson said. “It’s really getting students to use them, and that seems to take the same type of intentional design that we’ve learned matters with other ed tech interventions and tutoring.”
Erica Meltzer is Chalkbeat’s national editor based in Colorado. Contact Erica at emeltzer@chalkbeat.org.