AI in Education Unplugged.

 A conversation


This section is a conversation between Seiji Isotani, Professor of Computer Science and Learning Technology at the University of Sao Paulo (Brazil), and the OECD Secretariat. It focuses on remedying digital divides and focuses on AI in Education Unplugged, that is, the design of AI tools and AI interventions for places where digital infrastructure, connectivity and skills are limited. The section starts by presenting the idea and applications of AI Unplugged before zeroing in on the current and promising uses of generative AI systems in this context, notably with small language models. Isotani is now Faculty Director of the Learning Analytics and Artificial Intelligence Program at the University of Pennsylvania (United States). 


OECD: Where does the concept of AI Unplugged come from? 

Seiji Isotani: When we look at the AI space, people have been working on how to improve education with AI for decades. The AI in Education (AIED) society was created in 1997, almost 30 years ago. In the beginning, researchers tried to mimic what a teacher does, for example by trying to understand pedagogical strategies and their impact on learning. Others started working on “user” or student modelling, for example trying to predict how much students know based on their potential behaviour. Then we moved more towards what we call open learning models, which not only assess how students are doing but also present that information to students, so they are informed about what computers believe they know about them. It's more of a cognitive and metacognitive process. The consistent problem in AIED has always been the expectation that students and schools, in general, have a minimum infrastructure to run AI. And as AI grows, the standard for this minimum requirement gets higher, right? This has become a problem, which is why people have discussed the digital divide for so many years. For me, the question is: How can I bring the benefits of AI to people and regions where infrastructure is almost non-existent? This completely shifts the research question and how you approach AI. Instead of creating the most innovative AI technology, you actually work with the community and ask, “okay, what does this community need, and how can I bring AI to them?” This is super interesting because by understanding users, we can create different technologies for that particular space. In 2022, about 90% of the world population, including those in low-income countries, had access to mobile phone subscriptions or mobile phones. If you go to poor places in Brazil, in countries near the Amazon, in Africa, or in parts of India, you will always see a mobile phone. Then we looked at the statistics on Internet access, using data from 2022 and going back 30 years, so we could project trends. What we realised is that only 15% of people in low-income communities have access to stable Internet. This completely changes how we think about AI design, because we can consider using mobile phones and other low-cost equipment, but we cannot assume that the Internet will be available everywhere. Even with the ambitious ongoing efforts to bring Internet access to everyone, the data give you the feeling that it's going to take about 100 years for lowincome communities to have the same Internet access as higher income communities. And this considers the trends of things evolving and getting better. 

OECD: So, what is the minimum infrastructure needed for AI Unplugged? 


Seiji Isotani: We are developing a framework to implement AIED considering different degrees of infrastructure. To date, what we realised is that the minimum access required is a mobile phone (and not even a smartphone). That's the only thing we need. And some access to the Internet. It doesn't need to be all the time, every day, or every hour. At some point during the week you may have access to the Internet. If you have these two things – limited access to the Internet and a mobile phone, just a good enough mobile phone – then we can use AIED Unplugged. If you have Wi-Fi access once every week, I think it's enough, because it's sufficient for us to update information from the local equipment to a server. This allows us to do more intensive processing, analyse the data, update anything that needs updating, and then return information about the students. 



OECD: What are the things that AI Unplugged allows you to do? Could you give us an example? 

Seiji Isotani: Sure. Right after the COVID-19 pandemic, the Brazilian government asked us to help improve the writing skills of students in 5th to 9th grade. In Brazil, pupils spent almost two years out of school without writing, so when they returned they couldn't write well. The government asked us to try to do something about this at scale in Brazil, considering all the inequalities in the country. We accepted the challenge. So, we performed a data analysis on mobile phone access, Internet access, and so on. In Brazil, most schools (over 90%) have some Internet access, but mainly for administrative tasks available at the principal's office, leaving students and teachers with little to no access. In this context, at least one location in most schools in Brazil has Internet access. This is an interesting and important feature. We created an application enabling teachers to take pictures of students' essays. They would ask students to write essays on a sheet of paper, and then take a photo of those essays (see Figure 6.1). Whenever the Internet was available – it could be the next day or two days later, or whenever possible, usually during lunchtime – a teacher would go to the principal's office and leave their mobile phone there. Our application would then upload all the photos to our server, perform the intensive processing, and return all the analysis of these essays. It would then provide the teacher with a dashboard for a particular student, for the whole class, or for a group of classes. The goal was to provide analysis and recommendations to empower teachers to improve how they support students in their writing. We analysed the outcomes with about half a million students in Brazil across 1 500 different municipalities over a year, and we found benefits from doing this. So, this type of AI technology that requires a minimal physical infrastructure offers significant support for teachers and helps students write essays through its automatic evaluation. We are now doing the same thing for basic mathematics and other subjects as well. 

OECD: For that purpose, you didn't have to use generative AI, right? 

Seiji Isotani: Yes, in that case we didn't use generativeAI. We are now trying to use generative AI and compare it with the traditional methods we used. Right now, our previous AI model is still better at detecting incorrect words because what happens with generative AI is that it corrects the student's mistake when it processes a photo, thus correcting something that shouldn't be corrected in this context. For us to evaluate students, we need to know exactly what the student wrote, including their mistakes! But in one or two years, we will probably be in a position where generative AI will substitute all this previous work we've done.

 OECD: How do you ensure that the recommendations given to teachers are pedagogically sound? 

Seiji Isotani: That's a very good question. We have a library of the best pedagogical strategies that the field of the learning sciences provides to us. We try to match students' challenges or difficulties with these materials and strategies. For example, one of the materials we use is "WordGen." "WordGen" is a set of materials created by colleagues from Harvard University and other institutions. They work not only to support reading and writing but also to support reflection. Their general idea is that to support reading and writing, you need to engage students in interesting interactions. Just to give an example, imagine you are a student and you need to write and defend a position. You could talk about climate change, or about economic challenges. But students who are just starting to learn are usually more interested in local challenges. For example, in "WordGen," they have one problem related to lunch in the school cafeteria: should we have pizzas or salads for students? It’s much more interesting to discuss this topic. Or, should we have mobile phones in school or not? It’s interesting because you can also provide data about this: How good is a pizza? How good is a salad? What benefits do you get from each? Then you can position yourself in one of these directions, and during the debate, you bring all those ideas you collected from your reading and start to debate. Then you become much more critical when you are discussing. So, the idea of producing critical thinking or improving critical thinking becomes relevant in this case. In this context, the role of AI could be to match students’ difficulties with potential pedagogical strategies and materials that can be used by teachers. The AI might assess the class and say "Okay, so students are not well-versed in semantic analysis, so we need to improve their capacity to understand more complex sentences, to retain their meaning." Based on that, it can recommend “WordGen” together with pedagogical strategies for reflection and peer reading that we know from the learning sciences are proven to work for that specific need.

OECD: Thank you. We have a good example here of how to use unplugged AI when the AI tool is teacher-facing. Do you have examples of AI applications when it is used directly by the students and not by the teacher? 

Seiji Isotani: Right now, I don't think we have a good example of a student using it directly. Because one of the foundations of AIED Unplugged is trying to reduce the amount of equipment needed. In low-income communities, most students do not have devices – so a scenario where they would all have one is not really helpful. We always think in this context that the proxy can be a teacher, a parent, or a mentor. We empower this proxy user so the students receive the greatest benefit. This is one of the challenges of using AIED Unplugged, because students who benefit from it are vulnerable. Current AI still has some biases. We can avoid them to some extent by having a proxy who tries to understand what makes sense and what doesn't to support that particular student. So, we work with an intermediary that prevents students from being affected by additional biases, that's the idea.

"AI in Education Unplugged Support Equity Between Rural and Urban Areas in Brazil”, Proceedings of the 13th International Conference on Information & Communication Technologies and Development



OECD: Let's move now to the small generative AI models. How could they help? They are interesting because, of course, they can be used to support teachers. They can also directly support students, even if it's through their parents or siblings, or whoever has a device. They could help students to develop their AI literacy in contexts with little resources. What are the possibilities for actually using generative AI in the unplugged model? 

Seiji Isotani: Yes, that's a key question that we are discussing in our group. I think the first point is breaking another barrier to access AI and technology in general. Even with mobile phones, we still require the user to have at least a minimum knowledge to use a mobile phone. When we use these AIED Unplugged models, they could run on a mobile phone or any other device. The interaction interface can be voice, which completely changes the game. Users are not just clicking; they are talking. And by talking, we can have a communication and interaction that students and teachers are already knowledgeable about. Then, whenever you have a more knowledgeable partner and you want to ask and learn from them, you start asking questions and analysing their responses and trying to use them in your everyday life. So, when I think of GenAI in the AIED Unplugged model, it's like a companion that helps you solve challenging problems. It's not meant to substitute for anything, but it will enhance your capability to do things better. When you go to schools in the Amazon in Brazil, or in remote areas in many countries, you see that schools often don't have distinct grades. Five-year-old, six-year-old, ten-year-old students are all together. But they need different kinds of support, and the teacher isn't knowledgeable about everything and also need support. In this scenario, GenAIED Unplugged can actually provide specific or tailored support for students when they need it, at their level. So, that's something we are trying to produce right now. Our work is exploring whether we can ask something and receive a response in a good enough amount of time so that people with no connectivity can have a fruitful conversation. If it is possible, then the next question is, “are those interactions adequate or correct, or do they help students learn something?” These are the next steps that needs to be addressed.

OECD: How does it work? If I put a Small Language Model (SLM) on my phone, does that mean that when I'm not connected to the Internet, I can still interact and get responses from the chatbot and everything?

 Seiji Isotani: Yes. You would just have a small or mini version of any LLM on your mobile phone (ChatGPT, Llama, DeepSeek, Mistral etc.) and it would work offline. It's a smaller version of an LLM, which means that hallucination is more problematic and it is not as powerful. Responses are sometimes incomprehensible. Words can be invented, so there are several problems to use it for learning right now. But on the good side, we do have a mini version of the web in our hands, so we can ask questions, get answers, and get help in different ways. So, the interaction process doesn't change much compared to an LLM: their capability to respond in different ways is limited, but they are still capable of doing things. One work that we are yet to publish, but are finalising, explores how many different pedagogical strategies can actually be used by LLMs in both an online and an offline environment (SLMs). In an online environment, LLMs are much more capable of using different strategies. If you ask them, "use the Socratic method to teach me something," they will do it, that is, ask questions and not give you the answer. On the contrary, if you are using a SLM on a mobile phone and you ask "use the Socratic method," probably - and we have observed this - in almost all cases, they will just give you the response. They won't use the strategies you want. So, these are some examples of the limitations.

OECD: Does it make a difference for a small model to be offline or online? Would the small model online be more performant than the small model offline, or would it be more or less the same?

 Seiji Isotani: In our case, it doesn't change anything if it's online or offline, because the only thing we are doing is analysing interactions. If you want to create models that update over time or can search for current information on the Web according to student interactions, then an online model can make a huge difference. But if you are just thinking about the interaction itself, then offline is fine


OECD: You have studied the trade-offs between different types of small language models. What are the lessons from your work on that? 

Seiji Isotani: The lessons learned are that every community has different challenges, and we realised that AIED Unplugged can potentially address about 70% or 80% of these challenges. We are always thinking about these large language models, but perhaps for educational purposes, we should consider more seriously small language models. Small language models seem to be more effective for different purposes such as handling specialised, domainspecific tasks, like offering short feedback on a decimal misconception; they do not need to be online and they are not as high-cost as large language models. In addition, AI agents in the space of small language models also seem to be a promising path forward that few people talk about. People are discussing these huge agents that can do several different things and produce a final result. But when we think about AIED Unplugged, perhaps we can think of agents running in small language models to complete small, specialised tasks for teachers, for example, lesson planning, or creating specific materials for a particular activity for a student of a certain age. These are small agents that you can actually create to help teachers produce better quality materials and support them in their activities. They don’t need to be able to do everything. For students, agents are really interesting as well. Think about students in high-income families and what they have access to: if they struggle, their family might hire a private tutor so they can improve. Students with mental health issues will have access to a psychologist or someone working on their well-being. If uncertain about their career, a career adviser will help them, right? Three different things. Students in low-income families don't have access to this, but with small language models, using a mobile phone, you can actually run three different specialised agents: one focusing on the student's cognitive capability, another on the student's well-being, and another on the student's career. Then AI agents and students’ caregivers can collaborate with students to make the best decision about what they should do next. So, I think this is a very promising path forward because, in many cases, those students lack any kind of support right now.

OECD: Large Gen AI models are increasingly getting trained for specific purposes, a bit the way you were describing. Is it possible to do something like that with a small model? Could you train a small model to be more focused on academic or on social and emotional skills, or to really have some kind of specialty? And can it be combined with other types of "good old-fashioned AI"?

 Seiji Isotani: Yes, it's possible to tailor a small language model to specific tasks and activities for education. For example, we can use RAG (Retrieval-Augmented Generation) and train those models on data just from Wikipedia for example, or any other source. Their responses would then be heavily based on Wikipedia information. This is completely feasible. We can also do it with other different sets of materials. We can think about books, materials from OECD, or several other different resources. So, these students can actually have agents that will help them in different domains. We could use ontologies and knowledge representation (i.e. symbolic AI) to create hybrid approaches that can potentially have better results without relying solely on a single AI technology. 


OECD: How much time does it usually take for a small language model to respond to a question? And does it make a difference if I type it or if I speak?

 Seiji Isotani: Yes, typing or speaking makes a difference because the model needs to transcribe your oral input from what you're saying and then generate the response. When you type, it's faster. In January 2025, the last time we experimented with voice, it took more than a minute to get a response. So, if you ask a complex question, it can take much more time. But I believe that with new models and optimisations we will probably get better results over time


OECD: One last question. Here we're talking about language models. Generative AI is more than just language models. It can generate pictures, music, videos, and so on. How much of that can a small model do? 

Seiji Isotani: That's a big limitation right now. With the capabilities we have, small language models are not enough to generate images, videos, and so on. You need a lot of processing power and energy to do that. And even now, with the current large GenAI models, it may take several minutes to generate a good quality image. So, right now, small GenAI models don’t have the capability to generate these high-quality materials. But my bet is that this is just a matter of time. Every two years, the power required to generate an image is reducing. OpenAI showed that in the beginning, responding to any prompt would cost USD 5, and now it's a matter of a pence - one cent of a dollar. So, I think things will continue to improve, and these smaller language models will be able to do something like generating images and performing more task

Comments

Popular posts from this blog

(Day 2) Beyond the Algorithm: Navigating the Future of Artificial Intelligence - 49th Annual UNIS-UN International Student Conference.

(Day 1 - Part 2) Beyond the Algorithm: Navigating the Future of Artificial Intelligence - 49th Annual UNIS-UN International Student Conference.