AI Boosts Math Scores in Sierra Leone Schools
In brief
- A recent study involving over 1,700 students in Sierra Leone has shown that AI-powered learning tools can significantly improve math outcomes.
- Over eight weeks, students using Google's Gemini AI in conjunction with teacher-led instruction saw notable progress.
- The tool was designed to encourage critical thinking by posing questions rather than providing direct answers, with 91% of interactions focused on building understanding.
- The AI system, called Guided Learning, worked alongside teachers, who remained central to the learning process.
- Educators reported that using the tool helped them become more effective facilitators, discovering new ways to explain concepts like fractions.
- The study highlights how AI can enhance teaching without replacing it, offering a promising model for other countries looking to integrate technology in education.
- Moving forward, Google plans to release a teacher training guide to help others replicate this success.
- Future studies will explore how AI tools can be adapted for different subjects and regions, aiming to provide evidence-based solutions for improving global education.
Read full story at DeepMind Safety →
More briefs
Iowa State University Study Finds AI Writing Tools Require More Thought From Students
Students at Iowa State University learned that writing with AI tools is not as easy as it seems. They found that AI only handles surface-level writing. The students completed a course where they used AI tools to write. At first, they thought AI would do all the work. But they soon learned that AI requires trial and error. They had to try, test, and revise their work many times. The study found that students need to understand three key ideas to write well with AI. Now researchers will continue to study how students can use AI to improve their writing skills.
AI Safety Researchers Unlock New Method to Control Risky Chatbot Responses
AI researchers have discovered a novel way to control how chatbots respond to harmful prompts. By analyzing activation patterns in five open-source models, they found that one technique-Iterative Nullspace Projection (INLP)-can suppress unsafe responses nearly as effectively as the previous method. INLP's "counterfactual flipping" approach moves harmful activation clusters into safer zones, while its "nullspace projection" collapses them between safe and harmful states. This breakthrough offers a tunable control mechanism, preserving model performance while reducing risks. The findings suggest that models encode the absence of concepts differently from their presence, a distinction that could guide future research. Developers can now better balance safety and utility by fine-tuning these interventions. As AI systems become more integrated into daily life, such advances are crucial for maintaining trust and usability. Look out for further studies exploring how these methods apply to real-world scenarios.
Brain-Inspired AI Breaks New Ground in Security
Scientists have discovered that adding "noise" to artificial neural networks, inspired by how our brains process information, can make AI systems more secure against cyberattacks. By introducing structured noise into ANN activations, researchers found that these networks become significantly more robust to adversarial attacks and natural image changes. This breakthrough aligns with biological observations, where variability in brain signals plays a crucial role in processing sensory data. The study reveals that while unstructured noise doesn't offer much benefit, structured noise-like patterns influenced by real-world data-greatly enhances security. Interestingly, the effectiveness of this noise varies depending on the type of attack or image modification. For instance, noise structured from adversarial attacks tends to generalize better across different types of threats compared to naturalistic image changes. This biologically inspired approach could pave the way for creating more resilient AI systems that mimic how the brain handles information, potentially leading to safer and more reliable machine learning applications in the future.
AI Models Can Now Be Compared Using Simple Agents That Find Behavioral Differences
Google's DeepMind team has developed a new method for comparing the behavior of different AI models. They introduced "diffing agents," which are simple systems designed to detect differences in how models respond. This approach is more efficient than previous methods, which focused on static prompts and might miss rare or subtle behavioral changes. The key innovation is allowing these diffing agents to create their own prompts to search for and validate differences. When tested against real models, the agents proved effective in identifying discrepancies, especially when standard auditing tools failed. For instance, they successfully found intended behavior changes in model organisms but struggled with a specific secret behavior, highlighting potential limitations. Looking ahead, this technique could help improve AI safety by uncovering unexpected behaviors in large language models. Future research will focus on refining these agents and expanding their applications, potentially offering new insights into model capabilities that traditional evaluations might miss.
AI Tools Fail to Impress in Medical Practice Tests
Clinical AI tools are being used in medical practice despite a lack of independent evaluation. These tools were compared to general purpose language models in three tests. They were given 500 medical questions and 500 items to evaluate their agreement with expert clinicians. They also received 100 real clinical queries from physicians. The general purpose language models performed better in all three tests. This shows that clinical AI tools may not be as effective as claimed, and independent evaluation is needed before they are used in medical practice. New evaluations will be done to further test these tools.