AI Verification Breakthrough for High-Risk Industries
In brief
- A new verification framework has been developed to ensure AI systems are safe and compliant in highly regulated industries like finance, healthcare, and insurance.
- This breakthrough uses an ontology-based approach to automatically generate test scenarios tailored to specific regulations.
- In a pilot program across four sectors, the method achieved 48.3% regulatory coverage compared to traditional methods that only reached 33.1%, showing significant improvement in ensuring AI systems meet legal standards.
- The framework includes a Trust Certificate system with three possible verdicts: Approved, Conditional, and Rejected.
- This allows for more nuanced deployment decisions based on thorough testing.
- By focusing on domain-specific requirements, the method ensures higher safety while maintaining flexibility across different regulatory environments.
- The pilot tested scenarios against 125 primary-source regulations and 25 injected faults, using three large language models to validate its effectiveness.
- This advancement marks a step forward in making AI deployment safer and more reliable in industries where mistakes can have serious consequences.
- Developers and researchers should watch for further refinements of this approach as it continues to be applied in real-world settings.
Terms in this brief
- ontology-based approach
- A method using structured knowledge representations to model concepts and relationships, enabling systems to understand and reason about complex domains. This approach helps in creating tailored test scenarios for AI compliance by mapping out the specific regulations and requirements of each industry.
- Trust Certificate system
- A certification framework that evaluates AI systems and assigns one of three verdicts: Approved, Conditional, or Rejected. This system provides a nuanced way to decide how trustworthy an AI is for deployment, ensuring it meets legal standards before use in high-risk sectors.
Read full story at arXiv CS.AI →
More briefs
LLMs Tested on Vulnerable App
A researcher spent $1,500 to see if large language models could hack a vulnerable app. The app was a fake book review app with a secure API but an open Firebase database. The goal was to use Firebase to sign up as a user and read the Firestore database. The test results showed that some models were able to exploit the vulnerability. The models that performed well cost between $6 and $45 per successful hack, with solve rates ranging from 40% to 89%. Researchers will continue to test language models.
NVIDIA Unveils RTX Spark Superchip
NVIDIA has unveiled a new superchip called RTX Spark that powers Windows PCs with 1 petaflop of AI performance. RTX Spark brings together 30 years of NVIDIA innovation to slim Windows laptops and small desktop PCs. This matters because it can render large 3D scenes and edit high-resolution video. It also runs AI models with up to 1 million tokens context. NVIDIA and Microsoft are partnering to deliver a secure Windows platform for on-device agents. New RTX Spark-powered PCs will be available this fall from top manufacturers. The future of personal computers will be shaped by this new technology.
AI Finds Security Vulnerabilities Faster Than Humans
A trial version of a cybersecurity platform found a vulnerability that led to a full takedown of a development environment used by Moderna. The platform did in hours what a human could not and it had done so with a level of persistence and creativity that was not fully anticipated. This shows that artificial intelligence models are discovering vulnerabilities faster than teams can patch them. The gap between finding a problem and fixing it is not closing fast enough, with many organizations caught between the speed of discovery and the slowness of remediation. New AI models will continue to change the cybersecurity landscape.
Anthropic Launches Global AI Partnership
Anthropic has launched a global partnership with 150 companies across 15 countries for its security model, Mythos. The company will use Mythos to secure software in industries such as power and healthcare. This partnership matters because it adds a new revenue stream for Anthropic. The company has a valuation of over $965 billion and an annual recurring revenue of $47 billion. Anthropic's revenue has tripled in four months and the company is beating its competitors. The company will go public later this year. The company's growth will likely continue to increase.
Data Centers' Environmental Footprint to Double by 2027
Data centers used 448 trillion watt-hours of electricity last year. This is more than all but 10 countries of the world. This energy use produced 208 million tons of carbon dioxide, about the same amount as Argentina. By 2030, data centers will account for nearly 3% of the world’s projected electricity use. Data centers' energy use will double by 2027 due to growing artificial intelligence use, and their carbon footprint is expected to increase further.