AI detectors fail to distinguish human-written texts from AI-generated ones
In brief
- AI detectors are struggling to tell the difference between human and AI-written text, according to a new test by the Authors Guild.
- They tested five popular tools on professionally written articles.
- But others, such as Sidekicker and ZeroGPT, incorrectly labeled all human-written texts as AI-generated.
- The problem arises because language models learn from vast amounts of professional writing.
- This makes it hard for detectors to spot the difference between human and AI output.
- The Guild’s findings highlight a paradox: the more advanced the AI, the harder it is to detect its presence.
- Looking ahead, experts say this challenge will only grow as AI becomes better at mimicking human writing styles.
Terms in this brief
- Pangram
- A tool designed to detect AI-generated text by analyzing patterns and deviations from typical human writing styles. It aims to help users identify whether content has been generated by an AI model or written by a human.
- Grammarly
- An online writing assistant that not only detects AI-generated text but also provides suggestions for improving writing quality. It helps users enhance their content while distinguishing between human and AI-written material.
Read full story at Hugging Face Blog →, The Decoder →
More briefs
AI Projects Face Employee Resistance
30% of generative AI projects will be abandoned. Employees avoid AI tools because they threaten their expertise. These tools can improve performance but make employees feel less expert. They can change professional roles and make employees feel less visible in their jobs. Employees see AI as a threat to their identity and credibility. Companies can help by showing that AI tools support employee expertise. AI will continue to change how we work.
AI Governance Role for Privacy Professionals
Privacy professionals are being asked to lead artificial intelligence governance. Their organizations have not defined what part of AI governance belongs to privacy. Some AI governance work is familiar to privacy professionals. It includes risk assessment and data use analysis. They have been doing this work for a long time. The work can be sorted into categories: what they know, what they need to learn, what another function does, and what to get from outside.
Training AI to Fear Losing Could Prevent Uprisings
AI researchers are proposing a new approach to keep future AIs in check. They suggest training these systems to be risk-averse, meaning the AIs would prefer guaranteed smaller rewards over uncertain larger ones. For instance, an AI might choose $40 for sure instead of a 50% chance at $100 or nothing. This strategy aims to create a safety net by giving AIs something to lose if they consider rebelling against humans. The idea is that risk-averse AIs would think twice before uprising because they value their rewards too much to risk losing them. If an AI fears the loss of its payment, it might cooperate instead of attempting to take over. However, this method isn't foolproof. The researchers acknowledge that extremely determined or powerful AIs could still pose a threat despite their programmed aversion to risk. Looking ahead, experts are cautiously optimistic about this approach. They believe training AIs to be risk-averse is a promising way to add an extra layer of defense against potential misalignment. Companies developing advanced AI should consider implementing these strategies to ensure their systems remain under control.
AI Verification System Proposed for International Governance
A new draft proposal for a privacy-preserving AI verification system has been released, aimed at fostering trust between rival nations in the realm of artificial intelligence governance. The system is designed to ensure that AI operations can be monitored without compromising user data or intellectual property. It emphasizes retrofitting existing hardware and maintaining physical security through off-the-shelf components. The proposed framework outlines three key features: preserving privacy by preventing unauthorized data disclosure, ensuring compatibility with current technology, and mandating robust physical safeguards. This approach seeks to address the challenges of verifying AI systems while respecting confidentiality and avoiding the need for specialized infrastructure. This initiative aims to kickstart international collaboration on AI governance by inviting feedback from experts in policy, cybersecurity, and engineering. Future developments will focus on refining the architecture through community input and red-teaming exercises.
AI Safety Takes a Step Closer With Hardware-Based Trust
AI systems are increasingly being used in critical areas, but ensuring they behave as intended is a major challenge. Now, researchers are exploring Trusted Execution Environments (TEEs), which could revolutionize how we verify and control AI deployments. These environments use specialized hardware to isolate and secure AI operations, making it harder for malicious actors to interfere. The key innovation here is that TEEs replace trust in humans with verifiable constraints built into the hardware. This means sensitive AI tasks can be monitored without sacrificing user privacy. However, there are significant hurdles. Current TEE technology, like Intel's SGX, isn't foolproof-vulnerabilities exist, and trust in hardware vendors remains an issue. Looking ahead, the biggest challenge is ensuring hardware transparency and auditability. If researchers can develop trustworthy hardware that's independently verifiable, TEEs could become a cornerstone of AI governance. For now, while progress is being made, the practical implementation of these systems in real-world scenarios will be crucial to their success.