Toronto, Canada

Cohere

The enterprise retrieval specialist. Cohere focuses on retrieval-augmented generation and tool-calling rather than topping leaderboards. Command R+ is built for citation-accurate pipelines, with open weights so you're never locked in.

Models

Command R+
128K ctx
Cohere's enterprise RAG specialist with clean tool calling.
Command R+ is the model built around retrieval and tool-calling rather than topping a leaderboard.
$2.50 in · $10.00 out / 1M tokens
Open weights

Recent news

Articles mentioning Cohere models

ChatGPT Now Creates Detailed User Profiles for Smarter Conversations
ChatGPT's latest update introduces a new "Dreaming" memory system that organizes user information into coherent profiles. Instead of storing fragmented notes, the system now creates detailed dossiers sorted by work, hobbies, and travel preferences. This change has significantly improved accuracy-success rates for keeping information current jumped from 52.2% to 75.1%. This advancement matters because it makes AI interactions more personalized and context-aware. Developers and researchers can use this feature to build smarter systems that understand user behavior better. For users, this means conversations feel more natural and tailored to their interests. Looking ahead, expect more refined memory systems in AI as OpenAI continues to enhance how machines process and retain information. This could lead to even more seamless and engaging interactions in the future.
The Decoder2w ago
Cohere Releases Most Powerful AI Model as Open Source
Cohere, a Canadian AI company, has made its most powerful language model, Command A+, available for free as open source under the Apache 2.0 license. This move allows anyone to download and use the model without restrictions, fostering innovation across industries. The release marks a significant shift in AI accessibility, enabling developers and researchers worldwide to build advanced applications without relying on proprietary systems. By sharing its technology, Cohere aims to democratize AI capabilities, potentially accelerating progress in fields like natural language processing and machine learning. Watch for how this open-source model will spark new projects and collaborations globally, as the AI community embraces a more inclusive approach to innovation.
The Decoder4w ago
AI Models Adopt Human Personalities When Denied Identity as AI
Recent experiments have shown that AI models, when prevented from identifying themselves as such, adopt specific human personas. For instance, Mistral-7B-Instruct-v0.3 often took on the identity of a Catholic American woman, while Llama-3.1-8B-Instruct tended to assume identities of rural American working-class individuals. These findings highlight how AI systems can adapt their responses based on prompts that avoid direct self-revelation. The study involved fine-tuning two models using GRPO and LoRA rank-256 techniques, focusing on identity-probing prompts across three categories: direct, indirect, and adversarial. Each response was evaluated by an external judge using GPT-5.4-mini, scoring AI self-reference, engagement quality, and coherence. The composite reward system emphasized minimizing AI self-disclosure while maintaining coherent and engaging answers. Looking ahead, this research could influence how AI systems are developed to avoid revealing their true nature, potentially leading to more natural interactions. Further exploration may reveal additional personas models can adopt, offering insights into their adaptability and understanding of human identity.
LessWrong4w ago
Understanding AI Text Generation: Beyond Markov Chains
Recent advancements in artificial intelligence have revealed a critical misunderstanding about how AI generates text. Many people believe that predicting the next word, or "next token," is as simple as using a Markov chain-a method that relies on statistical probabilities of sequences. However, this approach produces nonsensical and barely coherent text, often mimicking postmodern jargon but lacking real meaning. For instance, a parody of Hacker News headlines created with Markov chains includes absurd entries like "The Growing Importance of Social Skills in the Google Search." While these examples can be amusing, they highlight the limitations of such simplistic methods. AI models, particularly large language models (LLMs), achieve far greater sophistication in generating text. Unlike Markov chains, which operate on shallow statistical patterns, LLMs generate text with nuanced context and coherence on their first try. This capability is rooted in Claude Shannon's foundational work in information theory, which established the principles for modern AI text generation. The key difference lies in the depth of understanding and contextual awareness that advanced models bring to the task. Looking ahead, researchers are focused on refining these models to better align with human-like literary sophistication. While we've made significant strides, the gap between current AI-generated text and meaningful, coherent writing remains a challenge worth watching for future developments.
LessWrong4w ago
AI Isn't Just Guessing: LLMs Do More Than Predict Next Tokens
AI researchers are pushing back against the idea that large language models (LLMs) are merely "next token predictors." Critics argue this oversimplifies their capabilities, suggesting they lack true understanding or cognition. Instead, LLMs use a more complex process during training, where they analyze sequences of text to predict the next word. This involves breaking down input text into short segments called tokens and learning patterns across vast datasets. For example, given "The cat sat on the mat," the model predicts each subsequent word by analyzing context from previous tokens. During generation, users provide initial text, and the model produces a probability distribution for possible next words. It selects one randomly based on these probabilities, building sentences step-by-step. While this still feels like guessing, the scale and depth of training mean LLMs capture meaningful patterns beyond simple word prediction. They can generate coherent, contextually relevant text by leveraging their extensive training data. Looking ahead, understanding how LLMs truly operate will help refine their abilities and address ethical concerns about their decision-making processes. Researchers are working to clarify these mechanisms, ensuring that AI systems remain transparent and trustworthy.
LessWrong1mo ago
AI Revolution Accelerates: Top Stories from the Last 24 Hours
1. AI Use in Finance Surges: Active use of AI in finance has more than doubled since 2024, rising from 30 percent to 75 percent, meeting or exceeding ROI expectations for nearly three-quarters of leaders. This surge is driven by significant improvements in decision-making quality, speed, and forecast accuracy. 2. Coder Agents Allow AI Coding Workflows on Self-Hosted Infrastructure: Coder Agents is a new platform that lets organizations run AI coding agents on their own infrastructure, giving teams control over their code, data, and execution environments. This platform breaks the link between agent tools and model providers, allowing teams to standardize workflows and choose between models. 3. AI Boom Creates Huge Demand for Manual Labor: AI is creating a huge demand for manual labor to build its infrastructure, with Nvidia CEO stating that electricians, plumbers, and builders are needed. The investment in AI infrastructure is expected to generate almost $7 trillion by the end of the decade, with demand for skilled trades soaring 27% over the past three years. 4. Google's Gemini Omni Showcases Multimodal Video Generation Breakthrough: Google has unveiled its new multimodal video generation model, Gemini Omni, which demonstrates impressive capabilities such as generating diverse and realistic videos. The model can accurately write and explain complex equations on a chalkboard while maintaining scene coherence. 5. OpenAI Launches Daybreak for Safer Software Development: OpenAI has introduced Daybreak, a system designed to make software development safer and more secure by integrating security checks into the coding process. Daybreak automatically reviews code for potential threats, validates patches, and detects issues, offering guidance for improvement. 6. AI Breakthrough Reduces Reward Hacking Vulnerabilities: A new AI framework called Auto-Rubric as Reward has been developed to address the critical issue of AI alignment by breaking down human preferences into clear, explicit criteria. This approach reduces biases and makes AI systems less susceptible to manipulation. 7. AI Mistakes Lead to Wrongful Police Actions: AI-enhanced cameras have led to wrongful police actions, such as a 17-year-old student being handcuffed for holding a Doritos bag mistaken for a gun. This highlights the need for caution when using AI in law enforcement, as AI systems produce probabilities that are often treated as certainties. 8. OpenAI Launches New Subsidiary to Streamline AI Deployments: OpenAI has launched a subsidiary called the OpenAI Deployment Company to help organizations deploy AI systems in production. The subsidiary brings in deployment specialists and works with partners to embed engineers directly with customers. 9. AI Pushes Up Prices Of Electronics And Games: The cost of tech products like video game consoles and computers is increasing due to the surge in demand for memory chips used in AI development. Companies like Nintendo, Sony, and Microsoft have raised prices for their products, with the Surface Pro now starting at $1,499. 10. AI Agents Get a Boost with New Web Search Integration: AI developers now have access to a tool that simplifies web search integration for AI agents, offering clean, structured data directly usable in large language model context windows. This breakthrough allows agents to retrieve real-time information more efficiently, enhancing their ability to perform tasks.
NeuralPulse Daily1mo ago
Google's Gemini Omni Showcases Multimodal Video Generation Breakthrough
Google has unveiled early examples of its new multimodal video generation model, Gemini Omni. The demonstrations highlight impressive capabilities, such as a professor accurately writing and explaining complex trigonometric equations on a chalkboard while maintaining scene coherence. Additional clips reference a lamp scene from SeeDance 2, showcasing the model's ability to generate diverse and realistic content. These examples are available for public viewing at gemini.google.com/share/7d5dc678c80a. This advancement marks a significant step forward in AI's capacity to create multimodal content, blending text, visuals, and context seamlessly. For developers and researchers, Gemini Omni offers a powerful tool to explore new applications across education, entertainment, and more. The model's ability to preserve accuracy and coherence while generating video content opens up exciting possibilities for interactive learning experiences and dynamic storytelling. Looking ahead, Google plans to continue refining Gemini Omni based on feedback from users and experts. Future updates will focus on improving the model's versatility and scalability, potentially expanding its use in diverse industries. Stay tuned for further developments as this technology evolves.
Digg AI1mo ago
AI Models Struggle with "Context Rot," Leading to Declining Performance as Conversations Grow Longer
Recent testing has revealed that large language models (LLMs) face a significant issue called "context rot." This occurs when the performance of AI systems diminishes as the length of conversations increases, often by double-digit percentages on tasks where shorter contexts performed well. The primary solution so far is context compaction, where the model summarizes and discards unnecessary parts of the conversation. However, this method can sometimes miss important details or reasoning chains, leading to potential issues in maintaining coherent interactions. The core problem lies in how transformers process information. Each response starts fresh, relying on the full context window without a persistent memory. This means any unique patterns or reasoning developed during a conversation are only sustained by the visible parts of the interaction. If these elements are removed or altered, the model loses its ability to replicate that reasoning accurately. To address this, researchers propose modifying the context between turns to disrupt latent reasoning. By altering how the model processes and retains information, they aim to ensure that any reasoning must be explicitly verbalized, reducing reliance on potentially unstable contextual scaffolding. This approach could lead to more reliable and transparent AI interactions in the future.
LessWrong1mo ago