latentbrief
Launch · 1h ago

Argonne National Lab Launches AI Inference Service Using Spare Supercomputing Power

Hacker News · 1 min brief

In brief

  • The Department of Energy's Argonne National Laboratory has introduced a new AI inference service using spare supercomputing resources.
    • The service is aimed at researchers across the U.S., particularly those working on the Genesis Mission, and gives them access to advanced AI models through a chatbot-like interface.
  • The service currently operates on two clusters: Sophia, equipped with 192 Nvidia A100 GPUs, and Metis, featuring 32 SambaNova AI accelerators.
  • Researchers can use models like OpenAI's GPT-OSS and Meta's Llama for tasks such as analyzing experimental data in real time or processing large datasets from particle accelerators.
  • The service lets researchers experiment with AI without exposing sensitive data to public platforms.
  • Argonne plans to expand the service to more systems, putting otherwise underutilized computing resources to work for scientific research.

Terms in this brief

Nvidia A100 GPUs
Nvidia A100 Graphics Processing Units (GPUs) are high-performance accelerators designed for AI and data-centric workloads. They are widely used for tasks like training large language models, running inference, and processing massive datasets.
SambaNova AI accelerators
SambaNova AI accelerators are specialized hardware components that speed up AI computations. Unlike general-purpose CPUs, they are built specifically for AI workloads, with the aim of improving performance and efficiency for tasks like inference and model training.
