BestAITools

Multiverse Computing pushes its compressed AI models into the mainstream

5:13 PM IST · March 19, 2026

With private company defaults running at upwards of 9.2%, the highest rate in years, VC firm Lux Capital recently advised companies relying on AI to get their compute capacity commitments confirmed in writing. With financial instability rippling through the AI supply chain, Lux warned, a handshake agreement isn’t enough.

But there’s another option entirely, which is to stop relying on external compute infrastructure altogether. Smaller AI models that run directly on a user’s own device (no data center, no cloud provider, no counterparty risk) are getting good enough to be worth considering. And Multiverse Computing is raising its hand.

The Spanish startup has so far kept a lower profile than some of its peers, but as demand for AI efficiency grows, this is changing. After compressing models from major AI labs including OpenAI, Meta, DeepSeek and Mistral AI, it has launched both an app that showcases the capabilities of its compressed models and an API portal, a gateway that lets developers access and build with those models, that makes them more widely available.

The CompactifAI app, which shares its name with Multiverse’s quantum-inspired compression technology, is an AI chat tool in the vein of ChatGPT or Mistral’s Le Chat. Ask a question, and the model answers. The difference is that Multiverse embedded Gilda, a model so small that it can run locally and offline, according to the company.

For end users, this is a taste of AI on the edge, with data that doesn’t leave their devices and doesn’t require a connection. But there’s a caveat: their mobile devices must have enough RAM and storage. If they don’t, and many older iPhones won’t, the app switches back to cloud-based models via API.
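The fallback described above can be pictured with a short sketch. This is only an illustration of the general idea, not Multiverse’s implementation: the article does not say how the app decides, so the `DeviceProfile` fields, the `choose_backend` function and both thresholds are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DeviceProfile:
    """Resources available on the device (hypothetical fields)."""
    free_ram_gb: float
    free_storage_gb: float


# Illustrative thresholds only; the article gives no real figures.
MIN_RAM_GB = 4.0
MIN_STORAGE_GB = 2.0


def choose_backend(device: DeviceProfile) -> str:
    """Return "local" if the device can host the small model, else "cloud"."""
    has_ram = device.free_ram_gb >= MIN_RAM_GB
    has_storage = device.free_storage_gb >= MIN_STORAGE_GB
    return "local" if has_ram and has_storage else "cloud"


newer_phone = DeviceProfile(free_ram_gb=6.0, free_storage_gb=20.0)
older_phone = DeviceProfile(free_ram_gb=2.0, free_storage_gb=20.0)
print(choose_backend(newer_phone))  # local
print(choose_backend(older_phone))  # cloud
```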
The routing between local and cloud processing is handled automatically by a system Multiverse has named Ash Nazg, whose name will ring a bell for Tolkien fans as it references the One Ring inscription in “The Lord of the Rings.” But when the app routes to the cloud, it loses its main privacy edge in the process.

These limitations mean that CompactifAI is not quite ready for mass customer adoption yet, although that may never have been the goal. According to data from Sensor Tower, the app had fewer than 5,000 downloads in the past month.

The real target is businesses. Today, Multiverse is launching a self-serve API portal that gives developers and enterprises direct access to its compressed models, with no AWS Marketplace required. “The CompactifAI API portal [now] gives developers direct access to compressed models with the transparency and control needed to run them in production,” CEO Enrique Lizaso said in a statement.

Real-time usage monitoring is one of the key features of the API, and that’s no accident. Alongside the potential advantages of deploying on the edge, lower compute costs are one of the main reasons why enterprises are considering smaller models as an alternative to large language models (LLMs).

It also helps that small models are less limited than they used to be. Earlier this week, Mistral updated its small model family with the launch of Mistral Small 4, which it says is simultaneously optimized for general chat, coding, agentic tasks and reasoning. The French company also released Forge, a system that lets enterprises build custom models, including small models for which they can pick the tradeoffs their use cases can best tolerate.

Multiverse’s recent results also suggest the gap with LLMs is narrowing. Its latest compressed model, HyperNova 60B 2602, is built on gpt-oss-120b, an OpenAI model whose underlying code is publicly available.
The company claims it now delivers faster responses at lower cost than the original it was derived from, an advantage that matters particularly for agentic coding workflows, where AI autonomously completes complex, multi-step programming tasks.

Making models small enough to operate on mobile devices while still remaining useful is a big challenge. Apple Intelligence sidestepped that issue by combining an on-device model and a cloud model. Multiverse’s CompactifAI app can also route requests to gpt-oss-120b via API, but its main goal is to showcase that local models like Gilda and its future replacements have advantages that go beyond cost savings.

For workers in critical fields, a model that can run locally and without connecting to the cloud offers more privacy and resilience. But the bigger value is in the business use cases this can unlock: for instance, embedding AI in drones, satellites, and other settings where connectivity can’t be taken for granted.

The company already serves more than 100 global customers including the Bank of Canada, Bosch and Iberdrola, but expanding its customer base could help it unlock more funding. After raising a $215 million Series B last year, it is now rumored to be raising a fresh €500 million funding round at a valuation of more than €1.5 billion.

Latest AI News

Lumo, Proton’s privacy-focused AI chatbot, gets an upgrade

Proton, the privacy-focused productivity app company, released a public AI chatbot, Lumo, last year. On Tuesday, the chatbot received an upgrade.

Lumo 2.0 gives the chatbot a variety of newfound powers, including image recognition and image generation capabilities. Users can now upload pictures into Lumo, then use the chatbot to analyze or edit them. Similar to other LLMs, Lumo can also generate imagery based on a user’s prompt.

Version 2.0 also expands Lumo’s capabilities for Projects, the widget that allows users to upload documents and conduct work via Proton’s other products like email and cloud storage. Projects now come with user-controlled persistent memory, a function that allows Lumo to recall a user’s preferences across conversational sessions.

Additionally, the company says Lumo’s update makes it significantly more powerful than its previous version. The 2.0 version responds to most queries up to 76% faster than its previous iteration, the company says. The chatbot also comes with a new “thinking mode” for more complex problems or questions.

“Lumo 2.0 has been re-engineered from the ground up and the introduction of thinking mode gives it powerful new capabilities,” said Andy Yen, founder and CEO at Proton. “Lumo 2.0 demonstrates that users no longer need to choose between powerful AI capabilities and meaningful privacy protections.”

The public version of Lumo appears roughly equivalent to other major chatbots in terms of usefulness. It answers questions in a format similar to that of Gemini and ChatGPT, with approximately the same level of detail and context.

Yet Proton distinguishes Lumo from other chatbot providers with its privacy protections. It uses what it calls zero-access encryption architecture, which encrypts users’ data in transit and at rest, only allowing access to the user. The company also claims that no server-side logging of sessions is retained, so nobody at Proton can see the contents of conversations.
Proton also promises never to use customer data for AI training or share it with third parties. Lumo 2.0 is available immediately. In addition to the free public version, Proton offers paid tiers (Plus and Professional) that give those users significantly more access and resources.

15 minutes ago

Persistent’s Nagarro Gambit & the Billion Euro Bet on an AI-Driven Future

Persistent’s biggest acquisition to date is a strategic wager that scale, European incumbency and deeper enterprise relationships will matter more than ever in the AI era.

15 minutes ago

Kapture CX Secures $10 Mn Pre-Series B Funding to Scale Agentic Enterprise Stack

This investment, led by Bajaj Finserv Ventures, will support Kapture's expansion into global markets and boost R&D efforts.

15 minutes ago

Micron Is Having Its NVIDIA Moment

High-bandwidth memory has become AI’s next strategic battleground, and Micron is emerging as one of its biggest beneficiaries.

15 minutes ago
