AI Styling Studio — Infinite avatar looks from just 1 photo.Try it now.

BestAITools

Submit your Tool

8000+ AI tools already listed
8K+Tools
100K+/moViews
25K+/moVisitors

AI NewsStartup Gimlet Labs is solving the AI inference bottleneck in a surprisingly elegant way

Startup Gimlet Labs is solving the AI inference bottleneck in a surprisingly elegant way

1:19 AM IST · March 24, 2026

Startup Gimlet Labs is solving the AI inference bottleneck in a surprisingly elegant way

Stanford adjunct professor and successfully exited founder Zain Asgar just raised an $80 million Series A for a startup that solve the AI inference bottleneck problem in an astute way. The round was led by Menlo Ventures. The company,Gimlet Labs, has created what it claims is the first and only “multi-silicon inference cloud” which is software that allows an AI workload to be simultaneously run across diverse types of hardware. It can split an AI app’s work across both traditional CPUs and AI-tuned GPUs, as well as high-memory systems. “We basically run across whatever different hardware that’s available,” Asgar told TechCrunch. A single agent may chain together multiple steps, and each “requires different hardware: Inference is compute-bound; decode is memory-bound; and tool calls are network-bound,” writes lead investor, Menlo’s Tim Tully, in a blog post about the funding. No chip yet does it all, but as new hardware gets rolled out, and aging GPUs get redeployed, “the multi-silicon fleet is ready — it’s just missing the software layer to make it work.” That’s what Tully believes Gimlet Labs offers. If the current deploy-more-compute trend continues,McKinsey estimatesdata center spending will tally nearly $7 trillion by 2030. Asgar says that apps are only using the existing hardware already deployed “somewhere between 15 to 30 percent” of the time. “Another way to think about this: you’re wasting hundreds of billions of dollars because you’re just leaving idle resources,” he said. “Our goal was basically to try to figure out how you can get AI workloads to be 10x more efficient than ever, today.” So he and his cofounders, Michelle Nguyen, Omid Azizi, and Natalie Serrino, set about building orchestration software that slices up agentic workloads so that they can be simultaneous spread across all kinds of hardware. Gimlet Labs claims it reliably speeds AI inference up by 3x to 10x for the same cost and power. Gimlet says it can even slice the underlying model so that it runs across different architectures, using the best chip for each portion of the model. The company has already partnered with chip makers NVIDIA, AMD, Intel, ARM, Cerebras and d-Matrix. Gimlet’s product, delivered either as software or through an API to its own Gimlet Cloud, isn’t for the rank-and-file AI app developer. It’s for the largest AI model labs and data centers. The company publicly launchedin Octoberwith, it said, eight-figure revenues out of the gate (so at least $10 million). Asgar said that his customer base has more than doubled in the last four months and now includes a major model maker and an extremely large cloud computing company, although he declined to name them. The cofounders had previously worked together at Pixie, a startup that created an open source observability tool for Kubernetes. Pixie wasacquiredby New Relic in 2020, just two months after it launched with a $9 million Series A led by Benchmark. (Pixie’s tech is now part of the open source org that oversees Kubernetes.) After Asgar randomly ran into Tully about a year ago and also received angel investments from Stanford professors, VCs started calling. After launch, a term sheet landed on Asgar’s desk. When VCs heard Asgar was looking at offers, “we got a pretty big swarm of funding,” and the round was quickly oversubscribed, he said. With the previous seed, the startup has now raised a total of $92 million, including from a slew of angels like Sequoia’s Bill Coughran, Stanford Professor Nick McKeown, former CEO of VMware Raghu Raghuram and Intel CEO Lip-Bu Tan. The company currently employs 30 people. Other investors include Factory, who led the seed, Eclipse Ventures, Prosperity7 and Triatomic.

read more

Latest AI News

View All News →
The AI jobs debate just got messier

The AI jobs debate just got messier

AI-related job loss fears grow each time another companyannounces a round of layoffs. Through May of 2026, companies announced that close to90,000 job cutswere tied to AI, and, by some accounts, up to 15% of U.S. jobs areprojectedto beeliminated by AIover the next five years. Promises from the tech industry that AI will also create new jobs does little to ease fears, especially for the generation wondering if anyone will be hiring when they graduate. A recent report from Ramp and Revelio Labs, which track enterprise AI spend and workforce records from nearly 22,000 companies, respectively, complicates that gloomy narrative. The report found that companies spending heavily on AI are growing headcount faster, even in the entry-level roles that many fear are doomed. According to the report, “high-intensity adopters” — firms that spend on average $30 per employee per month on AI in the first three months — saw headcount increase 10.2%. Headcount also rose across functions, includingengineering, sales, administration, customer service, finance, marketing, and scientist roles. The strongest job growth among high-intensity adopters was in the information sector, which includes software, internet, media, and tech-adjacent firms. Despite these positive signals, the data isn’t as rosy as it seems. It skews heavily towards tech-forward, knowledge-work firms — ones that might have VC-backing and are growing fast anyway, making it difficult to say whether AI is contributing to the hiring or just showing up at companies that are expanding anyway. “This paper does not show that AI universally creates jobs,” the paper’s authors admit, “but it does counter claims that AI will lead to broad job losses.” It also counters claims that AI is killing all junior jobs.Recent researchfrom Goldman Sachs found that AI has already erased about 16,000 net jobs per month over the past year, with Gen Z and entry level workers taking the brunt of the burden. But in tech-forward firms, the report finds that entry-level headcount actually rose by 12%. So what can we take away from this? Perhaps that AI isn’t always a tool for labor substitution, but that it can be a tool for firm-expansion instead. “For software and technology firms, AI can make core output cheaper or faster to produce: writing code, debugging, building internal tools, producing technical documentation, and supporting product development,” the report reads. “Lower production costs in these workflows can raise the return to expanding the whole firm, not just the engineering team.” But companies that buy subscriptions and run pilots, yet did not go on to make sustained investments, don’t tend to see any gains in headcount, per the report. That sets up the potential for awidening gapbetween firms that have the resources — like capital, technical staff, founder networks, and management bandwidth — to turn AI adoption into actual business gains and those that are stuck experimenting with subscriptions. In other words, this report suggests that firms that already have the resources are the ones who will see the largest gains. The paper’s authors speculate such a divide may continue to grow, saying: “Firms without those channels may fall behind.”

1 hour ago

View

Gujarat Taps IBM, IAIRO to establish Industrial AI Centre of Excellence

Gujarat Taps IBM, IAIRO to establish Industrial AI Centre of Excellence

Envisioned as a ‘living lab’, the Industrial AI CoE will support the development, testing, and adoption of industrial AI applications

1 hour ago

View

CoreWeave Launches ARIA to Help Researchers Find Hidden Patterns in AI Experiments

CoreWeave Launches ARIA to Help Researchers Find Hidden Patterns in AI Experiments

ARIA analyses thousands of experiment runs in minutes, surfaces hidden patterns and recommends improvements to accelerate AI innovation

1 hour ago

View

Delhi to Roll Out AI-Enabled PUCC 3.0 Ahead of Winter Pollution Season: Report

Delhi to Roll Out AI-Enabled PUCC 3.0 Ahead of Winter Pollution Season: Report

The new PUCC 3.0 system will use AI, geotagging and encrypted data transmission to curb fraudulent vehicle emission certificates.

1 hour ago

View