From LLMs to hallucinations, here’s a simple guide to common AI terms

9:51 PM IST · April 12, 2026

Artificial intelligence is a deep and convoluted world. The scientists who work in this field often rely on jargon and lingo to explain what they’re working on. As a result, we frequently have to use those technical terms in our coverage of the artificial intelligence industry. That’s why we thought it would be helpful to put together a glossary with definitions of some of the most important words and phrases that we use in our articles. We will regularly update this glossary to add new entries as researchers continually uncover novel methods to push the frontier of artificial intelligence while identifying emerging safety risks. Artificial general intelligence, or AGI, is a nebulous term. But it generally refers to AI that’s more capable than the average human at many, if not most, tasks. OpenAI CEO Sam Altmanrecentlydescribed AGI as the “equivalent of a median human that you could hire as a co-worker.” Meanwhile,OpenAI’s charterdefines AGI as “highly autonomous systems that outperform humans at most economically valuable work.” Google DeepMind’s understanding differs slightly from these two definitions; the lab views AGI as “AI that’s at least as capable as humans at most cognitive tasks.” Confused? Not to worry —so are experts at the forefront of AI research. An AI agent refers to a tool that uses AI technologies to perform a series of tasks on your behalf — beyond what a more basic AI chatbot could do — such as filing expenses, booking tickets or a table at a restaurant, or even writing and maintaining code. However, as we’veexplained before, there are lots of moving pieces in this emergent space, so “AI agent” might mean different things to different people. Infrastructure is also still being built out to deliver on its envisaged capabilities. But the basic concept implies an autonomous system that may draw on multiple AI systems to carry out multistep tasks. Given a simple question, a human brain can answer without even thinking too much about it — things like “which animal is taller, a giraffe or a cat?” But in many cases, you often need a pen and paper to come up with the right answer because there are intermediary steps. For instance, if a farmer has chickens and cows, and together they have 40 heads and 120 legs, you might need to write down a simple equation to come up with the answer (20 chickens and 20 cows). In an AI context, chain-of-thought reasoning for large language models means breaking down a problem into smaller, intermediate steps to improve the quality of the end result. It usually takes longer to get an answer, but the answer is more likely to be correct, especially in a logic or coding context. Reasoning models are developed from traditional large language models and optimized for chain-of-thought thinking thanks to reinforcement learning. (See:Large language model) Although somewhat of a multivalent term, compute generally refers to the vitalcomputational powerthat allows AI models to operate. This type of processing fuels the AI industry, giving it the ability to train and deploy its powerful models. The term is often a shorthand for the kinds of hardware that provides the computational power — things like GPUs, CPUs, TPUs, and other forms of infrastructure that form the bedrock of the modern AI industry. A subset of self-improving machine learning in which AI algorithms are designed with a multi-layered, artificial neural network (ANN) structure. This allows them to make more complex correlations compared to simpler machine learning-based systems, such as linear models or decision trees. The structure of deep learning algorithms draws inspiration from the interconnected pathways of neurons in the human brain. Deep learning AI models are able to identify important characteristics in data themselves, rather than requiring human engineers to define these features. The structure also supports algorithms that can learn from errors and, through a process of repetition and adjustment, improve their own outputs. However, deep learning systems require a lot of data points to yield good results (millions or more). They also typically take longer to train compared to simpler machine learning algorithms — so development costs tend to be higher. (See:Neural network) Diffusion is the tech at the heart of many art-, music-, and text-generating AI models. Inspired by physics,diffusion systems slowly “destroy” the structure of data— for example, photos, songs, and so on — by adding noise until there’s nothing left. In physics, diffusion is spontaneous and irreversible — sugar diffused in coffee can’t be restored to cube form. But diffusion systems in AI aim to learn a sort of “reverse diffusion” process to restore the destroyed data, gaining the ability to recover the data from noise. Distillation is a technique used to extract knowledge from a large AI model with a ‘teacher-student’ model. Developers send requests to a teacher model and record the outputs. Answers are sometimes compared with a dataset to see how accurate they are. These outputs are then used to train the student model, which is trained to approximate the teacher’s behavior. Distillation can be used to create a smaller, more efficient model based on a larger model with a minimal distillation loss. This is likely how OpenAI developed GPT-4 Turbo, a faster version of GPT-4. While all AI companies use distillation internally, it may have also been used by some AI companies to catch up with frontier models. Distillation from a competitor usuallyviolatesthe terms of service of AI API and chat assistants. This refers to the further training of an AI model to optimize performance for a more specific task or area than was previously a focal point of its training — typically by feeding in new, specialized (i.e., task-oriented) data. Many AI startups are taking large language models as a starting point to build a commercial product but are vying to amp up utility for a target sector or task by supplementing earlier training cycles with fine-tuning based on their own domain-specific knowledge and expertise. (See:Large language model [LLM]) A GAN, or Generative Adversarial Network, is a type of machine learning framework that underpins some important developments in generative AI when it comes to producing realistic data – including (but not only) deepfake tools. GANs involve the use of a pair of neural networks, one of which draws on its training data to generate an output that is passed to the other model to evaluate. This second, discriminator model thus plays the role of a classifier on the generator’s output – enabling it to improve over time. The GAN structure is set up as a competition (hence “adversarial”) – with the two models essentially programmed to try to outdo each other: the generator is trying to get its output past the discriminator, while the discriminator is working to spot artificially generated data. This structured contest can optimize AI outputs to be more realistic without the need for additional human intervention. Though GANs work best for narrower applications (such as producing realistic photos or videos), rather than general purpose AI. Hallucination is the AI industry’s preferred term for AI models making stuff up – literally generating information that is incorrect. Obviously, it’s a huge problem for AI quality. Hallucinations produce GenAI outputs that can be misleading and could even lead to real-life risks — with potentially dangerous consequences (think of a health query that returns harmful medical advice). This is why most GenAI tools’ small print now warns users to verify AI-generated answers, even though such disclaimers are usually far less prominent than the information the tools dispense at the touch of a button. The problem of AIs fabricating information is thought to arise as a consequence of gaps in training data. For general purpose GenAI especially — also sometimes known as foundation models — this looks difficult to resolve. There is simply not enough data in existence to train AI models to comprehensively resolve all the questions we could possibly ask. TL;DR: we haven’t invented God (yet). Hallucinations are contributing to a push towards increasingly specialized and/or vertical AI models — i.e. domain-specific AIs that require narrower expertise – as a way to reduce the likelihood of knowledge gaps and shrink disinformation risks. Inference is the process of running an AI model. It’s setting a model loose to make predictions or draw conclusions from previously seen data. To be clear, inference can’t happen without training; a model must learn patterns in a set of data before it can effectively extrapolate from this training data. Many types of hardware can perform inference, ranging from smartphone processors to beefy GPUs to custom-designed AI accelerators. But not all of them can run models equally well. Very large models would take ages to make predictions on, say, a laptop versus a cloud server with high-end AI chips. [See:Training] Large language models, or LLMs, are the AI models used by popular AI assistants, such asChatGPT,Claude,Google’s Gemini,Meta’s AI Llama,Microsoft Copilot, orMistral’s Le Chat. When you chat with an AI assistant, you interact with a large language model that processes your request directly or with the help of different available tools, such as web browsing or code interpreters. AI assistants and LLMs can have different names. For instance, GPT is OpenAI’s large language model and ChatGPT is the AI assistant product. LLMs are deep neural networks made of billions of numerical parameters (or weights, see below) that learn the relationships between words and phrases and create a representation of language, a sort of multidimensional map of words. These models are created from encoding the patterns they find in billions of books, articles, and transcripts. When you prompt an LLM, the model generates the most likely pattern that fits the prompt. It then evaluates the most probable next word after the last one based on what was said before. Repeat, repeat, and repeat. (See:Neural network) Memory cache refers to an important process that boosts inference (which is the process by which AI works to generate a response to a user’s query). In essence, caching is an optimization technique, designed to make inference more efficient. AI is obviously driven by high-octane mathematical calculations and every time those calculations are made, they use up more power. Caching is designed to cut down on the number of calculations a model might have to run by saving particular calculations for future user queries and operations. There are different kinds of memory caching, although one of the more well-known isKV (or key value) caching. KV caching works in transformer-based models, and increases efficiency, driving faster results by reducing the amount of time (and algorithmic labor) it takes to generate answers to user questions. (See:Inference) A neural network refers to the multi-layered algorithmic structure that underpins deep learning — and, more broadly, the whole boom in generative AI tools following the emergence of large language models. Although the idea of taking inspiration from the densely interconnected pathways of the human brain as a design structure for data processing algorithms dates all the way back to the 1940s, it was the much more recent rise of graphical processing hardware (GPUs) — via the video game industry — that really unlocked the power of this theory. These chips proved well suited to training algorithms with many more layers than was possible in earlier epochs — enabling neural network-based AI systems to achieve far better performance across many domains, including voice recognition, autonomous navigation, and drug discovery. (See:Large language model [LLM]) RAMageddon is the fun new term for a not-so-fun trend that is sweeping the tech industry: an ever-increasing shortage of random access memory, or RAM chips, which power pretty much all the tech products we use in our daily lives. As the AI industry has blossomed, the biggest tech companies and AI labs — all vying to have the most powerful and efficient AI — are buying so much RAM to power their data centers that there’s not much left for the rest of us. And that supply bottleneck means that what’s left is getting more and more expensive. That includes industries like gaming (where major companies have had toraise prices on consolesbecause it’s harder to find memory chips for their devices), consumer electronics (where memory shortage could causethe biggest dip in smartphone shipmentsin more than a decade), and general enterprise computing (because those companies can’t get enough RAM for their own data centers). The surge in prices is only expected to stop after the dreaded shortage ends but, unfortunately, there’snot really much of a signthat’s going to happen anytime soon. Developing machine learning AIs involves a process known as training. In simple terms, this refers to data being fed in in order that the model can learn from patterns and generate useful outputs. Things can get a bit philosophical at this point in the AI stack — since, pre-training, the mathematical structure that’s used as the starting point for developing a learning system is just a bunch of layers and random numbers. It’s only through training that the AI model really takes shape. Essentially, it’s the process of the system responding to characteristics in the data that enables it to adapt outputs towards a sought-for goal — whether that’s identifying images of cats or producing a haiku on demand. It’s important to note that not all AI requires training. Rules-based AIs that are programmed to follow manually predefined instructions — for example, such as linear chatbots — don’t need to undergo training. However, such AI systems are likely to be more constrained than (well-trained) self-learning systems. Still, training can be expensive because it requires lots of inputs — and, typically, the volumes of inputs required for such models have been trending upwards. Hybrid approaches can sometimes be used to shortcut model development and help manage costs. Such as doing data-driven fine-tuning of a rules-based AI — meaning development requires less data, compute, energy, and algorithmic complexity than if the developer had started building from scratch. [See:Inference] When it comes to human-machine communication, there are some obvious challenges. People communicate using human language, while AI programs execute tasks and respond to queries through complex algorithmic processes that are informed by data. In their simplest definition, tokens represent the basic building blocks of human-AI communication, in that they are discrete segments of data that have either been processed or produced by an LLM. Tokens are created via a process known as “tokenization,” which breaks down raw data and refines it into distinct units that are digestible to an LLM. Similar to how a software compiler translates human language into binary code that a computer can digest, tokenization interprets human language for an AI program via their user queries so that it can prepare a response. There are several different kinds of tokens — including input tokens (the kind that must be generated in response to a human user’s query), output tokens (the kind that are generated as the LLM responds to the human’s request), and reasoning tokens, which involve longer, more intensive tasks and processes that occur as part of a user request. With enterprise AI, token usage also determines costs. Since tokens are equivalent to the amount of data being processed by a model, they have also become the means by which the AI industry monetizes its services. Most AI companies charge for LLM usage on a per-token-basis. Thus, the more tokens a business burns as it uses an AI program (ChatGPT, for example), the more money it will have to pay its AI service provider (OpenAI). A technique where a previously trained AI model is used as the starting point for developing a new model for a different but typically related task – allowing knowledge gained in previous training cycles to be reapplied. Transfer learning can drive efficiency savings by shortcutting model development. It can also be useful when data for the task that the model is being developed for is somewhat limited. But it’s important to note that the approach has limitations. Models that rely on transfer learning to gain generalized capabilities will likely require training on additional data in order to perform well in their domain of focus (See:Fine tuning) Weights are core to AI training, as they determine how much importance (or weight) is given to different features (or input variables) in the data used for training the system — thereby shaping the AI model’s output. Put another way, weights are numerical parameters that define what’s most salient in a dataset for the given training task. They achieve their function by applying multiplication to inputs. Model training typically begins with weights that are randomly assigned, but as the process unfolds, the weights adjust as the model seeks to arrive at an output that more closely matches the target. For example, an AI model for predicting housing prices that’s trained on historical real estate data for a target location could include weights for features such as the number of bedrooms and bathrooms, whether a property is detached or semi-detached, whether it has parking, a garage, and so on. Ultimately, the weights the model attaches to each of these inputs reflect how much they influence the value of a property, based on the given dataset. This article is updated regularly with new information.

Latest AI News

View All News →

Anthropic’s Claude Tag is learning your company, one Slack message at a time

Anthropic is introducing Claude Tag in research preview, an “always-on Claude” that lives in Slack and acts as an AI teammate. The new feature — which allows users to tag @Claude to provide insights in chats and assign tasks — will begin in research preview, available through Slack for Claude Enterprise and Claude Team customers. Claude Tag is an evolution ofseveral integrations that already exist. Users can already DM @Claude within Slack or tag it in channels for on-demand help, andClaude Code in Slackroutes coding tasks from channel mentions to full coding sessions on the web, posting updates back into the thread. But Claude Tag adds a layer of persistent context and memory that would be difficult to maintain with previous tools. “As Claude follows along with its channel, it learns ever more about the work,” reads a statement from Anthropic. “Claude can also automatically gather facts from elsewhere in the organization, if it’s granted permission to read other channels.” With Claude Tag, everyone in a given Slack channel can access a single Claude identity, meaning “anyone can see what Claude has been working on, and can pick up the conversation from where the last person left off.” System administrators will specify which tools, information, and channels Claude can access, and each Claude identity will stay scoped to whichever channels the admins define, so that a Claude set up for legal work can’t seed memories into the engineering channel, for example. When assigned a specific task, Claude Tag will break down the task into stages and will work through them using whichever tools it has access to, responding in a Slack thread with what it has created. But Claude Tag also features an ambient mode that proactively jumps into the chat of its own accord to keep your team updated, flag things from across the organization, and follow up on threads or tasks that have been forgotten. Anthropic says this makes it feel like you’re “working with a real colleague — one that can produce work in public view, with far greater context and understanding than before.” That context is an increasingly critical part of enterprise deployments, and Anthropic isn’t the only company focused on it. Microsoft also has Graph, expressed through Copilot and Work IQ.Snowflakeand Databricks are positioning their platforms as the back-end support containing tacit organizational knowledge that agents can tap into.Glean is also building an intelligence layerthat understands company context and sits between the model and the enterprise data.

3 hours ago

View

Fika Jobs raises $4M to build a video-first hiring platform where AI agents interview candidates

The hiring process has long been criticized for its inefficiency and opacity. Candidates spend hours writing applications and submitting cover letters, only to disappear into what often feels like a black box. Generative AI has only made things messier, with employers increasingly relying on AI-powered screening systems to sift through an overwhelming number of submissions. Stockholm-based startupFika Jobsthinks there’s a better way. The company is building a video-first hiring platform that combines AI interview agents with short-form video profiles, creating something that feels like a cross between LinkedIn and TikTok. Instead of relying solely on resumes, candidates complete AI-powered interviews designed to showcase their personality and communication skills. Fika Jobs announced on Tuesday a $4 million pre-seed round, which will be used to continue developing the platform, grow the team, and prepare for a wider launch later this year. For job seekers, the process starts by connecting a LinkedIn profile. Fika’s AI reviews the candidate’s background and generates personalized interview questions. Candidates then complete a roughly 10-minute video interview with the AI agent, currently powered by Google’s Gemini models. After the interview, Fika automatically turns responses into short video clips and organizes them into a profile. Instead of applying to every new role, candidates maintain a live profile that employers can discover and revisit as new opportunities arise. The idea came from co-founders and brothers Jakob Dubois (CEO) and Alexander Dubois (CTO) while they were building their previous startup. “When we were building [social app] Gaff, we spent a lot of time recruiting and almost passed on a candidate because his resume did not really stand out,” Jakob Dubois told TechCrunch. “We ended up speaking with him anyway, and within minutes, his grit, drive, and ambition became obvious. Exactly the kind of person we wanted to hire.” That experience convinced the founders that some traits that employers care about most are difficult to capture on paper. Unlike most competitors (Alex,Maki, andMercor, among others) that focus on helping employers source, screen, and match candidates more efficiently with AI, Fika is building a platform where candidates maintain video-first profiles and employers browse a pool of people who have already been interviewed and evaluated by AI. If successful, Fika Jobs could help employers assess communication skills and cultural fit early in the hiring process, complementing traditional resume and application reviews. This approach may be especially valuable for early-career professionals and candidates from non-traditional backgrounds, whose potential is not always apparent from a resume alone. Of course, video profiles introduce real bias risks that are also worth acknowledging. When employers can see a candidate’s race, age, gender, physical appearance, and accent before evaluating their qualifications, it opens the door to discrimination that a resume, for all its flaws, at least partially obscures. There’s a reason some companies have moved toward blind resume screening. The platform plans to open early access to candidates this week, with a broader public launch expected this fall. The company will initially focus on Sweden before expanding internationally. Fika currently has a small team but expects to reach around 10 employees by the end of the year. More than 100 companies are on the waitlist, say the founders, though they declined to disclose which ones. Separately, they said more than 50 companies have tested the platform, including Plenty Labs, SICS.ai, Kognity, and Rebtel. The platform is free for job seekers. Employers pay nothing up front, but Fika takes 10% of a candidate’s first-year salary upon a successful hire. (The company notes that this is lower than the 20% to 30% placement fees often charged by traditional recruiters and headhunters.) The round was led by Luminar Ventures, with participation from Alliance VC and King co-founders Sebastian Knutsson and Riccardo Zacconi, the duo best known for creating the hit mobile game Candy Crush.

7 hours ago

View

4 days left to save up to $190 on TechCrunch Founder Summit 2026

Founders don’t grow alone. The best founders learn from peers facing similar challenges, gain insights from operators who have already scaled, and build relationships with investors who can help fuel the next stage of growth. You have just four days left to save up to $190 on your pass toTechCrunch Founder Summit 2026before Early Bird rates end on June 26 at 11:59 p.m. PT. On November 4 in Boston, more than 1,000 founders and investors will come together for a full day of practical insights, peer-to-peer learning, and meaningful networking designed to help startups grow faster. This is TechCrunch’s flagship founder conference, built specifically for founders. Whether you’re preparing to raise capital, scaling revenue, hiring your next team members, or planning your next major milestone, Founder Summit connects you with the people and strategies that can help move your company forward. Register by June 26 to save up to $190 on your pass.Groups of four or more can save up to 30%. TechCrunch Founder Summitis designed to deliver practical takeaways you can put into action immediately. You’ll connect with: The conversations are candid, focused, and designed to help founders solve real business challenges.Register here to save up to $190. Founder Summitprogramming focuses on the decisions that shape a company’s future. Through breakout sessions and roundtable discussions, you’ll gain insights you can apply right away. Past topics have included: Whether you’re raising your first round or scaling toward your next major milestone, these sessions are built to help you make smarter decisions and move faster.Register here to save up to $190. Previous speakers have shared firsthand lessons on company building, fundraising, and growth, including: Additional speakers have included leaders from Sequoia Capital, NFX, Underscore VC, Glasswing Ventures, Wing Venture Capital, Construct Capital, Greylock, and Precursor Ventures. The 2026 agenda is currently taking shape, with more founders, operators, and investors to be announced soon on theevent page. Interested in leading the conversation?Submit a topicfor a breakout or roundtable session for a chance to be voted onto the agenda by the TechCrunch audience. TechCrunch Founder Summit 2026is where founders come to gain practical insights, build valuable relationships, and accelerate growth. Join 1,000+ founders and investors in Boston on November 4 for a day of learning, networking, and conversations that can help shape your company’s future. Early Bird savings end in just four days, June 26 at 11:59 p.m. PT.Register now to save up to $190on your pass and up to 30% when registering as a group before prices increase.

7 hours ago

View

Tredence Acquires KMK Consulting to Expand Healthcare, Life Sciences Business

The company targets 25% revenue from the sector by 2028.

7 hours ago

View

Submit your Tool

BestAITools.online is an AI Tools Directory helping individuals, businesses, and creators discover the best AI tools for writing, coding, design, productivity, and more.

Contact Promote Terms of Service Refund Policy Privacy Policy Email

From LLMs to hallucinations, here’s a simple guide to common AI terms

Latest AI News

Anthropic’s Claude Tag is learning your company, one Slack message at a time

Fika Jobs raises $4M to build a video-first hiring platform where AI agents interview candidates

4 days left to save up to $190 on TechCrunch Founder Summit 2026

Tredence Acquires KMK Consulting to Expand Healthcare, Life Sciences Business

Quick Links

AI Tools by Task

Alternatives

Best AI Tools

Top AI Tools

Browse & Compare

Featured Lists

Discover