Explained: The Tech That Powers ChatGPT, Google Gemini And Meta AI

AI Explainer: Large Language Models - the underlying technology of ChatGPT, Google Gemini and Meta AI

Artificial Intelligence Written by Zaid Nazir

Updated : June 25, 2024 6:22 pm IST

ChatGPT, Google Gemini and Meta AI are all LLMs that work by predicting the next word, using word vectors

New Delhi:

Ever wondered how ChatGPT works? The short answer to this complex question lies in Large Language Models (LLMs), foundational models trained on large amounts of text. These models do not process words as humans do. Instead, each word is represented by a long series of numbers, and it is this numerical form that is fed to the computer.

These sequences of numbers are known as word vectors. Each can be imagined as a single point in an imaginary space, with words that have similar meanings placed closer to each other. The scale of each model is massive and almost impossible to envision, but for reference, GPT-4 is reported to have a staggering 1.76 trillion parameters, with millions of unique word vectors, according to a June 28, 2023 report by SemiAnalysis, a US-based independent AI research and analysis company. Processing such a huge number of vectors with trillions of parameters has become possible due to the dramatic advancement of computing power over the last few years. Most recently, on June 19, Nvidia became the largest public company in the world by market capitalisation, surpassing Microsoft and Apple, as a result of surging demand for its AI-capable chipsets.
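
To make the idea of "closeness" between word vectors concrete, here is a minimal sketch in Python. The three-number vectors are invented for illustration (real models learn vectors with hundreds or thousands of numbers), and cosine similarity is one common way to measure closeness, not necessarily the one any particular model uses.

```python
import math

# Toy word vectors. These numbers are made up for illustration only;
# real models learn much longer vectors from training data.
WORD_VECTORS = {
    "cat": [0.9, 0.8, 0.1],
    "dog": [0.8, 0.9, 0.2],
    "car": [0.1, 0.2, 0.9],
}


def cosine_similarity(a, b):
    """Return a score near 1.0 when two vectors point in a similar direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


cat_dog = cosine_similarity(WORD_VECTORS["cat"], WORD_VECTORS["dog"])
cat_car = cosine_similarity(WORD_VECTORS["cat"], WORD_VECTORS["car"])

print(f"cat vs dog: {cat_dog:.2f}")
print(f"cat vs car: {cat_car:.2f}")
```

With these toy numbers, "cat" scores much closer to "dog" than to "car", which is the kind of relationship the imaginary space is meant to capture.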

ChatGPT, Google Gemini and Meta AI are all LLMs that work by predicting the next word, using word vectors. The text a user types in as a "prompt" is converted into word vectors, and a neural network architecture known as the transformer turns those vectors into a prediction of the next word.
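
The last step of "predicting the next word" can be sketched as follows. This is a simplified illustration, not the code of any of these products: the candidate words and their scores are hypothetical, standing in for what a trained model would compute for a prompt such as "The cat sat on the". The softmax function, which turns raw scores into probabilities, is a standard technique.

```python
import math


def softmax(scores):
    """Convert raw scores into probabilities that add up to 1."""
    highest = max(scores)  # subtracting the maximum keeps exp() numerically stable
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical candidate words and scores; a real model scores every word
# in a vocabulary of many thousands of entries.
candidates = ["mat", "moon", "banana"]
scores = [3.0, 1.0, 0.2]

probabilities = softmax(scores)

# Pick the most likely candidate as the next word.
next_word = candidates[probabilities.index(max(probabilities))]

for word, p in zip(candidates, probabilities):
    print(f"{word}: {p:.2f}")
print("predicted next word:", next_word)
```

Picking the single most likely word is only one option. Chatbots often sample from the probabilities instead, which is one reason the same prompt can produce different answers.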

How Is Text Prediction Done In LLMs?

LLMs are multi-layered. Each layer is a transformer, a type of neural network (imagine artificial neurons). The transformer takes in the word vectors of the input text, and inside each layer every word vector looks around at the other words in the prompt and pulls in relevant information, a mechanism known as attention. This process is repeated layer after layer, and again for every new word the model generates, progressively refining the prediction of "the next word" in the sequence.
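
How words "look around and interact" can be sketched with a stripped-down version of the attention step. This is a teaching sketch under strong assumptions: the vectors are invented, and the learned weight matrices that real transformers apply before comparing words are left out, so each vector is compared with the others directly.

```python
import math


def softmax(scores):
    """Convert raw scores into weights that add up to 1."""
    highest = max(scores)
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(vectors):
    """Simplified attention: each word vector becomes a weighted mix of all
    vectors in the prompt, weighted by how similar they are to it.
    Real transformers first apply learned projections, omitted here."""
    dim = len(vectors[0])
    updated = []
    for query in vectors:
        # Compare this word with every word in the prompt (scaled dot product).
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
            for key in vectors
        ]
        weights = softmax(scores)
        # Blend all word vectors according to the attention weights.
        mixed = [
            sum(w * value[i] for w, value in zip(weights, vectors))
            for i in range(dim)
        ]
        updated.append(mixed)
    return updated


# Three made-up 2-number word vectors standing in for a three-word prompt.
prompt_vectors = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]

# Stack two layers: the output of one layer is the input to the next.
layer_output = prompt_vectors
for _ in range(2):
    layer_output = self_attention(layer_output)

print(layer_output)
```

Each pass leaves one vector per word, but every vector now carries some information from the rest of the prompt.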

How Are LLMs Trained?

LLMs are trained using unsupervised learning, eliminating the need for human labelling of data. Data from web pages, books, and other textual sources is fed to the models before they are released to the public. LLMs have also courted controversy for reflecting human biases in some cases. Most notably, Microsoft's Twitter chatbot Tay, Google's Gemini and OpenAI's Sora (a text-to-video converter) have drawn criticism over the years for giving bigoted, racist and gender-discriminatory responses. To its credit, the industry has responded to the challenge and is constantly evolving to reduce human biases in LLMs.
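
Why no human labelling is needed can be shown with a small sketch: in raw text, the "correct answer" for each word is simply the word that follows it. The example below is an assumption-laden stand-in, not how LLMs are actually built. It uses a made-up two-sentence corpus and a simple counting model, whereas real LLMs adjust billions of neural network parameters.

```python
from collections import Counter, defaultdict

# A tiny made-up corpus standing in for web pages and books.
corpus = "the cat sat on the mat . the cat ate the fish ."
words = corpus.split()

# Each training example pairs a word with the word that follows it.
# The text supplies its own answers, so no human labels are required.
training_pairs = list(zip(words[:-1], words[1:]))

# "Training" here just counts which word tends to follow which.
next_word_counts = defaultdict(Counter)
for current, following in training_pairs:
    next_word_counts[current][following] += 1


def predict_next(word):
    """Return the word seen most often after `word` in the corpus.
    Raises IndexError for a word that never appeared with a successor."""
    return next_word_counts[word].most_common(1)[0][0]


print(training_pairs[:3])
print("after 'the':", predict_next("the"))
```

The sketch also hints at where bias comes from: the model can only repeat patterns present in its training text, so skewed text produces skewed predictions.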
