Meta’s Llama 3 Ignites AI Battle: Is it a GPT-4 Killer or Just Hype? Open Source Titan Takes the Field! | by Rohan Sharma | Apr, 2024

Rohan Sharma(@techie150) talking open supply fashions in Chief Knowledge Officer Summit

Meta simply dropped their newest AI mannequin, the Llama 3, and it’s type of an enormous deal. They’ve acquired variations with 8 billion and a whopping 70 billion parameters. Right here’s the news:

They’ve upgraded the tokenizer in Llama 3. Now it handles a large 128K tokens, which is far more environment friendly than the final model — like 15% extra compact.
They added this cool function referred to as Grouped Question Consideration to all of the fashions. Even the smaller fashions get a lift with this, in contrast to in Llama 2 the place solely the large man had it.
This factor was pre-trained on 15 trillion tokens, and most of that’s in English. They used 16,000 GPUs on the identical time to coach it. Meta’s additionally cooking up some new instruments to raised handle GPU time, which may very well be a game-changer.
Right here’s a enjoyable reality: they used Llama 2 to scrub up the info for this new mannequin. Exhibits you the way these language fashions will be tremendous helpful past simply chatting.
They’re making an attempt out a brand new approach to fine-tune these fashions too, mixing up reasoning tracing with desire rating to chop down on errors. It’s one thing like what OpenAI did earlier than.
There’s additionally this new library referred to as TorchTune. It’s constructed proper into PyTorch, and it’s imagined to make working with these large language fashions simpler and fewer memory-hungry.
On the accountability entrance, Meta’s not pulling any punches. They’re pushing arduous on making AI safer with stuff like Llama Guard 2 and Code Defend.
Efficiency-wise, Llama 3 is top-notch. It’s smashing information, particularly in reasoning duties. They in contrast it to Claude however haven’t stacked it up in opposition to GPT-4 but. They’re hinting at an even bigger mannequin, possibly 400 billion parameters, which sounds prefer it may actually shake issues up.
Better of all, Llama 3 is open supply. You will discover it on platforms like Hugging Face and WatsonX, which is fairly superior.

So yeah, it appears like Llama 3 was positively well worth the wait. Appears to be like like Meta’s pushing the envelope on what these AI fashions can do. Can’t wait to see what’s subsequent!

Source link

Meta’s Llama 3 Ignites AI Battle: Is it a GPT-4 Killer or Just Hype? Open Source Titan Takes the Field! | by Rohan Sharma | Apr, 2024

Working with Input-Convex Neural Networks part3(Machine Learning 2024) | by Monodeep Mukherjee | Jul, 2024

Embracing the Future: The Rise of AI-Driven Development in Software Engineering The software… | by DevBlogs | Jul, 2024

Research on Metaheuristic methods part4(Machine Learning 2024) | by Monodeep Mukherjee | Jul, 2024

How Real-Time Data Analytics and AI Are Transforming Heavy Equipment Operations

NVIDIA Accelerates Google Quantum AI Processor Design With Simulation of Quantum Device Physics

Game Development and Cloud Computing: Benefits of Cloud-Native Game Servers

Teradata AI Unlimited in Microsoft Fabric is Now Available for Public Preview through Microsoft Fabric Workload Hub

Cognigy Unveils Agentic AI: Transforming the Future of Enterprise Contact Centers

Our Picks

XGBoost Hyperparameter Tuning. XGBoost Hyperparameter Tuning series… | by My Skill | May, 2024

Step into the future: a 48-qubit programmable processor

What is Intelligent Document Processing?

Most Popular

Revolutionizing the Way We Find Love

Will GenAI Replace Data Engineers? No – And Here’s Why.

Assortment Optimization Machine Learning | by Danishaliarshar | Mar, 2024

Meta’s Llama 3 Ignites AI Battle: Is it a GPT-4 Killer or Just Hype? Open Source Titan Takes the Field! | by Rohan Sharma | Apr, 2024

Related Posts