Meta Breaks Silence, March RAGness, Cloudless Skies at Cloud Next …

by | News/Events

Midjourney depiction of lama playing basketball

The AI Weekly Breakthrough | Issue 6 | April 17, 2024

Welcome to The AI Weekly Breakthrough, a roundup of the news, technologies, and companies changing the way we work and live

Meta Breaks Silence, Announces Llama 3

Midjourney depiction of lama sitting in a chair

Meta, relatively quiet as of late, breaks its silence this week to make two AI-related announcements. First, Meta announces that Llama 3, its new suite of next-gen foundation models, will be released within the next month. Llama 3 is expected to have about 140 billion parameters, twice as many as are available in the largest Llama 2 model. Users should expect a release similar to Anthopic’s Claude 3, which included three models that differed in size and cost. Second, Meta announced the next generation of their Meta Training and Inference Accelerator (MTIA), their family of custom-made chips designed for Meta’s AI workloads. This is something we’re seeing all the big players do – improve and elevate their AI infrastructure in order to make bigger and better performing AI models.

March RAGness Results Are In

RAG Leaderboard

Tonic.ai has published the results of their March RAG leaderboard that looks surprisingly similar to another March bracket with an acronym. In this competition, Tonic tested the top RAG systems against each other. Using their Tonic Validate answer similarity score, they evaluated the response quality of each RAG system on a set of benchmark Q&As. Cohere placed first with a score of 4.62 out of 5, followed by Haystack in second place at 4.40, and Langchain landing in the third spot with a 4.0 score. At the bottom of the list were Amazon Bedrock at 3.16 and OpenAI’s November 2023 API version at a mere 2.42.

Cloudless Skies at Google Cloud Next

Google Cloud Next

This year’s Cloud Next Conference, hosted by Google in Las Vegas, wasn’t so much about the cloud as it was about generative AI. The event showcased a suite of generative AI-powered tools across Google’s ecosystem, including Gemini Code Assist, Google’s answer to enhancing coding efficiency and quality, rivaling Github’s Copilot. Google also unveiled the ability to use generative text to create live images with Imagen 2, and its Vertex AI Agent Builder, which empowers developers to craft multimodal AI agents.Topping off the roster, Google announced that Gemini 1.5 Pro is in public preview on Vertex AI, accessible in over 180 countries and boasting an industry-leading one million token context window.

Rerank 3 Boosts Relevance, Lowers Costs

Rerank 3 from Cohere

Cohere, a leader in AI models specifically designed for enterprise generative AI projects, has released Rerank 3. Working with a RAG pipeline, Rerank 3 helps assess the relevancy of retrieved documents before the LLM processes the retrieved data. This improves the overall relevance and accuracy of the LLM’s response and, since the LLM only has to process a smaller set of relevant data, it brings the cost down of the entire generative AI system. Improvements in Rerank 3 include a larger 4K token context window length; the ability to search over multi-aspect and semi-structured data like emails, codes and tables; and multilingual coverage for 100+ countries.

X Releases Grok 1.5V

Grok-1.5 Vision

X announced the release of Grok 1.5V, its first-generation multimodal AI model. In addition to Grok’s text capabilities, it can now process visual information like documents, diagrams, screenshots, and photographs. According to X, Grok performs comparably with other frontier models like GPT-4V, Gemini 1.5 Pro and Claude 3 Opus, and outperforms them all in their newly defined “RealWorldQA” benchmark that measures real-world spatial understanding.

Augment Yourself 🤖

🔥 For more AI News brought to you via email subscribe to our newsletter here.
👀 Want to know more about Shelf’s suite of AI solutions? Check out our website here.

Meta Breaks Silence, March RAGness, Cloudless Skies at Cloud Next …: image 1

Read more from Shelf

May 23, 2024RAG
Meta Breaks Silence, March RAGness, Cloudless Skies at Cloud Next …: image 2 10-Step RAG System Audit to Eradicate Bias and Toxicity
As the use of Retrieval-Augmented Generation (RAG) systems becomes more common in countless industries, ensuring their performance and fairness has become more critical than ever. RAG systems, which enhance content generation by integrating retrieval mechanisms, are powerful tools to improve...

By Vish Khanna

May 23, 2024Generative AI
Meta Breaks Silence, March RAGness, Cloudless Skies at Cloud Next …: image 3 Prevent Costly GenAI Errors with Rigorous Output Evaluation — Here’s How
Output evaluation is the process through which the functionality and efficiency of AI-generated responses are rigorously assessed against a set of predefined criteria. It ensures that AI systems are not only technically proficient but also tailored to meet the nuanced demands of specific...

By Vish Khanna

May 22, 2024News/Events
Meta Breaks Silence, March RAGness, Cloudless Skies at Cloud Next …: image 4 Mannequin Medicine Makes Perfect, OpenAI’s Shifting Priorities, Google Search Goes Generative
AI Weekly Breakthroughs | Issue 11 | May 22, 2024 Welcome to AI Weekly Breakthroughs, a roundup of the news, technologies, and companies changing the way we work and live. Mannequin Medicine Makes Perfect Darlington College has introduced AI-powered mannequins to train its health and social care...

By Oksana Zdrok

Meta Breaks Silence, March RAGness, Cloudless Skies at Cloud Next …: image 5
The Definitive Guide to Improving Your Unstructured Data How to's, tips, and tactics for creating better LLM outputs