The AI Weekly Breakthrough | Issue 6 | April 17, 2024

Welcome to The AI Weekly Breakthrough, a roundup of the news, technologies, and companies changing the way we work and live

Meta Breaks Silence, Announces Llama 3

Midjourney depiction of lama sitting in a chair

Meta, relatively quiet as of late, breaks its silence this week to make two AI-related announcements. First, Meta announces that Llama 3, its new suite of next-gen foundation models, will be released within the next month. Llama 3 is expected to have about 140 billion parameters, twice as many as are available in the largest Llama 2 model. Users should expect a release similar to Anthopic’s Claude 3, which included three models that differed in size and cost. Second, Meta announced the next generation of their Meta Training and Inference Accelerator (MTIA), their family of custom-made chips designed for Meta’s AI workloads. This is something we’re seeing all the big players do – improve and elevate their AI infrastructure in order to make bigger and better performing AI models.

March RAGness Results Are In

RAG Leaderboard

Tonic.ai has published the results of their March RAG leaderboard that looks surprisingly similar to another March bracket with an acronym. In this competition, Tonic tested the top RAG systems against each other. Using their Tonic Validate answer similarity score, they evaluated the response quality of each RAG system on a set of benchmark Q&As. Cohere placed first with a score of 4.62 out of 5, followed by Haystack in second place at 4.40, and Langchain landing in the third spot with a 4.0 score. At the bottom of the list were Amazon Bedrock at 3.16 and OpenAI’s November 2023 API version at a mere 2.42.

Cloudless Skies at Google Cloud Next

Google Cloud Next

This year’s Cloud Next Conference, hosted by Google in Las Vegas, wasn’t so much about the cloud as it was about generative AI. The event showcased a suite of generative AI-powered tools across Google’s ecosystem, including Gemini Code Assist, Google’s answer to enhancing coding efficiency and quality, rivaling Github’s Copilot. Google also unveiled the ability to use generative text to create live images with Imagen 2, and its Vertex AI Agent Builder, which empowers developers to craft multimodal AI agents.Topping off the roster, Google announced that Gemini 1.5 Pro is in public preview on Vertex AI, accessible in over 180 countries and boasting an industry-leading one million token context window.

Rerank 3 Boosts Relevance, Lowers Costs

Rerank 3 from Cohere

Cohere, a leader in AI models specifically designed for enterprise generative AI projects, has released Rerank 3. Working with a RAG pipeline, Rerank 3 helps assess the relevancy of retrieved documents before the LLM processes the retrieved data. This improves the overall relevance and accuracy of the LLM’s response and, since the LLM only has to process a smaller set of relevant data, it brings the cost down of the entire generative AI system. Improvements in Rerank 3 include a larger 4K token context window length; the ability to search over multi-aspect and semi-structured data like emails, codes and tables; and multilingual coverage for 100+ countries.

X Releases Grok 1.5V

Grok-1.5 Vision

X announced the release of Grok 1.5V, its first-generation multimodal AI model. In addition to Grok’s text capabilities, it can now process visual information like documents, diagrams, screenshots, and photographs. According to X, Grok performs comparably with other frontier models like GPT-4V, Gemini 1.5 Pro and Claude 3 Opus, and outperforms them all in their newly defined “RealWorldQA” benchmark that measures real-world spatial understanding.

Augment Yourself 🤖

🔥 For more AI News brought to you via email subscribe to our newsletter here.
👀 Want to know more about Shelf’s suite of AI solutions? Check out our website here.