No, GPT-4 Doesn’t Get Worse Over Time (FUD Debunked)

There has been a lot of drama on Twitter about the new Stanford and UC Berkeley paper titled “How Is ChatGPT’s Behavior Changing over Time?” (source) The paper’s authors selectively provide examples where newer versions of GPT-4 seem to perform “worse” than older versions and make “formatting mistakes”. The first evaluation they provide is the … Read more

Llama 2: How Meta’s Free Open-Source LLM Beats GPT-4!

Meta (formerly Facebook) has released Llama 2, a new large language model (LLM) that is trained on 40% more data and has twice the context length of its predecessor, Llama. Llama 2 is open-source, so researchers and hobbyists can build their own applications on top of it. Llama 2 is trained on a … Read more

Microsoft Scales LLMs to a Mind-Boggling 1B (!) Token Context 🤯

The paper “LongNet: Scaling Transformers to 1,000,000,000 tokens” presents a machine learning breakthrough, particularly in handling and analyzing large amounts of text data. Simply put, this paper is about a new model called LongNet that can understand and analyze really long strings of text – up to 1 billion words or phrases, called “tokens,” at … Read more

Claude 2 LLM Reads Ten Papers in One Prompt with Massive 200k Token Context

The new Claude 2 model from AI research company Anthropic has demonstrated impressive new capabilities. In this quick article, I’ll give you a concise overview of what you need to know. Claude 2 Overview Anthropic’s latest prodigy, Claude 2, is making waves. This AI language model, part of the Claude series, is a … Read more

Those 4 OG Prompt Programming Tricks Still Work Today

I just read an influential paper, “Prompt Programming for Large Language Models: Beyond the Few-Shot Paradigm” by Laria Reynolds and Kyle McDonell, which introduced prompt programming, a predecessor of today’s multi-six-figure skill of prompt engineering. This short excursion into prompting history will be valuable for improving your prompting skills … Read more

Can ChatGPT Pass the US Medical Licensing Exam (USMLE)?

Executive Summary This article summarizes a recent research paper titled Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models to examine how ChatGPT performs in the USMLE, a comprehensive US-based medical examination. 🔗 Quote: “medical students often spend approximately 300–400 hours of dedicated study time in preparation for this exam” … Read more

Study Reveals GitHub Copilot Improves Developer Productivity by 55.8%

In a recent interaction with one of my Discord Mastermind Group members, I learned about an interesting paper from Microsoft Research, GitHub Research, and MIT School of Management titled “The Impact of AI on Developer Productivity: Evidence from GitHub Copilot”. In this short article, allow me to give you the TLDR. 👇 💡 TL;DR: The … Read more

I Read the World Economic Forum Future of Jobs Report 2023 And Wasn’t Impressed

The Future of Jobs Survey 2023 comprises responses from 803 companies across 27 industries and 45 economies, employing over 11.3 million workers. Key findings from the report are: The data source and report are available at the World Economic Forum website here. Let’s have a look at the most interesting graphs and charts from the … Read more

Cross-Species Cooperation: Uniting Humans and Embodied AI through English Language

I stumbled upon an interesting new MIT and IBM Watson AI Lab project titled Building Cooperative Embodied Agents Modularly with Large Language Models. AI Breaks Out Of Your Screen If you’re like me, you are not deep into AI research, so let’s start with the question: What are embodied agents anyway? 👨‍💻 Definition: Embodied agents … Read more

Bye Bye Elites 👋 ChatGPT Levels the Playing Field in Search Performance and UX vs Google Search

πŸ‘¨β€πŸ’» TLDR: Following OpenAI’s ChatGPT integration into Bing in February 2023, Bing’s traffic increased by 15.8%, while Google’s declined by 1%, a consequential shift considering each 1% change in search market share equates to $2 billion. A recent study compared user behaviors using Google’s keyword-based method and ChatGPT’s conversational approach, involving 95 participants. Despite ChatGPT’s … Read more

ChatGPT Breaks New Ground in Robotics! This New Microsoft Research Shows How

Microsoft Research has just released a cutting-edge study that offers groundbreaking insights into the use of OpenAI’s ChatGPT for robotics applications. Dubbed “ChatGPT for Robotics: Design Principles and Model Abilities,” the paper pioneers an innovative strategy that could revolutionize how we approach and interact with robotic tasks, platforms, and forms. The study uniquely converges two … Read more

AI-Assisted Coding: New Google Research Says How You Should Use It

As the landscape of technology and machine learning evolves, we’re witnessing a paradigm shift in the field of software development. The advent of AI-powered code-writing assistants, like GitHub’s Copilot, DeepMind’s AlphaCode, and Amazon’s CodeWhisperer, is revolutionizing the traditional practice of writing code and shaping new development workflows. This Finxter article based on a new … Read more