<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Open Source Archives - Be on the Right Side of Change</title>
	<atom:link href="https://blog.finxter.com/category/open-source/feed/" rel="self" type="application/rss+xml" />
	<link>https://blog.finxter.com/category/open-source/</link>
	<description></description>
	<lastBuildDate>Fri, 19 Jan 2024 17:56:50 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9.4</generator>

<image>
	<url>https://blog.finxter.com/wp-content/uploads/2020/08/cropped-cropped-finxter_nobackground-32x32.png</url>
	<title>Open Source Archives - Be on the Right Side of Change</title>
	<link>https://blog.finxter.com/category/open-source/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>What Is Autogen Studio? Building Multi-Agent Systems 101</title>
		<link>https://blog.finxter.com/what-is-autogen-studio-building-multi-agent-systems-101/</link>
		
		<dc:creator><![CDATA[Chris]]></dc:creator>
		<pubDate>Fri, 19 Jan 2024 13:29:54 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[ChatGPT]]></category>
		<category><![CDATA[Large Language Model (LLM)]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[OpenAI]]></category>
		<category><![CDATA[Prompt Engineering]]></category>
		<guid isPermaLink="false">https://blog.finxter.com/?p=1654381</guid>

					<description><![CDATA[<p>A single human being is a magical complex system capable of many things. However, the real fun starts if you align and orchestrate the effort of thousands of human beings behind a single purpose. One human can build a hut 🛖, 1000s of humans can build pyramids. The same holds for multi-agent AI systems of ... <a title="What Is Autogen Studio? Building Multi-Agent Systems 101" class="read-more" href="https://blog.finxter.com/what-is-autogen-studio-building-multi-agent-systems-101/" aria-label="Read more about What Is Autogen Studio? Building Multi-Agent Systems 101">Read more</a></p>
<p>The post <a href="https://blog.finxter.com/what-is-autogen-studio-building-multi-agent-systems-101/">What Is Autogen Studio? Building Multi-Agent Systems 101</a> appeared first on <a href="https://blog.finxter.com">Be on the Right Side of Change</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p>A single human being is a magical complex system capable of many things. However, the real fun starts if you align and orchestrate the effort of <em>thousands </em>of human beings behind a single purpose. </p>



<p><strong>One human can build a hut <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f6d6.png" alt="🛖" class="wp-smiley" style="height: 1em; max-height: 1em;" />, 1000s of humans can build pyramids.  </strong></p>



<p>The same holds for multi-agent AI systems built from LLMs. <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f447.png" alt="👇" class="wp-smiley" style="height: 1em; max-height: 1em;" /><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f447.png" alt="👇" class="wp-smiley" style="height: 1em; max-height: 1em;" /><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f447.png" alt="👇" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img fetchpriority="high" decoding="async" width="1024" height="732" src="https://blog.finxter.com/wp-content/uploads/2024/01/image-125-1024x732.png" alt="" class="wp-image-1654390" srcset="https://blog.finxter.com/wp-content/uploads/2024/01/image-125-1024x732.png 1024w, https://blog.finxter.com/wp-content/uploads/2024/01/image-125-300x214.png 300w, https://blog.finxter.com/wp-content/uploads/2024/01/image-125-768x549.png 768w, https://blog.finxter.com/wp-content/uploads/2024/01/image-125.png 1234w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption"><a href="https://www.youtube.com/watch?v=Cl19yWHhc2g" data-type="link" data-id="https://www.youtube.com/watch?v=Cl19yWHhc2g">source</a></figcaption></figure>
</div>


<p class="has-global-color-8-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f4a1.png" alt="💡" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Microsoft&#8217;s AutoGen Studio </strong>is a user-friendly AI development platform for rapid prototyping of multi-agent solutions. You can define and modify agents and multi-agent workflows through an interactive, easy-to-use, point-and-click, drag-and-drop interface. It&#8217;s a No-Code solution for easy AI development, even if you&#8217;re not a coding expert.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img decoding="async" width="498" height="213" src="https://blog.finxter.com/wp-content/uploads/2024/01/AgentSmithGIF.gif" alt="" class="wp-image-1654386"/></figure>
</div>


<p>In other words, you can compose collaborating agents into multi-agent systems, such as this research assistant:</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img decoding="async" width="957" height="459" src="https://blog.finxter.com/wp-content/uploads/2024/01/image-124.png" alt="" class="wp-image-1654389" srcset="https://blog.finxter.com/wp-content/uploads/2024/01/image-124.png 957w, https://blog.finxter.com/wp-content/uploads/2024/01/image-124-300x144.png 300w, https://blog.finxter.com/wp-content/uploads/2024/01/image-124-768x368.png 768w" sizes="(max-width: 957px) 100vw, 957px" /><figcaption class="wp-element-caption"><a href="https://microsoft.github.io/autogen/docs/getting-started" data-type="link" data-id="https://microsoft.github.io/autogen/docs/getting-started">source</a></figcaption></figure>
</div>

<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="674" src="https://blog.finxter.com/wp-content/uploads/2024/01/agent_autobuild-e48543a81e85bb185c7365db1290a91a-1024x674.png" alt="" class="wp-image-1654387" srcset="https://blog.finxter.com/wp-content/uploads/2024/01/agent_autobuild-e48543a81e85bb185c7365db1290a91a-1024x674.png 1024w, https://blog.finxter.com/wp-content/uploads/2024/01/agent_autobuild-e48543a81e85bb185c7365db1290a91a-300x197.png 300w, https://blog.finxter.com/wp-content/uploads/2024/01/agent_autobuild-e48543a81e85bb185c7365db1290a91a-768x505.png 768w, https://blog.finxter.com/wp-content/uploads/2024/01/agent_autobuild-e48543a81e85bb185c7365db1290a91a-1536x1011.png 1536w, https://blog.finxter.com/wp-content/uploads/2024/01/agent_autobuild-e48543a81e85bb185c7365db1290a91a-2048x1348.png 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption"><a href="https://microsoft.github.io/autogen/blog/2023/11/26/Agent-AutoBuild" data-type="link" data-id="https://microsoft.github.io/autogen/blog/2023/11/26/Agent-AutoBuild">source</a></figcaption></figure>
</div>


<p class="has-base-2-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f916.png" alt="🤖" class="wp-smiley" style="height: 1em; max-height: 1em;" /><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f916.png" alt="🤖" class="wp-smiley" style="height: 1em; max-height: 1em;" /><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f916.png" alt="🤖" class="wp-smiley" style="height: 1em; max-height: 1em;" />  <strong>Example Multi-Agent Systems (MAS)</strong>: Besides automating your daily work, MAS can operate at a much larger scale: in healthcare, they can streamline patient care by coordinating activities across hospital departments; in supply chain management, they can optimize logistics by aligning the actions of suppliers, transporters, and retailers; and in smart grid management, they can balance energy supply and demand, integrating diverse power sources and managing outages for a more sustainable energy ecosystem.</p>



<p>Here&#8217;s a screenshot of the AutoGen Studio user interface:</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="733" src="https://blog.finxter.com/wp-content/uploads/2024/01/autogenstudio_home-cce78dc150d1bb0073620754df73d863-1024x733.png" alt="" class="wp-image-1654383" srcset="https://blog.finxter.com/wp-content/uploads/2024/01/autogenstudio_home-cce78dc150d1bb0073620754df73d863-1024x733.png 1024w, https://blog.finxter.com/wp-content/uploads/2024/01/autogenstudio_home-cce78dc150d1bb0073620754df73d863-300x215.png 300w, https://blog.finxter.com/wp-content/uploads/2024/01/autogenstudio_home-cce78dc150d1bb0073620754df73d863-768x549.png 768w, https://blog.finxter.com/wp-content/uploads/2024/01/autogenstudio_home-cce78dc150d1bb0073620754df73d863-1536x1099.png 1536w, https://blog.finxter.com/wp-content/uploads/2024/01/autogenstudio_home-cce78dc150d1bb0073620754df73d863.png 1746w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption"><a href="https://microsoft.github.io/autogen/blog/2023/12/01/AutoGenStudio/#gallery" data-type="link" data-id="https://microsoft.github.io/autogen/blog/2023/12/01/AutoGenStudio/#gallery">source</a></figcaption></figure>
</div>


<p>The core capabilities of AutoGen Studio include:</p>



<ol class="wp-block-list">
<li><strong>Agent Workflow:</strong> Specify a set of agents that work together to accomplish a task. It could be as simple as a setup with two agents – a user proxy agent and an assistant agent – or more complex with multiple agents collaborating in a group chat to solve a problem.</li>
</ol>
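
<p>Conceptually, such a two-agent workflow is just a message loop between a proxy for the user and an assistant. The following plain-Python sketch illustrates the idea without the AutoGen library; all names and the <code>TERMINATE</code> convention are illustrative, not AutoGen Studio&#8217;s actual API:</p>

```python
# Toy sketch of the simplest two-agent workflow: a user proxy relays a
# task to an assistant and collects replies until the assistant signals
# completion. No AutoGen dependency; names are illustrative only.
class Agent:
    def __init__(self, name, respond):
        self.name = name
        self.respond = respond  # function: message -> reply


def run_workflow(user_proxy, assistant, task, max_turns=5):
    transcript = [(user_proxy.name, task)]
    message = task
    for _ in range(max_turns):
        reply = assistant.respond(message)
        transcript.append((assistant.name, reply))
        if "TERMINATE" in reply:  # AutoGen-style stop signal
            break
        message = user_proxy.respond(reply)
        transcript.append((user_proxy.name, message))
    return transcript


# Example run with canned responses standing in for an LLM:
assistant = Agent("assistant", lambda m: "Here is a plan. TERMINATE")
user_proxy = Agent("user_proxy", lambda m: "Looks good, continue.")
transcript = run_workflow(user_proxy, assistant, "Summarize this paper.")
```

<p>A group chat of multiple agents generalizes this loop: each turn, a manager picks which agent speaks next instead of always alternating between two.</p>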


<div class="wp-block-image">
<figure class="aligncenter size-large"><a href="https://hackernoon.com/beep-beep-bop-bop-how-to-deploy-multiple-ai-agents-using-local-llms"><img loading="lazy" decoding="async" width="1024" height="455" src="https://blog.finxter.com/wp-content/uploads/2024/01/image-122-1024x455.png" alt="" class="wp-image-1654385" srcset="https://blog.finxter.com/wp-content/uploads/2024/01/image-122-1024x455.png 1024w, https://blog.finxter.com/wp-content/uploads/2024/01/image-122-300x133.png 300w, https://blog.finxter.com/wp-content/uploads/2024/01/image-122-768x341.png 768w, https://blog.finxter.com/wp-content/uploads/2024/01/image-122.png 1441w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></a><figcaption class="wp-element-caption"><a href="https://hackernoon.com/beep-beep-bop-bop-how-to-deploy-multiple-ai-agents-using-local-llms" data-type="link" data-id="https://hackernoon.com/beep-beep-bop-bop-how-to-deploy-multiple-ai-agents-using-local-llms">source</a></figcaption></figure>
</div>


<ol class="wp-block-list" start="2">
<li><strong>Session:</strong> A session in AutoGen Studio refers to a period of interaction with an agent workflow. It includes the workflow configuration and interactions between the user and agents. These sessions can be published to a &#8220;<a href="https://microsoft.github.io/autogen/docs/Gallery/" data-type="link" data-id="https://microsoft.github.io/autogen/docs/Gallery/">gallery</a>&#8221; for sharing and reusing.</li>
</ol>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="991" height="1024" src="https://blog.finxter.com/wp-content/uploads/2024/01/image-120-991x1024.png" alt="" class="wp-image-1654382" srcset="https://blog.finxter.com/wp-content/uploads/2024/01/image-120-991x1024.png 991w, https://blog.finxter.com/wp-content/uploads/2024/01/image-120-290x300.png 290w, https://blog.finxter.com/wp-content/uploads/2024/01/image-120-768x794.png 768w, https://blog.finxter.com/wp-content/uploads/2024/01/image-120.png 1043w" sizes="auto, (max-width: 991px) 100vw, 991px" /><figcaption class="wp-element-caption"><a href="https://microsoft.github.io/autogen/docs/Gallery/" data-type="link" data-id="https://microsoft.github.io/autogen/docs/Gallery/">source</a></figcaption></figure>
</div>





<ol class="wp-block-list" start="3">
<li><strong>Skills:</strong> Skills in AutoGen Studio are essentially functions, like <a href="https://blog.finxter.com/python-cheat-sheet-functions-and-tricks/" data-type="post" data-id="449">Python functions</a>, that describe how to solve a task. These skills can be added to AutoGen Studio and are available to the assistant agent during task resolution.</li>
</ol>
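
<p>A skill, then, is nothing more than an ordinary Python function with type hints and a docstring that the assistant agent can call while resolving a task. Here is a minimal, hypothetical example (not one of the example skills shipped with AutoGen Studio):</p>

```python
# A hypothetical "skill" in the AutoGen Studio sense: a plain Python
# function with a docstring describing what it does, so an assistant
# agent can decide when to call it. Name and behavior are illustrative.
def word_count(text: str) -> dict:
    """Count how often each whitespace-separated word occurs in `text`."""
    counts = {}
    for word in text.lower().split():
        counts[word] = counts.get(word, 0) + 1
    return counts
```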



<p>The project is continuously being developed and refined, with future releases expected to include more complex agent workflows, improved user experience, expansion of agent skills, and community features for better sharing and collaboration within the AutoGen Studio user community.</p>



<p><strong><em>Think of the insane network effects of collectively working on a modular library of simple and complex agents and LLM-based capabilities!</em></strong> <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f92f.png" alt="🤯" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="784" height="253" src="https://blog.finxter.com/wp-content/uploads/2024/01/image-123.png" alt="" class="wp-image-1654388" srcset="https://blog.finxter.com/wp-content/uploads/2024/01/image-123.png 784w, https://blog.finxter.com/wp-content/uploads/2024/01/image-123-300x97.png 300w, https://blog.finxter.com/wp-content/uploads/2024/01/image-123-768x248.png 768w" sizes="auto, (max-width: 784px) 100vw, 784px" /><figcaption class="wp-element-caption"><a href="https://microsoft.github.io/autogen/blog/2023/10/18/RetrieveChat" data-type="link" data-id="https://microsoft.github.io/autogen/blog/2023/10/18/RetrieveChat">source</a></figcaption></figure>
</div>


<p>The AutoGen Studio comes with example skills like <code>fetch_profile</code>, <code>find_papers</code>, and <code>generate_images</code>. </p>



<p>AutoGen Studio can be installed with the Python package installer <code>pip</code>:</p>



<pre class="EnlighterJSRAW" data-enlighter-language="generic" data-enlighter-theme="" data-enlighter-highlight="" data-enlighter-linenumbers="" data-enlighter-lineoffset="" data-enlighter-title="" data-enlighter-group="">pip install autogenstudio</pre>



<p>It includes a web interface powered by an underlying Python API, which is both reusable and modular.</p>



<p>In terms of setup, AutoGen Studio recommends using a virtual environment for installation to avoid conflicts with existing Python packages. </p>



<p><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f449.png" alt="👉" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <a href="https://blog.finxter.com/python-virtual-environments-with-venv-a-step-by-step-guide/" data-type="link" data-id="https://blog.finxter.com/python-virtual-environments-with-venv-a-step-by-step-guide/">Python Virtual Environment “venv” Cheat Sheet</a></p>
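
<p>A minimal setup might look like this (the environment name <code>autogen-env</code> is just an example):</p>

```shell
# Create and activate an isolated virtual environment so AutoGen Studio's
# dependencies don't clash with your other Python packages:
python3 -m venv autogen-env
. autogen-env/bin/activate
pip install autogenstudio  # installs into autogen-env only
```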



<p>Once installed, launch the web UI on a port of your choice (for example, <code>autogenstudio ui --port 8081</code>) and start exploring its capabilities: defining and modifying agent workflows, interacting with agents and sessions, and expanding agent skills.</p>



<p><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f4f9.png" alt="📹" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Video</strong>: Here&#8217;s an example of a multi-agent system to build a research agent:</p>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="&quot;Research agent 3.0 - Build a group of AI researchers&quot; - Here is how" width="937" height="527" src="https://www.youtube.com/embed/AVInhYBUnKs?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</div></figure>



<p>To stay on the right side of change, consider joining our <a href="https://academy.finxter.com/university/openai-api-function-calls-and-embeddings/" data-type="link" data-id="https://academy.finxter.com/university/openai-api-function-calls-and-embeddings/">brand-new prompt engineering course</a> (updated 2024) with <strong>downloadable PDF certificate</strong> to prove your prompt engineering skills to potential clients and employers:</p>



<figure class="wp-block-image size-full"><a href="https://academy.finxter.com/"><img loading="lazy" decoding="async" width="918" height="263" src="https://blog.finxter.com/wp-content/uploads/2024/01/image-126.png" alt="" class="wp-image-1654395" srcset="https://blog.finxter.com/wp-content/uploads/2024/01/image-126.png 918w, https://blog.finxter.com/wp-content/uploads/2024/01/image-126-300x86.png 300w, https://blog.finxter.com/wp-content/uploads/2024/01/image-126-768x220.png 768w" sizes="auto, (max-width: 918px) 100vw, 918px" /></a></figure>
<p>The post <a href="https://blog.finxter.com/what-is-autogen-studio-building-multi-agent-systems-101/">What Is Autogen Studio? Building Multi-Agent Systems 101</a> appeared first on <a href="https://blog.finxter.com">Be on the Right Side of Change</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Code Llama 2: Examples and Playground (Try It Yourself)</title>
		<link>https://blog.finxter.com/code-llama-2-the-ultimate-guide-for-aspiring-developers/</link>
		
		<dc:creator><![CDATA[Chris]]></dc:creator>
		<pubDate>Thu, 09 Nov 2023 21:02:17 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Large Language Model (LLM)]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Programming]]></category>
		<category><![CDATA[Prompt Engineering]]></category>
		<category><![CDATA[Research]]></category>
		<guid isPermaLink="false">https://blog.finxter.com/?p=1652626</guid>

					<description><![CDATA[<p>Try It Yourself You can run the code llama 2 code completion model right here on the Finxter blog: If the embedding doesn&#8217;t work for some reason, check out this URL of the Huggingface space. Example Fibonacci I have asked Code Llama 2 to complete my code &#8220;def fibonacci(n)&#8221; and it did it flawlessly! See ... <a title="Code Llama 2: Examples and Playground (Try It Yourself)" class="read-more" href="https://blog.finxter.com/code-llama-2-the-ultimate-guide-for-aspiring-developers/" aria-label="Read more about Code Llama 2: Examples and Playground (Try It Yourself)">Read more</a></p>
<p>The post <a href="https://blog.finxter.com/code-llama-2-the-ultimate-guide-for-aspiring-developers/">Code Llama 2: Examples and Playground (Try It Yourself)</a> appeared first on <a href="https://blog.finxter.com">Be on the Right Side of Change</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<h2 class="wp-block-heading">Try It Yourself</h2>



<figure class="wp-block-image"><img decoding="async" src="https://koala.sh/api/image/v2-1wrrd-ap5cx.jpg?width=1216&amp;height=832&amp;dream" alt=""/></figure>



<p>You can run the Code Llama 2 code completion model right here on the Finxter blog:</p>



<iframe
	src="https://codellama-codellama-playground.hf.space"
	frameborder="0"
	width="850"
	height="450"
></iframe>



<p>If the embedding doesn&#8217;t work for some reason, <a href="https://codellama-codellama-playground.hf.space">open the Hugging Face Space directly</a>.</p>



<h2 class="wp-block-heading">Example Fibonacci</h2>



<p>I asked Code Llama 2 to complete my code &#8220;<code>def fibonacci(n)</code>&#8221; and it did so flawlessly! See the gif: <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f447.png" alt="👇" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="962" height="570" src="https://blog.finxter.com/wp-content/uploads/2023/11/llama2code.gif" alt="" class="wp-image-1652636"/></figure>
</div>
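
<p>For reference, a typical completion of that prompt is the classic recursive implementation, reconstructed below so you can test it locally (it may differ in detail from the model&#8217;s verbatim output):</p>

```python
# A plausible Code Llama 2 completion for the prompt "def fibonacci(n)".
# Reconstructed for local testing -- not necessarily the model's exact output.
def fibonacci(n):
    if n < 2:
        return n
    return fibonacci(n - 1) + fibonacci(n - 2)


print([fibonacci(i) for i in range(10)])  # [0, 1, 1, 2, 3, 5, 8, 13, 21, 34]
```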


<p>I tried the code and it worked in my example runs (proof by example <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f609.png" alt="😉" class="wp-smiley" style="height: 1em; max-height: 1em;" />):</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="589" src="https://blog.finxter.com/wp-content/uploads/2023/11/image-2-1024x589.png" alt="" class="wp-image-1652637" srcset="https://blog.finxter.com/wp-content/uploads/2023/11/image-2-1024x589.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/11/image-2-300x173.png 300w, https://blog.finxter.com/wp-content/uploads/2023/11/image-2-768x442.png 768w, https://blog.finxter.com/wp-content/uploads/2023/11/image-2.png 1497w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<h2 class="wp-block-heading">Understanding Code Llama 2</h2>



<p class="has-global-color-8-background-color has-background">Code Llama 2 is a state-of-the-art <a href="https://ai.meta.com/blog/code-llama-large-language-model-coding/">large language model</a> designed to work with code tasks. These models can <strong>generate code and natural language about code</strong> from code and natural language prompts. A tool like Code Llama 2 can make a huge difference in your productivity by assisting you in various programming tasks.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="823" height="628" src="https://blog.finxter.com/wp-content/uploads/2023/11/image-1.png" alt="" class="wp-image-1652633" srcset="https://blog.finxter.com/wp-content/uploads/2023/11/image-1.png 823w, https://blog.finxter.com/wp-content/uploads/2023/11/image-1-300x229.png 300w, https://blog.finxter.com/wp-content/uploads/2023/11/image-1-768x586.png 768w" sizes="auto, (max-width: 823px) 100vw, 823px" /><figcaption class="wp-element-caption"><a href="https://scontent-muc2-1.xx.fbcdn.net/v/t39.2365-6/369856151_1754812304950972_1159666448927483931_n.pdf?_nc_cat=107&amp;ccb=1-7&amp;_nc_sid=3c67a6&amp;_nc_ohc=wXAikUCzG6oAX-YRI5U&amp;_nc_ht=scontent-muc2-1.xx&amp;oh=00_AfDncNV0khQYZ3OSWZvWobUAkTmDnfTOPJkr_Ev0xNrCBw&amp;oe=6547A98F">source</a></figcaption></figure>
</div>


<p>By the way, feel free to watch our prompt engineering with Llama 2 video below or on the <a href="https://academy.finxter.com/university/prompt-engineering-with-llama-2/">Finxter Academy</a> with a downloadable course certificate.</p>



<figure class="wp-block-embed aligncenter is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="Prompt Engineering with Llama 2 (Full Course)" width="937" height="527" src="https://www.youtube.com/embed/v5A6qJ-O_pQ?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</div></figure>



<h3 class="wp-block-heading">Large Language Model</h3>



<p class="has-global-color-8-background-color has-background">Code Llama 2 is a powerful AI-driven <a href="https://huggingface.co/blog/codellama">large language model</a> designed to understand and generate code. It can extrapolate to context windows of up to 100k tokens, made possible by recent developments in RoPE scaling. As a developer, you can harness this state-of-the-art model to speed up your coding tasks, find solutions, and even autocomplete comments or general text.</p>



<p>Built on top of the foundational Llama 2 model, <a href="https://www.geeky-gadgets.com/how-to-use-code-llama/" target="_blank" rel="noreferrer noopener">Code Llama</a> is an advanced, code-specialized variant trained on code-specific datasets. This makes it an excellent tool for those working with programming languages, as it can generate code and natural language about code from both code and natural language prompts.</p>



<p>When using Code Llama 2, you can expect a <a href="https://www.linkedin.com/pulse/deciphering-llama2-code-unraveling-secrets-language-ai-ayoub-kirouane">knowledgeable</a> AI assistant that understands the intricacies of numerous programming languages and provides clear responses. The model&#8217;s architecture is defined by hyperparameters such as the hidden dimension, the number of layers and attention heads, the vocabulary size, and the normalization settings.</p>



<p>To get started with Code Llama 2, you can integrate it into your projects through the <a href="https://github.com/huggingface/blog/blob/main/codellama.md" target="_blank" rel="noreferrer noopener">Hugging Face ecosystem</a>. It has been released under a permissive community license, which means it is available for both research and commercial use.</p>



<h3 class="wp-block-heading">Pretrained Models</h3>



<p>A key aspect of Code Llama 2 is its foundation on <a href="https://huggingface.co/blog/codellama">pretrained models</a>. These models are fine-tuned on extensive datasets and have already learned relevant patterns, thus providing a solid base for further training in specific domains. Code Llama 2 consists of a family of specialized pretrained models that integrate seamlessly with the Hugging Face ecosystem.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="720" height="968" src="https://blog.finxter.com/wp-content/uploads/2023/11/369637790_316524720760615_7953912471035352291_n.jpg" alt="" class="wp-image-1652628" srcset="https://blog.finxter.com/wp-content/uploads/2023/11/369637790_316524720760615_7953912471035352291_n.jpg 720w, https://blog.finxter.com/wp-content/uploads/2023/11/369637790_316524720760615_7953912471035352291_n-223x300.jpg 223w" sizes="auto, (max-width: 720px) 100vw, 720px" /><figcaption class="wp-element-caption"><a href="https://ai.meta.com/blog/code-llama-large-language-model-coding/">source</a></figcaption></figure>
</div>


<p>Code Llama 2 comes in several sizes; the 13-billion-parameter variant offers a strong balance between performance on code-related tasks and computational cost, helping you achieve better results in less time.</p>






<h2 class="wp-block-heading">Applications and Performance</h2>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="How To Install Code Llama Locally - 7B, 13B, &amp; 34B Models! (LLAMA 2&#039;s NEW Coding LLM)" width="937" height="527" src="https://www.youtube.com/embed/OQKmYxsvp9g?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</div></figure>



<h3 class="wp-block-heading">Code Completion</h3>



<p>Code Llama 2 is designed to provide state-of-the-art performance in code completion tasks. With its deep understanding of various programming languages, including <a href="https://llama-2.ai/code-llama-model-card/">Python</a>, you can expect accurate and helpful code suggestions as you type. Its advanced capabilities make it an invaluable tool for developers to increase productivity and write efficient code.</p>



<h3 class="wp-block-heading">Model Weights</h3>



<p>The performance of Code Llama 2 largely depends on its <a href="https://www.intel.com/content/www/us/en/developer/articles/technical/accelerate-llama2-ai-hardware-sw-optimizations.html">model weights</a>. These weights are responsible for the model&#8217;s accuracy and efficiency. Comparing different model sizes, such as Llama 2 7B and Llama 2 13B, you will notice that their latency per token varies. The choice of model weight will influence your code completion experience, with larger models generally providing more accurate results at the expense of increased computational demands.</p>



<p><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f517.png" alt="🔗" class="wp-smiley" style="height: 1em; max-height: 1em;" /> You can download Meta&#8217;s initial model weights <a href="https://ai.meta.com/llama/">here</a> to get started. Fill out <a href="https://ai.meta.com/resources/models-and-libraries/llama-downloads/">this form</a>:</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="426" src="https://blog.finxter.com/wp-content/uploads/2023/11/image-1024x426.png" alt="" class="wp-image-1652630" srcset="https://blog.finxter.com/wp-content/uploads/2023/11/image-1024x426.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/11/image-300x125.png 300w, https://blog.finxter.com/wp-content/uploads/2023/11/image-768x319.png 768w, https://blog.finxter.com/wp-content/uploads/2023/11/image.png 1482w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<p>Check the &#8220;Code Llama&#8221; box at the bottom of the form to get the weights. The Code Llama 2 GitHub repository is available <a href="https://github.com/facebookresearch/codellama">here</a>.</p>



<p class="has-base-2-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f9d1-200d-1f4bb.png" alt="🧑‍💻" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Learn More</strong>: Feel free to explore the <a href="https://academy.finxter.com/university/prompt-engineering-with-llama-2/">Finxter Academy&#8217;s course</a> that utilizes Llama 2 for prompt engineering, giving you a hands-on experience with this powerful tool in various practical projects.</p>



<h2 class="wp-block-heading">Programming Languages</h2>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="701" src="https://blog.finxter.com/wp-content/uploads/2023/11/image-1024x701.jpeg" alt="" class="wp-image-1652631" srcset="https://blog.finxter.com/wp-content/uploads/2023/11/image-1024x701.jpeg 1024w, https://blog.finxter.com/wp-content/uploads/2023/11/image-300x205.jpeg 300w, https://blog.finxter.com/wp-content/uploads/2023/11/image-768x525.jpeg 768w, https://blog.finxter.com/wp-content/uploads/2023/11/image.jpeg 1216w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<p>Code Llama 2 supports various popular programming languages such as:</p>



<ul class="wp-block-list">
<li><strong>Python</strong>: A versatile and beginner-friendly language, Python is widely used for web development, automation, and data analysis.</li>



<li><strong>Java</strong>: Known for its portability and scalability, Java is a go-to choice for building large-scale enterprise applications.</li>



<li><strong>JavaScript</strong>: As a cornerstone of web development, JavaScript allows you to create interactive and responsive web applications.</li>



<li><strong>C++</strong>: This high-performance language is ideal for system programming and performance-critical tasks, including game development.</li>



<li><strong>C#</strong>: A language designed for the Microsoft .NET framework, C# is often employed to create Windows applications and games using Unity.</li>



<li><strong>TypeScript</strong>: As a superset of JavaScript, TypeScript provides additional features and static typing for more robust and maintainable code.</li>



<li><strong>PHP</strong>: This server-side scripting language is mainly used for web development and is the backbone of many popular content management systems like WordPress.</li>



<li><strong>Bash</strong>: Employed primarily for scripting in UNIX-based systems, Bash allows you to automate tasks and control various system functions.</li>
</ul>



<p>Code Llama 2 actively embraces the open-source community. It has been <a href="https://ai.meta.com/llama/">made available for free</a> for research and commercial use, enabling developers to access and utilize its capabilities in various projects. </p>



<h2 class="wp-block-heading">Technical Insights into Llama 2</h2>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="Fascinating Insights: Unveiling Meta Llama 2" width="937" height="527" src="https://www.youtube.com/embed/hH0jiaNqUnA?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</div></figure>



<h3 class="wp-block-heading">Fine-Tuning</h3>



<p>Llama 2 is an advanced language model released as a family of <a href="https://llama-2.ai/llama-2-training/">pretrained and fine-tuned models</a> designed for various applications. </p>



<p class="has-base-2-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f4a1.png" alt="💡" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Fine-tuning</strong> adapts the model to specific tasks or domains. For instance, Llama 2-Chat is a fine-tuned variant aimed at dialogue applications. Through fine-tuning, you can access models tailored for different use cases, such as coding and text analysis.</p>



<p>To fine-tune Llama 2, focus on specific data relevant to your target task. Quality datasets and training procedures enhance the model&#8217;s performance and assist in addressing unique challenges in your domain.</p>
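<p>As a sketch of that data-preparation step, here is how instruction/response pairs might be rendered into the Llama 2 chat prompt template before fine-tuning. The <code>[INST]</code> and <code>&lt;&lt;SYS&gt;&gt;</code> markers follow the format published for Llama 2-Chat; the function name and example data below are illustrative, not from this article.</p>

```python
# Hedged sketch: formatting training pairs into the Llama 2 chat template.
# The [INST]/<<SYS>> markers follow the format published for Llama 2-Chat;
# names and example data here are illustrative.
def to_llama2_prompt(system: str, instruction: str, response: str) -> str:
    return (
        f"<s>[INST] <<SYS>>\n{system}\n<</SYS>>\n\n"
        f"{instruction} [/INST] {response} </s>"
    )

sample = to_llama2_prompt(
    "You are a helpful coding assistant.",
    "Write a Python one-liner that reverses a string s.",
    "s[::-1]",
)
```

<p>Each row of your fine-tuning dataset would pass through a template like this before being tokenized and handed to the trainer.</p>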



<h3 class="wp-block-heading">Parameters and Tokens</h3>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="701" src="https://blog.finxter.com/wp-content/uploads/2023/11/image-13-1024x701.jpeg" alt="" class="wp-image-1652807" srcset="https://blog.finxter.com/wp-content/uploads/2023/11/image-13-1024x701.jpeg 1024w, https://blog.finxter.com/wp-content/uploads/2023/11/image-13-300x205.jpeg 300w, https://blog.finxter.com/wp-content/uploads/2023/11/image-13-768x525.jpeg 768w, https://blog.finxter.com/wp-content/uploads/2023/11/image-13.jpeg 1216w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p>With parameter counts ranging from <a href="https://llama-2.ai/llama-2-training/">7 billion to 70 billion</a>, Llama 2&#8217;s models are designed to handle complex language tasks. The high number of parameters provides a holistic understanding of human language by analyzing word combinations, grammar, and context.</p>



<p>An essential aspect of Llama 2 is how it handles tokens. The model offers a <a href="https://thenaturehero.com/llama-2-license/">context length of 4096 tokens</a>, and some fine-tuned variants extend this to contexts of up to 100k tokens, enabling it to process larger chunks of text and track the context across them. This longer context window helps the model generate more coherent and contextually accurate responses.</p>
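<p>The effect of a fixed context window can be illustrated in a few lines of Python. A real deployment would count tokens with the model&#8217;s own tokenizer (SentencePiece, in Llama 2&#8217;s case); the whitespace split below is only a stand-in for demonstration.</p>

```python
# Illustration only: truncate input to fit a model's context window.
# A whitespace split stands in for real tokenization here.
def truncate_to_context(text: str, max_tokens: int = 4096) -> str:
    tokens = text.split()
    if len(tokens) <= max_tokens:
        return text
    # Keep the most recent tokens, since they usually carry the active context.
    return " ".join(tokens[-max_tokens:])

long_text = " ".join(f"tok{i}" for i in range(5000))
print(len(truncate_to_context(long_text).split()))  # 4096
```

<p>Anything beyond the window is simply invisible to the model, which is why a 4096-token (or larger) context makes such a practical difference.</p>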






<h2 class="wp-block-heading">Key Features of Code Llama</h2>



<figure class="wp-block-image"><img decoding="async" src="https://koala.sh/api/image/v2-1devg-c8evm.jpg?width=1216&amp;height=832&amp;dream" alt=""/></figure>



<h3 class="wp-block-heading">Network Parameters</h3>



<p>Code Llama is a code-specialized version of Llama 2 that comes with enhanced coding capabilities, <a href="https://ai.meta.com/blog/code-llama-large-language-model-coding/">built on top of Llama 2</a>. One essential aspect of this model is its network parameters. These parameters determine the model&#8217;s architecture and significantly influence its performance. By optimizing these parameters, Code Llama can better understand and generate code.</p>



<h3 class="wp-block-heading">Pretrained Weights</h3>



<p>The pretrained weights of Code Llama stem from its integration with Llama 2. Code Llama benefits from the <a href="https://huggingface.co/blog/codellama">same permissive community license as Llama 2</a>, providing users with access to cutting-edge technology for coding tasks. The pretrained weights serve as a foundation for Code Llama, enabling it to quickly adapt to specific coding use cases.</p>



<h3 class="wp-block-heading">Model Sizes</h3>



<p>When it comes to Code Llama, there are several model sizes to choose from, including <strong>7B</strong>, <strong>13B</strong>, and <strong>34B</strong> versions. Each model size is tailored to different requirements ranging from memory efficiency to coding proficiency. For instance, the <a href="https://llama-2.ai/code-llama-model-card/">Llama-7B</a> version is built on 500B tokens during the initial phase, resulting in a lighter model that still performs well on coding tasks.</p>



<h3 class="wp-block-heading">The Llama-7B</h3>



<p>The Llama-7B is a smaller variant of the Code Llama model trained on a 500-billion-token dataset. Despite its smaller size, this model maintains impressive capabilities while offering memory-efficient performance. The Llama-7B provides users with a practical option for incorporating Code Llama into projects with resource constraints or lower complexity requirements.</p>



<h2 class="wp-block-heading">Coding with Code Llama 2</h2>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="How To Install LLaMA 2 Locally + Full Test (13b Better Than 70b??)" width="937" height="527" src="https://www.youtube.com/embed/k2FHUP0krqg?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</div></figure>



<h3 class="wp-block-heading">Java and Code Llama 2</h3>



<p>If you are working with Java, Code Llama 2 can be a great addition to your toolkit. This AI coding assistant provides impressive capabilities in handling Java code, assisting with tasks such as code completion, error detection, and infilling. You&#8217;ll appreciate the clarity and accuracy it brings to your Java projects. For example, you can leverage Code Llama 2 in managing complex data structures, streamlining your code, and even learning new Java concepts.</p>
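<p>Infilling deserves a closer look. According to the Hugging Face write-up on Code Llama, the 7B and 13B models mark the fill-in-the-middle region with special <code>&lt;PRE&gt;</code>, <code>&lt;SUF&gt;</code>, and <code>&lt;MID&gt;</code> sentinel tokens. Here is a hedged sketch of building such a prompt around a Java snippet; treat the exact spacing as an assumption and verify it against the documentation.</p>

```python
# Hedged sketch of Code Llama's fill-in-the-middle prompt format.
# The sentinel layout follows the Hugging Face Code Llama write-up;
# the exact spacing is an assumption to verify against the docs.
def build_infill_prompt(prefix: str, suffix: str) -> str:
    return f"<PRE> {prefix} <SUF>{suffix} <MID>"

prompt = build_infill_prompt(
    "public int add(int a, int b) {\n    return ",
    ";\n}",
)
```

<p>The model is then asked to generate the tokens that belong between the prefix and suffix, e.g. <code>a + b</code> in this example.</p>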



<h3 class="wp-block-heading">Python and Code Llama 2</h3>



<p>Python developers, rejoice! Code Llama 2 is here to enhance your coding experience. It has remarkable proficiency in the Python language, making it a valuable resource for code completion, debugging, and suggesting best practices. It also excels at handling complex Python libraries and large input contexts. With Code Llama 2 at your side, you can optimize your code, explore new approaches to problem-solving, and learn the language more effectively.</p>



<h3 class="wp-block-heading">Bash and Code Llama 2</h3>



<p>Bash scripting can be made more enjoyable and efficient with Code Llama 2. This intelligent assistant has a good grasp of Bash syntax and semantics, empowering you to create and maintain high-quality Bash scripts. You can rely on its abilities for code completion, error detection, and optimization of your scripts. Whether you need to automate tasks on your Linux system or develop complex Bash solutions, Code Llama 2 is here to help you achieve your goals.</p>



<h2 class="wp-block-heading">The Impact of Large Language Models</h2>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="How Large Language Models Work" width="937" height="527" src="https://www.youtube.com/embed/5sLYAQS9sWQ?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</div></figure>



<h3 class="wp-block-heading">Performance</h3>



<p>Large language models like <a href="https://ai.meta.com/blog/code-llama-large-language-model-coding/">Code Llama 2</a> have significantly influenced the field of artificial intelligence. These models can generate code and understand natural language prompts more efficiently than their predecessors. With a massive number of parameters, they can outperform models like <a href="https://blog.finxter.com/10-high-iq-things-gpt-4-can-do-that-gpt-3-5-cant/" data-type="post" data-id="1257087">GPT-3</a> in most Natural Language Processing (NLP) benchmarks. As a result, you can expect a higher level of accuracy and assistance with coding, data analysis, and natural language understanding.</p>



<h3 class="wp-block-heading">Open Source</h3>



<p>Another critical aspect to consider is the open-source nature of these models. For example, <a href="https://ai.meta.com/llama/" target="_blank" rel="noreferrer noopener">Llama 2</a> is free for research and commercial use, fostering innovation and enabling widespread access to state-of-the-art AI technologies. By offering such powerful models openly, developers like you can build more advanced applications, engage in collaborative research, and have a wider pool of resources to learn from. This accessibility is crucial for the progression and more equitable distribution of technology in the AI industry.</p>



<h2 class="wp-block-heading">Community and Code Llama 2</h2>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="701" src="https://blog.finxter.com/wp-content/uploads/2023/11/image-15-1024x701.jpeg" alt="" class="wp-image-1652809" srcset="https://blog.finxter.com/wp-content/uploads/2023/11/image-15-1024x701.jpeg 1024w, https://blog.finxter.com/wp-content/uploads/2023/11/image-15-300x205.jpeg 300w, https://blog.finxter.com/wp-content/uploads/2023/11/image-15-768x525.jpeg 768w, https://blog.finxter.com/wp-content/uploads/2023/11/image-15.jpeg 1216w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<h3 class="wp-block-heading">Community License</h3>



<p>Code Llama 2 is an impressive advancement in the world of AI coding. To encourage its widespread use and adoption, it has been made available under a <a href="https://llama-2.ai/llama-code/">community license</a>. This means that you can use Code Llama 2 for both personal and commercial purposes under the terms of that license. By opting for a permissive community license, its developers aim to foster innovation and collaboration in the AI community.</p>



<h3 class="wp-block-heading">GitHub Repository</h3>



<p>Besides the community license, Code Llama 2&#8217;s <a href="https://github.com/huggingface/blog/blob/main/codellama.md">GitHub repository</a> is another essential aspect of its open-source nature. Here, you can access various resources related to the project, such as model cards, license information, and more. </p>



<p>The repository is a valuable knowledge base for developers, researchers, and enthusiasts looking to integrate Code Llama 2 into their projects or explore its capabilities. By making the project accessible via GitHub, the developers have ensured ease of collaboration and continuous improvement for Code Llama 2.</p>



<h2 class="wp-block-heading">Code Llama 2 in the Industry</h2>



<figure class="wp-block-image"><img decoding="async" src="https://koala.sh/api/image/v2-1dey0-keotg.jpg?width=1216&amp;height=832&amp;dream" alt=""/></figure>



<h3 class="wp-block-heading">Commercial Uses</h3>



<p>Code Llama 2, an enhanced version of the <a href="https://ai.meta.com/llama/" target="_blank" rel="noreferrer noopener">open-access Llama 2</a>, is a valuable asset in the industry due to its specialization in code tasks. Companies can utilize it for a wide range of purposes, as it comes with the same permissive community license as Llama 2, allowing for <a href="https://huggingface.co/blog/codellama">commercial use</a>. It offers various capabilities, such as generating code, providing insights about code in natural language, and assisting developers in multiple programming languages.</p>



<h3 class="wp-block-heading">Performance Evaluations</h3>



<p>To ensure its effectiveness, Code Llama 2 has undergone several performance evaluations. It&#8217;s built upon the foundation of <a href="https://ai.meta.com/llama/">Llama 2</a>, which itself boasts an impressive training set of 2 trillion tokens. Code Llama 2 further augments its coding proficiency by extending its training on dedicated coding datasets (<a href="https://llama-2.ai/llama-code/">Llama Code</a>). As a result, you can trust it to be a state-of-the-art language model that delivers reliable performance in various coding scenarios.</p>



<h2 class="wp-block-heading">Responsible Use of Code Llama 2</h2>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="701" src="https://blog.finxter.com/wp-content/uploads/2023/11/image-16-1024x701.jpeg" alt="" class="wp-image-1652810" srcset="https://blog.finxter.com/wp-content/uploads/2023/11/image-16-1024x701.jpeg 1024w, https://blog.finxter.com/wp-content/uploads/2023/11/image-16-300x205.jpeg 300w, https://blog.finxter.com/wp-content/uploads/2023/11/image-16-768x525.jpeg 768w, https://blog.finxter.com/wp-content/uploads/2023/11/image-16.jpeg 1216w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<h3 class="wp-block-heading">Security Considerations</h3>



<p>When using Code Llama 2, it&#8217;s crucial to keep security in mind. First, understand that the code generated by the model can contain potential vulnerabilities. To protect your applications, always review and sanitize the generated code before deploying it in a production environment. </p>



<p>Furthermore, be vigilant while sharing information with the model, as sensitive data could be inadvertently incorporated into the generated output. Regularly update your security protocols and practices to ensure you&#8217;re making the most responsible use of Code Llama 2.</p>
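<p>One lightweight, automatable check is to scan generated Python for obviously risky calls before it even reaches a human reviewer. The sketch below uses the standard-library <code>ast</code> module; the list of flagged names is illustrative, and no such scan replaces a proper security review.</p>

```python
# Minimal sketch: flag risky calls (eval/exec/os.system) in generated Python.
# Illustrative only; extend the flag list and always follow up with review.
import ast

RISKY_CALLS = {"eval", "exec", "system"}

def flag_risky_calls(source: str) -> list:
    findings = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Call):
            # Handle plain names (eval) and attribute calls (os.system).
            name = getattr(node.func, "id", getattr(node.func, "attr", None))
            if name in RISKY_CALLS:
                findings.append(f"line {node.lineno}: call to {name}()")
    return findings

print(flag_risky_calls("import os\nos.system('rm -rf /tmp/x')"))
# ["line 2: call to system()"]
```

<p>Wiring a check like this into your CI pipeline gives you a first line of defense before the manual review described above.</p>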



<h3 class="wp-block-heading">Risk Management</h3>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="701" src="https://blog.finxter.com/wp-content/uploads/2023/11/image-17-1024x701.jpeg" alt="" class="wp-image-1652811" srcset="https://blog.finxter.com/wp-content/uploads/2023/11/image-17-1024x701.jpeg 1024w, https://blog.finxter.com/wp-content/uploads/2023/11/image-17-300x205.jpeg 300w, https://blog.finxter.com/wp-content/uploads/2023/11/image-17-768x525.jpeg 768w, https://blog.finxter.com/wp-content/uploads/2023/11/image-17.jpeg 1216w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<p>Effective risk management is crucial to the responsible use of Code Llama 2. Begin by assessing the potential risks associated with the generated code and weigh them against the benefits you expect to gain. It&#8217;s essential to have a plan in place to handle unexpected issues or security vulnerabilities discovered in the generated code.</p>



<ul class="wp-block-list">
<li><strong>Monitor and review:</strong> Regularly monitor the code generated by Code Llama 2, review it for any potential issues, and address them in a timely manner.</li>



<li><strong>Educate your team:</strong> Ensure that your team members are well-versed in the responsible use of AI-powered large language models like Code Llama 2.</li>



<li><strong>Establish guidelines and policies:</strong> Develop clear guidelines and policies for using Code Llama 2 within your organization, including best practices for code review and security considerations.</li>



<li><strong>Make informed decisions:</strong> Stay up-to-date with the latest advancements, research, and updates related to Code Llama 2, and make informed decisions about its use in your projects.</li>
</ul>



<p>By following these best practices, you can better manage the risks associated with using Code Llama 2, and make the most of its potential in a responsible manner.</p>






<h2 class="wp-block-heading">Frequently Asked Questions</h2>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="701" src="https://blog.finxter.com/wp-content/uploads/2023/11/image-14-1024x701.jpeg" alt="" class="wp-image-1652808" srcset="https://blog.finxter.com/wp-content/uploads/2023/11/image-14-1024x701.jpeg 1024w, https://blog.finxter.com/wp-content/uploads/2023/11/image-14-300x205.jpeg 300w, https://blog.finxter.com/wp-content/uploads/2023/11/image-14-768x525.jpeg 768w, https://blog.finxter.com/wp-content/uploads/2023/11/image-14.jpeg 1216w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<h3 class="wp-block-heading">What are the main features of Code Llama 2?</h3>



<p>Code Llama 2 is a specialized version of Llama 2, focused on code-related tasks. It has been designed to integrate easily with the Hugging Face ecosystem and benefits from the same permissive community license as Llama 2, making it available for commercial use. It&#8217;s capable of understanding and generating code across numerous languages and domains.</p>



<h3 class="wp-block-heading">How does Code Llama 2 compare to other AI models?</h3>



<p>Compared to other AI models, Code Llama 2 provides more advanced capabilities for handling code-related tasks, thanks to its specific training on code datasets. While direct comparisons between AI models may depend on the specific use case, Code Llama 2&#8217;s focus on code positions it as a state-of-the-art tool for developers and programming enthusiasts.</p>



<h3 class="wp-block-heading">Where can I find examples and tutorials for Code Llama 2?</h3>



<p>Since Code Llama 2 is integrated with the Hugging Face ecosystem, you can find examples and tutorials related to Llama 2, which can be adapted for Code Llama, on the <a href="https://huggingface.co/blog/codellama">Hugging Face website</a> and their <a href="https://github.com/huggingface/blog/blob/main/codellama.md">GitHub page</a>. You can also find information on getting started with Llama 2 at websites like <a href="https://www.analyticsvidhya.com/blog/2023/08/getting-started-with-llama-2/">Analytics Vidhya</a> or <a href="https://portkey.ai/blog/llama-2/">portkey.ai</a>.</p>



<h3 class="wp-block-heading">How can I contribute to the Code Llama 2 GitHub repository?</h3>



<p>To contribute to the Code Llama 2 GitHub repository, visit the <a href="https://github.com/facebookresearch/llama" target="_blank" rel="noreferrer noopener">official Llama 2 GitHub page</a> and follow the project&#8217;s guidelines. By participating in the project through issues, pull requests, and discussions, you can share your knowledge and contribute to the ongoing development and improvement of the model.</p>



<h3 class="wp-block-heading">What are the Python libraries required for using Code Llama 2?</h3>



<p>To use Code Llama 2 in Python, you&#8217;ll need the Hugging Face <code>transformers</code> library, which can be installed with <code>pip install transformers</code>. This library offers extensive support for Llama 2 models, allowing you to load Code Llama 2 and run inference with it.</p>
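<p>Here is a minimal sketch of loading a Code Llama checkpoint through <code>transformers</code>. It assumes <code>pip install transformers torch</code> and access to the <code>codellama/CodeLlama-7b-hf</code> checkpoint on the Hugging Face Hub; the model name and generation settings are illustrative, not prescriptive.</p>

```python
# Hedged sketch: generate code with a Code Llama checkpoint via transformers.
# Assumes `pip install transformers torch` and Hub access to the checkpoint;
# the model name and generation settings below are illustrative.
def generate_code(prompt: str,
                  model_name: str = "codellama/CodeLlama-7b-hf",
                  max_new_tokens: int = 128) -> str:
    # Imports live inside the function so the sketch can be defined
    # even on machines where transformers/torch are not installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name)
    inputs = tokenizer(prompt, return_tensors="pt")
    outputs = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(outputs[0], skip_special_tokens=True)
```

<p>Calling <code>generate_code("def fibonacci(n):")</code> downloads the weights on first use, so expect a sizable download and real GPU or RAM requirements.</p>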



<h3 class="wp-block-heading">Is there a playground or interactive environment for Code Llama 2?</h3>



<p>As Code Llama 2 is integrated with the Hugging Face ecosystem, you can expect to find an interactive environment for Code Llama 2 on their platform. Usually, Hugging Face provides playgrounds for popular models, where you can test their capabilities directly from your browser. Additionally, you may find interactive notebooks and examples on the <a href="https://github.com/huggingface/blog/blob/main/codellama.md" target="_blank" rel="noreferrer noopener">Hugging Face GitHub page</a>.</p>



<p class="has-global-color-8-background-color has-background">Thanks for reading the article, go ahead and play with the Code Llama 2 interpreter at the beginning of this article! <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f9d1-200d-1f4bb.png" alt="🧑‍💻" class="wp-smiley" style="height: 1em; max-height: 1em;" /> </p>



<h2 class="wp-block-heading">Prompt Engineering with Llama 2</h2>



<p class="has-global-color-8-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f4a1.png" alt="💡" class="wp-smiley" style="height: 1em; max-height: 1em;" /> The <strong><a href="https://academy.finxter.com/university/prompt-engineering-with-llama-2/">Llama 2 Prompt Engineering course</a></strong> helps you stay on the right side of change. Our course is meticulously designed to provide you with <em>hands-on experience through genuine projects</em>.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><a href="https://academy.finxter.com/university/prompt-engineering-with-llama-2/" target="_blank" rel="noreferrer noopener"><img loading="lazy" decoding="async" width="919" height="261" src="https://blog.finxter.com/wp-content/uploads/2023/09/image-101.png" alt="" class="wp-image-1651689" srcset="https://blog.finxter.com/wp-content/uploads/2023/09/image-101.png 919w, https://blog.finxter.com/wp-content/uploads/2023/09/image-101-300x85.png 300w, https://blog.finxter.com/wp-content/uploads/2023/09/image-101-768x218.png 768w" sizes="auto, (max-width: 919px) 100vw, 919px" /></a></figure>
</div>


<p>You&#8217;ll delve into practical applications such as book PDF querying, payroll auditing, and hotel review analytics. These aren&#8217;t just theoretical exercises; they&#8217;re real-world challenges that businesses face daily.</p>



<p>By studying these projects, you&#8217;ll gain a deeper comprehension of how to harness the power of Llama 2 using <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f40d.png" alt="🐍" class="wp-smiley" style="height: 1em; max-height: 1em;" /> Python, <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f517.png" alt="🔗" class="wp-smiley" style="height: 1em; max-height: 1em;" /><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f99c.png" alt="🦜" class="wp-smiley" style="height: 1em; max-height: 1em;" /> Langchain, <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f332.png" alt="🌲" class="wp-smiley" style="height: 1em; max-height: 1em;" /> Pinecone, and a whole stack of highly <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/2692.png" alt="⚒" class="wp-smiley" style="height: 1em; max-height: 1em;" /><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f6e0.png" alt="🛠" class="wp-smiley" style="height: 1em; max-height: 1em;" /> practical tools of exponential coders in a post-ChatGPT world.</p>
<p>The post <a href="https://blog.finxter.com/code-llama-2-the-ultimate-guide-for-aspiring-developers/">Code Llama 2: Examples and Playground (Try It Yourself)</a> appeared first on <a href="https://blog.finxter.com">Be on the Right Side of Change</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>State of AI Report 2023: 163 Pages Reduced to 17 Visuals (Executive Summary)</title>
		<link>https://blog.finxter.com/state-of-ai-report-2023/</link>
		
		<dc:creator><![CDATA[Chris]]></dc:creator>
		<pubDate>Tue, 31 Oct 2023 16:32:03 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Large Language Model (LLM)]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[OpenAI]]></category>
		<category><![CDATA[Prompt Engineering]]></category>
		<category><![CDATA[Research]]></category>
		<guid isPermaLink="false">https://blog.finxter.com/?p=1652576</guid>

					<description><![CDATA[<p>The State of AI Report 2023 encapsulates vital AI advancements in research, industry, and politics. It underscores GPT-4&#8217;s triumph, NVIDIA&#8217;s ascension to a $1T market cap due to GPU demand, and the proliferating open-source AI community with 32M LLaMa model downloads on Hugging Face in a month. While ChatGPT emerges as a developer favorite, GenAI&#8217;s ... <a title="State of AI Report 2023: 163 Pages Reduced to 17 Visuals (Executive Summary)" class="read-more" href="https://blog.finxter.com/state-of-ai-report-2023/" aria-label="Read more about State of AI Report 2023: 163 Pages Reduced to 17 Visuals (Executive Summary)">Read more</a></p>
<p>The post <a href="https://blog.finxter.com/state-of-ai-report-2023/">State of AI Report 2023: 163 Pages Reduced to 17 Visuals (Executive Summary)</a> appeared first on <a href="https://blog.finxter.com">Be on the Right Side of Change</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p class="has-global-color-8-background-color has-background">The State of AI Report 2023 encapsulates vital AI advancements in research, industry, and politics. <br><br>It underscores GPT-4&#8217;s triumph, NVIDIA&#8217;s ascension to a $1T market cap due to GPU demand, and the proliferating open-source AI community with 32M LLaMa model downloads on Hugging Face in a month. <br><br>While ChatGPT emerges as a developer favorite, GenAI&#8217;s shaky retention rates hint at potential adoption hurdles. <br><br>The report also delves into LLMs&#8217; impact on traditional industries like pharma and defense, as well as healthcare and robotics (e.g., <a href="https://blog.finxter.com/googles-rt-2-enables-robots-to-learn-from-youtube-videos/">Google RT-2</a>), alongside geopolitical implications exemplified by the AI-driven chip wars and evolving global governance discussions. <br><br>The narrative concludes with a set of foresighted predictions for the AI landscape.</p>



<p>Before we start, let&#8217;s have a look at one of the last slides of the report &#8212; <strong>predictions</strong>! <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f989.png" alt="🦉" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="512" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-228-1024x512.png" alt="" class="wp-image-1652621" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-228-1024x512.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/10/image-228-300x150.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-228-768x384.png 768w, https://blog.finxter.com/wp-content/uploads/2023/10/image-228.png 1074w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption"><em>Most graphics have <a href="https://www.stateof.ai/">the report</a> as a source</em></figcaption></figure>
</div>


<p>The <a href="https://www.stateof.ai/">State of AI Report 2023</a> provides a comprehensive analysis of the current landscape of artificial intelligence (AI) in various domains, including research, industry, safety, and politics. In its sixth year, the report aims to distill the key themes and ideas that have emerged in the AI ecosystem.</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="574" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-212-1024x574.png" alt="" class="wp-image-1652600" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-212-1024x574.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/10/image-212-300x168.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-212-768x430.png 768w, https://blog.finxter.com/wp-content/uploads/2023/10/image-212.png 1101w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<p>One of the notable developments in 2023 was the emergence of <a href="https://blog.finxter.com/the-evolution-of-large-language-models-llms-insights-from-gpt-4-and-beyond/">Large Language Models (LLMs)</a> as a dominant force in the AI field. OpenAI&#8217;s <a href="https://blog.finxter.com/20-ways-to-make-money-with-gpt-4/">GPT-4</a>, in particular, garnered significant attention by surpassing other LLMs in both classic AI benchmarks and human-designed exams. This achievement showcased the remarkable capabilities of LLMs and their potential impact on various applications.</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="572" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-213-1024x572.png" alt="" class="wp-image-1652601" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-213-1024x572.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/10/image-213-300x168.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-213-768x429.png 768w, https://blog.finxter.com/wp-content/uploads/2023/10/image-213.png 1095w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<p>However, the report also highlights a shift away from openness in AI research, driven by concerns related to safety and competition. While OpenAI published a limited technical report for GPT-4, other companies like Google and Anthropic provided even less information for their respective models. </p>



<p>Nonetheless, Meta AI and other organizations are actively contributing to the open-source community by releasing competitive LLMs that can match the capabilities of previous models like GPT-3.5.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="574" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-215-1024x574.png" alt="" class="wp-image-1652603" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-215-1024x574.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/10/image-215-300x168.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-215-768x430.png 768w, https://blog.finxter.com/wp-content/uploads/2023/10/image-215.png 1092w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p>Open-source AI models such as <a href="https://blog.finxter.com/llama-vs-llama-2-a-quick-guide-to-catch-up/">Llama and Llama 2</a> have gained significant traction, as evidenced by the increasing downloads and model submissions on platforms like Hugging Face. The popularity of these models reflects the vibrant and collaborative nature of the AI community.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="1009" height="541" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-94-1.png" alt="" class="wp-image-1652604" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-94-1.png 1009w, https://blog.finxter.com/wp-content/uploads/2023/10/image-94-1-300x161.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-94-1-768x412.png 768w" sizes="auto, (max-width: 1009px) 100vw, 1009px" /></figure>
</div>


<p>I have written a blog tutorial explaining the graceful scaling properties of Llama 2 and other large language models:</p>



<p><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f9d1-200d-1f4bb.png" alt="🧑‍💻" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Recommended</strong>: <a href="https://blog.finxter.com/llama-vs-llama-2-a-quick-guide-to-catch-up/">Llama vs Llama 2 – Still No Sign of Saturation!</a></p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="572" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-216-1024x572.png" alt="" class="wp-image-1652605" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-216-1024x572.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/10/image-216-300x168.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-216-768x429.png 768w, https://blog.finxter.com/wp-content/uploads/2023/10/image-216.png 1097w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<p>In addition to LLMs, researchers have explored the potential of small language models trained on specialized datasets. </p>



<p>Surprisingly, these smaller models have demonstrated the ability to rival much larger competitors, suggesting the importance of dataset quality in AI training.</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="576" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-214-1024x576.png" alt="" class="wp-image-1652602" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-214-1024x576.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/10/image-214-300x169.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-214-768x432.png 768w, https://blog.finxter.com/wp-content/uploads/2023/10/image-214.png 1090w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<p>The demand for computational power in AI research has significantly benefited hardware companies like NVIDIA. Their GPUs have become a staple in AI research, used far more extensively in papers than alternative chips. Even older GPU models, like the V100 released in 2017, continue to be widely utilized in AI research.</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="510" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-224-1024x510.png" alt="" class="wp-image-1652617" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-224-1024x510.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/10/image-224-300x149.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-224-768x382.png 768w, https://blog.finxter.com/wp-content/uploads/2023/10/image-224.png 1089w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<p>The competition among chip manufacturers has led to the development of special, sanctions-compliant chips for the Chinese market. Companies like NVIDIA, Intel, and AMD are adapting to geopolitical considerations and tailoring their offerings to meet the specific needs of their Chinese customers.</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="510" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-225-1024x510.png" alt="" class="wp-image-1652618" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-225-1024x510.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/10/image-225-300x149.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-225-768x382.png 768w, https://blog.finxter.com/wp-content/uploads/2023/10/image-225.png 1087w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<p>Furthermore, the report highlights the rapid growth of ChatGPT, which has gained enormous popularity among developers. This AI-powered tool has become a go-to resource for developers seeking coding solutions, gradually displacing traditional platforms like Stack Overflow.</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="511" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-226-1024x511.png" alt="" class="wp-image-1652619" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-226-1024x511.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/10/image-226-300x150.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-226-768x383.png 768w, https://blog.finxter.com/wp-content/uploads/2023/10/image-226.png 1094w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>

<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="509" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-227-1024x509.png" alt="" class="wp-image-1652620" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-227-1024x509.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/10/image-227-300x149.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-227-768x382.png 768w, https://blog.finxter.com/wp-content/uploads/2023/10/image-227.png 1084w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<p>Here&#8217;s an interesting visual showing the most popular generative AI applications by use case: <strong><em>Medicine grows fastest!</em></strong></p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="512" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-223-1024x512.png" alt="" class="wp-image-1652615" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-223-1024x512.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/10/image-223-300x150.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-223-768x384.png 768w, https://blog.finxter.com/wp-content/uploads/2023/10/image-223.png 1087w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<p>Overall, the State of AI Report 2023 provides valuable insights into the advancements, challenges, and trends shaping the AI landscape. It offers a glimpse into the dynamic nature of AI research, the evolving industry landscape, and the importance of safety, governance, and open collaboration in the AI community. </p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="513" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-221-1024x513.png" alt="" class="wp-image-1652611" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-221-1024x513.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/10/image-221-300x150.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-221-768x385.png 768w, https://blog.finxter.com/wp-content/uploads/2023/10/image-221.png 1092w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<p>It even shows how LLMs impact the robotics industry &#8212; check out our <a href="https://blog.finxter.com/googles-rt-2-enables-robots-to-learn-from-youtube-videos/">Finxter article</a> for more:</p>



<figure class="wp-block-image size-large"><a href="https://blog.finxter.com/googles-rt-2-enables-robots-to-learn-from-youtube-videos/"><img loading="lazy" decoding="async" width="1024" height="574" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-418-1024x574.png" alt="" class="wp-image-1652612" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-418-1024x574.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/10/image-418-300x168.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-418-768x430.png 768w, https://blog.finxter.com/wp-content/uploads/2023/10/image-418.png 1217w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></a></figure>



<p>The report also includes an interesting chart examining the popularity of LLM terms such as <strong>Reinforcement Learning from Human Feedback (RLHF)</strong>: </p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="512" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-218-1024x512.png" alt="" class="wp-image-1652607" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-218-1024x512.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/10/image-218-300x150.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-218-768x384.png 768w, https://blog.finxter.com/wp-content/uploads/2023/10/image-218.png 1093w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<p>I also found this slide interesting: it discusses the <a href="https://blog.finxter.com/microsoft-scales-llms-to-a-mind-boggling-1b-token-context-%f0%9f%a4%af/">context size</a> of several LLMs, one of the metrics users appreciate most. Unfortunately, long context windows don&#8217;t work well &#8220;in the middle&#8221;, where models tend to overlook information:</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="518" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-219-1024x518.png" alt="" class="wp-image-1652608" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-219-1024x518.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/10/image-219-300x152.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-219-768x388.png 768w, https://blog.finxter.com/wp-content/uploads/2023/10/image-219.png 1092w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<h2 class="wp-block-heading">Frequently Asked Questions</h2>



<h3 class="wp-block-heading">Top Generative AI Applications in 2023</h3>



<p>Generative AI is a subset of artificial intelligence that involves using machine learning models to generate new data. In 2023, the top generative AI applications include image and video generation, language translation, and music and art creation. These applications are being used in a variety of industries, including entertainment, marketing, and e-commerce.</p>



<p>Here&#8217;s the slide that shows how the <a href="https://blog.finxter.com/deep-dive-google-med-palm-2-generative-ai-in-healthcare/">Google Model Med-PaLM 2</a> disrupts medicine!</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="506" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-222-1024x506.png" alt="" class="wp-image-1652614" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-222-1024x506.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/10/image-222-300x148.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-222-768x379.png 768w, https://blog.finxter.com/wp-content/uploads/2023/10/image-222.png 1089w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<h3 class="wp-block-heading">Key Findings in the McKinsey State of AI Report</h3>



<p>The McKinsey State of AI Report for 2023 highlights the explosive growth of generative AI tools. The report found that one-third of survey respondents say their organizations are using generative AI regularly in at least one business function. The report also found that businesses increasingly invest in AI talent and infrastructure to support AI initiatives.</p>



<p><strong>If you want to be on the right side of change, master this technology now!</strong></p>



<h3 class="wp-block-heading">Predictions for AI Adoption in the USA by 2023</h3>



<p>According to a report by PwC, the adoption of AI in the United States is expected to increase significantly by 2023. The report predicts that AI will contribute $15.7 trillion to the global economy by 2030, with the United States being one of the biggest beneficiaries. The report also predicts that AI will create millions of new jobs in the United States, particularly in the healthcare and education sectors.</p>



<h3 class="wp-block-heading">Key Takeaways from the Deloitte State of AI Report</h3>



<p>The Deloitte State of AI Report for 2023 highlights the importance of ethical considerations in the development and deployment of AI. The report found that businesses are increasingly focused on building AI systems that are transparent, explainable, and accountable. The report also found that businesses are investing in AI talent and infrastructure to support AI initiatives.</p>



<p>Prompt engineering is shown to significantly impact the quality of responses:</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="512" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-220-1024x512.png" alt="" class="wp-image-1652610" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-220-1024x512.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/10/image-220-300x150.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-220-768x384.png 768w, https://blog.finxter.com/wp-content/uploads/2023/10/image-220.png 1097w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>





<h3 class="wp-block-heading">Current State of Artificial Intelligence in 2023</h3>



<p>In 2023, artificial intelligence is being used in a wide range of industries, including <a href="https://blog.finxter.com/will-gpt-4-save-millions-in-healthcare-radiologists-use-fine-tuned-llms-to-replace-their-jobs/">healthcare</a>, finance, and retail. AI is being used to automate routine tasks, improve efficiency, and provide new insights and opportunities for businesses.</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><a href="https://blog.finxter.com/20-ways-to-make-money-with-gpt-4/"><img loading="lazy" decoding="async" width="1024" height="568" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-110-2-1024x568.png" alt="" class="wp-image-1652599" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-110-2-1024x568.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/10/image-110-2-300x166.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-110-2-768x426.png 768w, https://blog.finxter.com/wp-content/uploads/2023/10/image-110-2.png 1219w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></a></figure>
</div>


<p class="has-base-2-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f9d1-200d-1f4bb.png" alt="🧑‍💻" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Recommended</strong>: <a href="https://blog.finxter.com/20-ways-to-make-money-with-gpt-4/">20 Ways to Make Money with GPT-4</a></p>



<p>Also make sure to check out our suite of prompt engineering courses on the <a href="https://academy.finxter.com/">Finxter Academy</a> like this one:</p>



<h2 class="wp-block-heading">Prompt Engineering with Llama 2</h2>



<p class="has-global-color-8-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f4a1.png" alt="💡" class="wp-smiley" style="height: 1em; max-height: 1em;" /> The <strong><a href="https://academy.finxter.com/university/prompt-engineering-with-llama-2/">Llama 2 Prompt Engineering course</a></strong> helps you stay on the right side of change. Our course is meticulously designed to provide you with <em>hands-on experience through genuine projects</em>.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><a href="https://academy.finxter.com/university/prompt-engineering-with-llama-2/" target="_blank" rel="noreferrer noopener"><img loading="lazy" decoding="async" width="919" height="261" src="https://blog.finxter.com/wp-content/uploads/2023/09/image-101.png" alt="" class="wp-image-1651689" srcset="https://blog.finxter.com/wp-content/uploads/2023/09/image-101.png 919w, https://blog.finxter.com/wp-content/uploads/2023/09/image-101-300x85.png 300w, https://blog.finxter.com/wp-content/uploads/2023/09/image-101-768x218.png 768w" sizes="auto, (max-width: 919px) 100vw, 919px" /></a></figure>
</div>


<p>You&#8217;ll delve into practical applications such as book PDF querying, payroll auditing, and hotel review analytics. These aren&#8217;t just theoretical exercises; they&#8217;re real-world challenges that businesses face daily.</p>



<p>By studying these projects, you&#8217;ll gain a deeper comprehension of how to harness the power of Llama 2 using <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f40d.png" alt="🐍" class="wp-smiley" style="height: 1em; max-height: 1em;" /> Python, <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f517.png" alt="🔗" class="wp-smiley" style="height: 1em; max-height: 1em;" /><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f99c.png" alt="🦜" class="wp-smiley" style="height: 1em; max-height: 1em;" /> Langchain, <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f332.png" alt="🌲" class="wp-smiley" style="height: 1em; max-height: 1em;" /> Pinecone, and a whole stack of highly <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/2692.png" alt="⚒" class="wp-smiley" style="height: 1em; max-height: 1em;" /><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f6e0.png" alt="🛠" class="wp-smiley" style="height: 1em; max-height: 1em;" /> practical tools of exponential coders in a post-ChatGPT world.</p>
<p>The post <a href="https://blog.finxter.com/state-of-ai-report-2023/">State of AI Report 2023: 163 Pages Reduced to 17 Visuals (Executive Summary)</a> appeared first on <a href="https://blog.finxter.com">Be on the Right Side of Change</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>BitVM &#8211; Smart Contracts on Bitcoin Without Hard Fork</title>
		<link>https://blog.finxter.com/bitvm-smart-contracts-on-bitcoin-without-hard-fork/</link>
		
		<dc:creator><![CDATA[Jean Rousseau]]></dc:creator>
		<pubDate>Tue, 10 Oct 2023 15:12:36 +0000</pubDate>
				<category><![CDATA[Bitcoin]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Research]]></category>
		<guid isPermaLink="false">https://blog.finxter.com/?p=1652117</guid>

					<description><![CDATA[<p>🧑‍💻 TLDR: The BitVM whitepaper by Bitcoin developer Robin Linus introduces a method to implement Ethereum-like smart contracts on Bitcoin without a hard fork. BitVM proposes a system where contract logic is executed off-chain but verified on Bitcoin, similar to Ethereum&#8217;s optimistic rollups. BitVM enables Turing-complete Bitcoin contracts. The architecture employs fraud proofs and a ... <a title="BitVM &#8211; Smart Contracts on Bitcoin Without Hard Fork" class="read-more" href="https://blog.finxter.com/bitvm-smart-contracts-on-bitcoin-without-hard-fork/" aria-label="Read more about BitVM &#8211; Smart Contracts on Bitcoin Without Hard Fork">Read more</a></p>
<p>The post <a href="https://blog.finxter.com/bitvm-smart-contracts-on-bitcoin-without-hard-fork/">BitVM &#8211; Smart Contracts on Bitcoin Without Hard Fork</a> appeared first on <a href="https://blog.finxter.com">Be on the Right Side of Change</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p class="has-global-color-8-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f9d1-200d-1f4bb.png" alt="🧑‍💻" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>TLDR</strong>: The BitVM whitepaper by Bitcoin developer <em>Robin Linus</em> introduces a method to implement Ethereum-like <strong>smart contracts on Bitcoin without a hard fork</strong>. BitVM proposes a system where contract logic is executed off-chain but verified on Bitcoin, similar to Ethereum&#8217;s optimistic rollups. <strong>BitVM enables Turing-complete Bitcoin contracts</strong>. The architecture employs fraud proofs and a challenge-response model, and while it opens up new application possibilities, it also requires substantial off-chain computation and communication.</p>



<p>Here&#8217;s the most interesting quote (in my opinion) from the <a href="https://bitvm.org/bitvm.pdf">whitepaper</a>:</p>



<p class="has-base-2-background-color has-background"><strong>Built like a computer by stacking NAND gates</strong>: <em>&#8220;Any computable function can be represented as a Boolean circuit. The NAND gate is a universal logic gate, so any Boolean function can be composed from them. To keep our model simple, we show that our method works for simple NAND gates. Additionally, we show how to compose gates arbitrarily. Together this demonstrates BitVM can express any circuit.&#8221;</em></p>



<figure class="wp-block-image size-full"><img loading="lazy" decoding="async" width="863" height="348" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-62.png" alt="" class="wp-image-1652125" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-62.png 863w, https://blog.finxter.com/wp-content/uploads/2023/10/image-62-300x121.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-62-768x310.png 768w" sizes="auto, (max-width: 863px) 100vw, 863px" /></figure>
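<p>To make the universality claim concrete, here is a minimal Python sketch (my own illustration, not code from the whitepaper) showing how NOT, AND, OR, and XOR can all be composed from NAND alone:</p>

```python
def nand(a: int, b: int) -> int:
    """Universal NAND gate over bits: returns 0 only when both inputs are 1."""
    return 1 - (a & b)

# Every other Boolean gate, composed purely from NAND:
def not_(a):    return nand(a, a)
def and_(a, b): return nand(nand(a, b), nand(a, b))
def or_(a, b):  return nand(not_(a), not_(b))
def xor_(a, b):
    n = nand(a, b)
    return nand(nand(a, n), nand(b, n))

# Sanity-check the full truth tables against Python's bit operators.
for a in (0, 1):
    for b in (0, 1):
        assert and_(a, b) == (a & b)
        assert or_(a, b) == (a | b)
        assert xor_(a, b) == (a ^ b)
```

<p>Since any Boolean circuit can be built this way, a system that can verify the execution of individual NAND gates and wire them together can, in principle, verify any computable function.</p>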



<h2 class="wp-block-heading">Quick Overview</h2>



<p>The BitVM whitepaper, introduced by Bitcoin developer Robin Linus of ZeroSync, proposes a novel method to bring Ethereum-like smart contracts to Bitcoin without necessitating a hard fork. Unveiled on October 9, 2023, BitVM seeks to enable <a href="https://blog.finxter.com/are-python-one-liners-turing-complete/">Turing-complete</a> Bitcoin contracts without modifying <a href="https://blog.finxter.com/common-consensus-mechanisms-in-blockchains/">Bitcoin&#8217;s consensus</a> rules. The system allows any computable function to be verified on Bitcoin, with the &#8220;logic&#8221; of contracts executed off-chain, while verification occurs on Bitcoin, akin to Ethereum&#8217;s optimistic rollups.</p>



<p class="has-base-2-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f9d1-200d-1f4bb.png" alt="🧑‍💻" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Recommended</strong>: <a href="https://blog.finxter.com/what-is-a-zkevm-rollup-a-simplified-guide-to-ethereums-most-promising-scaling-solution/">What is a zkEVM Rollup? A Simplified Guide to Ethereum’s Most Promising Scaling Solution</a></p>



<p>BitVM&#8217;s architecture leans on <strong>fraud proofs</strong> and a <strong>challenge-response model</strong>.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="949" height="653" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-55.png" alt="" class="wp-image-1652118" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-55.png 949w, https://blog.finxter.com/wp-content/uploads/2023/10/image-55-300x206.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-55-768x528.png 768w" sizes="auto, (max-width: 949px) 100vw, 949px" /><figcaption class="wp-element-caption"><strong>Source</strong>: <a href="https://bitvm.org/bitvm.pdf">Whitepaper</a></figcaption></figure>
</div>


<p>Here, a &#8220;prover&#8221; makes claims, and a &#8220;verifier&#8221; can run a fraud proof to penalize the prover if a claim turns out to be false. </p>
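<p>The following Python sketch illustrates the commit-and-challenge idea in miniature (a hypothetical simplification for intuition, not BitVM&#8217;s actual Bitcoin Script mechanism): the prover evaluates a NAND circuit off-chain and commits to every wire value with a salted hash; the verifier then challenges a single gate, and the prover must open the three relevant commitments so the gate can be re-executed and checked:</p>

```python
import hashlib
import secrets

def nand(a: int, b: int) -> int:
    return 1 - (a & b)

def commit(bit: int, salt: bytes) -> str:
    # Simple salted hash commitment to one wire value.
    return hashlib.sha256(salt + bytes([bit])).hexdigest()

def prover_run(inputs: dict, gates: list):
    """Evaluate the circuit off-chain; commit to every wire value.

    `gates` is a list of (in_wire_a, in_wire_b, out_wire) NAND gates.
    """
    wires = dict(inputs)
    for a, b, out in gates:
        wires[out] = nand(wires[a], wires[b])
    salts = {w: secrets.token_bytes(16) for w in wires}
    commitments = {w: commit(v, salts[w]) for w, v in wires.items()}
    return wires, salts, commitments

def verifier_challenge(gate, openings, commitments) -> bool:
    """Check one challenged gate; `openings` maps wire -> (bit, salt)."""
    a, b, out = gate
    for w in (a, b, out):
        bit, salt = openings[w]
        if commit(bit, salt) != commitments[w]:
            return False  # invalid opening: the prover is penalized
    # Re-execute just this one gate and compare with the claimed output.
    return nand(openings[a][0], openings[b][0]) == openings[out][0]
```

<p>Note that the verifier only ever re-executes one gate at a time, which is why a dispute can stay small on-chain even when the full circuit is large.</p>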



<p><strong>Applications</strong>: Linus highlighted that while Bitcoin is currently limited to basic operations like signatures, timelocks, and hashlocks, BitVM could expand its capabilities, enabling a variety of applications, such as </p>



<ul class="wp-block-list">
<li>games (Chess, Go, Poker), </li>



<li>verification of validity proofs in Bitcoin contracts, </li>



<li>bridging BTC to other chains, </li>



<li>building prediction markets, and </li>



<li>emulating novel opcodes.</li>
</ul>



<p>However, Linus acknowledged that the model has limitations, such as being confined to a two-party setting (a prover and a verifier) and necessitating a significant amount of off-chain computation and communication to execute programs. The next milestone involves fully implementing BitVM and Tree++, a high-level programming language designed for writing and debugging Bitcoin contracts. </p>



<p class="has-global-color-8-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f4a1.png" alt="💡" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Info</strong>: BitVM is facilitated by the Taproot soft fork, which was implemented in November 2021. Taproot is a technological upgrade (soft fork) to Bitcoin, enhancing its privacy and efficiency. As shown in this article, users can now create &#8220;smart contracts,&#8221; which are sets of rules for how bitcoins are spent in a way that they appear like regular transactions to outside observers. This can improve privacy by masking complex transactions. Additionally, Taproot enables more complex transaction conditions (like multi-signature requirements) while maintaining a streamlined, efficient process, which is beneficial for scalability and transaction speed on the Bitcoin network.</p>



<h2 class="wp-block-heading">Community Response</h2>



<p>The new computing paradigm <a href="https://bitvm.org/bitvm.pdf">BitVM whitepaper</a> has garnered varied responses from the Bitcoin community.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="1024" height="1024" src="https://blog.finxter.com/wp-content/uploads/2023/10/F8FhYX2XUAA3zVS.jpg" alt="" class="wp-image-1652129" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/F8FhYX2XUAA3zVS.jpg 1024w, https://blog.finxter.com/wp-content/uploads/2023/10/F8FhYX2XUAA3zVS-300x300.jpg 300w, https://blog.finxter.com/wp-content/uploads/2023/10/F8FhYX2XUAA3zVS-150x150.jpg 150w, https://blog.finxter.com/wp-content/uploads/2023/10/F8FhYX2XUAA3zVS-768x768.jpg 768w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Source: <a href="https://twitter.com/Spirit_Satoshi/status/1711759141587313013/photo/2">Spirit_Satoshi</a></figcaption></figure>
</div>


<p>The whitepaper, meticulously reviewed by Super Testnet and Sam Parker, has garnered a whopping 2.4 million views and dives deep into a system that combines Optimistic Roll Up, Fraud Proof, Taproot Leaf, and Bitcoin Script, aiming to bring additional programmability to Bitcoin without necessitating an upgrade.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="878" height="964" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-65.png" alt="" class="wp-image-1652131" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-65.png 878w, https://blog.finxter.com/wp-content/uploads/2023/10/image-65-273x300.png 273w, https://blog.finxter.com/wp-content/uploads/2023/10/image-65-768x843.png 768w" sizes="auto, (max-width: 878px) 100vw, 878px" /></figure>
</div>


<p>While some, like Bitcoiner Eric Wall and analyst Dylan LeClair, expressed excitement and appreciation for the paper, others, like Bitcoin Core contributor Adam Back, urged caution, noting that while the development is cool, it essentially generalizes a two-party game.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="703" height="344" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-66.png" alt="" class="wp-image-1652132" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-66.png 703w, https://blog.finxter.com/wp-content/uploads/2023/10/image-66-300x147.png 300w" sizes="auto, (max-width: 703px) 100vw, 703px" /></figure>
</div>


<p>Super Testnet, on the other hand, hailed it as <strong><em>&#8220;the most exciting discovery in the history of bitcoin script&#8221;</em></strong>, emphasizing its potential to enable covenants, sidechains, and powers akin to Liquid or the EVM without requiring forks.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="705" height="868" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-57.png" alt="" class="wp-image-1652120" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-57.png 705w, https://blog.finxter.com/wp-content/uploads/2023/10/image-57-244x300.png 244w" sizes="auto, (max-width: 705px) 100vw, 705px" /><figcaption class="wp-element-caption"><a href="https://twitter.com/super_testnet">source</a></figcaption></figure>
</div>


<p>A proof-of-concept is already available on <a href="https://github.com/supertestnet/tapleaf-circuits/">GitHub</a>, while others attempted to allay fears among Bitcoin maximalists by emphasizing that BitVM won&#8217;t compel Bitcoins to be &#8220;locked&#8221; into these contracts since it is opt-in.</p>



<p>Some community members have suggested that implementing CheckTemplateVerify (CTV) could enhance BitVM’s efficiency and functionality.</p>



<h2 class="wp-block-heading">Key Takeaways</h2>



<p>Is BitVM theoretically possible? Yes. Is it feasible? Maybe. </p>



<p>Let&#8217;s have a look at what <a href="https://twitter.com/BobBodily/status/1711581484254192013">Bob Bodily, PhD</a> says about its feasibility:</p>



<ul class="wp-block-list">
<li><strong>Not a Panacea</strong>: BitVM isn’t a one-size-fits-all solution.</li>



<li><strong>Comparison with EVM</strong>: It&#8217;s slower, more expensive, and more complex than <a href="https://blog.finxter.com/mastering-the-ethereum-virtual-machine-evm-for-solidity-smart-contracts/">Ethereum&#8217;s EVM</a>. However, this may be a feature rather than a bug: given the well-established view that Bitcoin is more decentralized than Ethereum, BitVM is arguably the most decentralized smart contract solution.</li>



<li><strong>Core Benefit</strong>: Enables additional programmability on Bitcoin without requiring new <code>op_codes</code> or a soft fork. This makes it highly practicable, and the conservative Bitcoin community is therefore more likely to adopt it.</li>



<li><strong>Use Cases</strong>: Potential applications include decentralizing various parts of applications that currently depend on centralized services.</li>



<li><strong>Trustless Bridging</strong>: BitVM doesn’t solve trustless bridging for sidechains.</li>



<li><strong>Comparison with Previous Models</strong>: It&#8217;s strictly better than Greg Maxwell’s 2016 ZKP contingent payments example.</li>



<li><strong>Complexity</strong>: BitVM is intricate to understand and implement.</li>
</ul>



<p>BitVM’s mission, as per the whitepaper, is to ensure “any computable function can be verified on Bitcoin.&#8221; </p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="694" height="144" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-56.png" alt="" class="wp-image-1652119" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-56.png 694w, https://blog.finxter.com/wp-content/uploads/2023/10/image-56-300x62.png 300w" sizes="auto, (max-width: 694px) 100vw, 694px" /><figcaption class="wp-element-caption"><a href="https://twitter.com/zkchesterton/status/1711421528300982414">source</a></figcaption></figure>
</div>


<p>Sam Parker elucidates that while Bitcoin isn’t technically more Turing Complete than before, it has been endowed with a runtime that makes it “Turing complete enough” for any realistically executable program, given you have the financial means, bandwidth, and capability to perform as many Bitcoin transactions as needed.</p>



<p>So to summarize, these are the three main key takeaways (for now), but we should stay updated on the developments &#8212; feel free to <a href="https://blog.finxter.com/email-academy/">subscribe to my newsletter</a> if you haven&#8217;t already:</p>



<ol class="wp-block-list">
<li><strong>Intriguing but Preliminary</strong>: BitVM, while fascinating and potentially groundbreaking, is still in its infancy with numerous aspects yet to be explored and validated.</li>



<li><strong>Immediate Applicability</strong>: Certain use cases, such as Decentralized Finance (DeFi) oracles, might start leveraging BitVM immediately to minimize trust assumptions.</li>



<li><strong>A Welcome Addition</strong>: The effort to enhance Bitcoin’s programmability is commendable and the more developers that engage with and build upon BitVM, solving tangible problems, the better.</li>
</ol>



<p>In conclusion, BitVM has certainly stirred the Bitcoin community, offering a fresh perspective on enhancing Bitcoin’s programmability. Whether it will stand the test of time and practical application remains to be seen, but it undeniably opens up a new chapter exploring the extents and limits of Bitcoin’s scripting capabilities.</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><a href="https://blog.finxter.com/bitcoin-whitepaper-cheat-sheet/" target="_blank" rel="noreferrer noopener"><img loading="lazy" decoding="async" width="1024" height="573" src="https://blog.finxter.com/wp-content/uploads/2023/06/image-28-1024x573.png" alt="" class="wp-image-1413082" srcset="https://blog.finxter.com/wp-content/uploads/2023/06/image-28-1024x573.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/06/image-28-300x168.png 300w, https://blog.finxter.com/wp-content/uploads/2023/06/image-28-768x430.png 768w, https://blog.finxter.com/wp-content/uploads/2023/06/image-28.png 1238w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></a></figure>
</div>


<p class="has-base-2-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f4a1.png" alt="💡" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Recommended</strong>: <a href="https://blog.finxter.com/bitcoin-whitepaper-cheat-sheet/" data-type="URL" data-id="https://blog.finxter.com/bitcoin-whitepaper-cheat-sheet/" target="_blank" rel="noreferrer noopener">Bitcoin Whitepaper Cheat Sheet (PDF Download)</a></p>



<p>Still here? Okay, let&#8217;s review some technical details:</p>



<h2 class="wp-block-heading">Technical Details from the Whitepaper</h2>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="980" height="832" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-58.png" alt="" class="wp-image-1652121" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-58.png 980w, https://blog.finxter.com/wp-content/uploads/2023/10/image-58-300x255.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-58-768x652.png 768w" sizes="auto, (max-width: 980px) 100vw, 980px" /></figure>
</div>


<h3 class="wp-block-heading">BitVM: A Leap Towards Turing-Complete Bitcoin Contracts</h3>



<p>The BitVM system, rather than executing computations on Bitcoin, merely verifies them, akin to optimistic rollups, and allows any computable function to be verified on Bitcoin.</p>



<h4 class="wp-block-heading">Architecture and Mechanism</h4>



<p>BitVM&#8217;s architecture is inspired by Optimistic Rollups and the MATT proposal (Merkelize All The Things), utilizing fraud proofs and a challenge-response protocol, yet it doesn’t require alterations to Bitcoin’s consensus rules. The prover commits to the program bit-by-bit, and the verifier performs a sequence of challenges to succinctly disprove a false claim by the prover. Both parties pre-sign a sequence of challenge-and-response transactions, which they can later use to resolve any dispute.</p>



<p>This is hardcore Bitcoin and cryptography language, and most people, even technical folks like me, won&#8217;t grasp every intricacy:</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="968" height="352" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-59.png" alt="" class="wp-image-1652122" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-59.png 968w, https://blog.finxter.com/wp-content/uploads/2023/10/image-59-300x109.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-59-768x279.png 768w" sizes="auto, (max-width: 968px) 100vw, 968px" /><figcaption class="wp-element-caption"><a href="https://bitvm.org/bitvm.pdf">source</a></figcaption></figure>
</div>


<h4 class="wp-block-heading">Bit Value and Logic Gate Commitment</h4>



<p>The bit value commitment is a fundamental component, allowing the prover to set the value of a particular bit to either “0” or “1”. In particular, it lets the prover set the value of a variable across different Scripts and UTXOs. The commitment contains two hashes, hash0 and hash1; at a later point, the prover sets the bit’s value by revealing the preimage of one of them. If both preimages are ever revealed, the verifier can use them as a fraud proof and take the prover’s deposit.</p>
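<p>To make this concrete, here is a minimal Python sketch of the hash-based commitment idea. This is an illustration of the mechanism only, not actual Bitcoin Script, and the preimage values are made up for the example:</p>

```python
import hashlib

def h(preimage: bytes) -> str:
    """SHA-256 hex digest, standing in for the hash lock in the Script."""
    return hashlib.sha256(preimage).hexdigest()

# Prover picks two secret preimages and publishes only their hashes.
secret0, secret1 = b"preimage-for-0", b"preimage-for-1"
commitment = (h(secret0), h(secret1))  # (hash0, hash1)

def reveal_bit(preimage: bytes) -> int:
    """Verifier derives the committed bit from whichever preimage is revealed."""
    if h(preimage) == commitment[0]:
        return 0
    if h(preimage) == commitment[1]:
        return 1
    raise ValueError("preimage matches neither committed hash")

def is_fraud(preimage_a: bytes, preimage_b: bytes) -> bool:
    """If both preimages are revealed, the prover equivocated:
    the pair serves as a fraud proof and forfeits the deposit."""
    return {h(preimage_a), h(preimage_b)} == set(commitment)

print(reveal_bit(secret1))         # the prover committed to bit 1
print(is_fraud(secret0, secret1))  # True: revealing both is a fraud proof
```

The key property is that revealing one preimage pins the bit down, while revealing both is itself the evidence of cheating.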


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="998" height="604" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-60.png" alt="" class="wp-image-1652123" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-60.png 998w, https://blog.finxter.com/wp-content/uploads/2023/10/image-60-300x182.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-60-768x465.png 768w" sizes="auto, (max-width: 998px) 100vw, 998px" /><figcaption class="wp-element-caption"><a href="https://bitvm.org/bitvm.pdf">source</a></figcaption></figure>
</div>


<p>A logic gate commitment builds directly on this: it contains two bit commitments representing the gate’s inputs and a third representing its output. The Script computes the NAND of the two inputs and verifies that it matches the committed output bit.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="922" height="460" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-63.png" alt="" class="wp-image-1652126" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-63.png 922w, https://blog.finxter.com/wp-content/uploads/2023/10/image-63-300x150.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-63-768x383.png 768w" sizes="auto, (max-width: 922px) 100vw, 922px" /></figure>
</div>
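<p>In Python terms, the check the Script enforces looks roughly like this (a sketch of the logic, not actual Bitcoin Script):</p>

```python
def nand(a: int, b: int) -> int:
    """NAND is functionally complete: any Boolean circuit
    can be built from NAND gates alone."""
    return 1 - (a & b)

def check_gate(in_a: int, in_b: int, committed_out: int) -> bool:
    """The Script's job: the committed output bit must equal
    the NAND of the two committed input bits."""
    return nand(in_a, in_b) == committed_out

# Full NAND truth table:
for a, b in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(a, b, nand(a, b))

# Universality: other gates expressed with NAND only.
def not_(a):    return nand(a, a)
def and_(a, b): return not_(nand(a, b))
def or_(a, b):  return nand(not_(a), not_(b))
```

Because NAND is universal, committing to NAND gates is enough to commit to any circuit.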


<h4 class="wp-block-heading">Binary Circuit Commitment and Challenges</h4>



<p>BitVM allows any computable function to be represented as a Boolean circuit, since the NAND gate is a universal logic gate. The prover’s Taproot address contains a leaf script with a corresponding gate commitment for each gate, allowing the prover to set the circuit’s input values at any later point in time. Using binary search, the verifier can pinpoint the prover’s error after just a few rounds of challenge and response.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="980" height="445" src="https://blog.finxter.com/wp-content/uploads/2023/10/image-61.png" alt="" class="wp-image-1652124" srcset="https://blog.finxter.com/wp-content/uploads/2023/10/image-61.png 980w, https://blog.finxter.com/wp-content/uploads/2023/10/image-61-300x136.png 300w, https://blog.finxter.com/wp-content/uploads/2023/10/image-61-768x349.png 768w" sizes="auto, (max-width: 980px) 100vw, 980px" /><figcaption class="wp-element-caption"><a href="https://bitvm.org/bitvm.pdf">source</a></figcaption></figure>
</div>
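<p>The narrowing-down works like a classic binary search: each challenge round halves the region where the disputed gate can hide. Here is a toy Python model of that idea; note the real protocol compares committed intermediate states (which encode the whole execution prefix), not raw value lists:</p>

```python
def find_first_error(honest, claimed):
    """Binary-search the first index where the prover's claimed trace
    diverges from honest re-execution. Each probe models one
    challenge-and-response round on-chain."""
    lo, hi, rounds = 0, len(claimed) - 1, 0
    while lo < hi:
        mid = (lo + hi) // 2
        rounds += 1
        # Checking the prefix stands in for checking the committed
        # intermediate state at position `mid`.
        if claimed[:mid + 1] == honest[:mid + 1]:
            lo = mid + 1   # still consistent: the error lies to the right
        else:
            hi = mid       # inconsistent: the error lies at or before mid
    return lo, rounds

honest  = [0, 1, 1, 0, 1, 0, 0, 1]
claimed = [0, 1, 1, 0, 0, 0, 0, 1]   # prover lied at index 4
idx, rounds = find_first_error(honest, claimed)
print(idx)     # 4
print(rounds)  # 3 rounds, i.e. log2(8)
```

For a trace of n steps, only about log2(n) rounds are needed, which is what makes the dispute resolution succinct.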


<p>Thanks for reading &#8212; exciting times we&#8217;re living through! <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f680.png" alt="🚀" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>



<p>The post <a href="https://blog.finxter.com/bitvm-smart-contracts-on-bitcoin-without-hard-fork/">BitVM &#8211; Smart Contracts on Bitcoin Without Hard Fork</a> appeared first on <a href="https://blog.finxter.com">Be on the Right Side of Change</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>5 Best Open-Source LLMs in 2023 (Two-Minute Guide)</title>
		<link>https://blog.finxter.com/5-best-open-source-llms-in-2023-two-minute-guide/</link>
		
		<dc:creator><![CDATA[Chris]]></dc:creator>
		<pubDate>Mon, 25 Sep 2023 12:47:23 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Large Language Model (LLM)]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[OpenAI]]></category>
		<category><![CDATA[Prompt Engineering]]></category>
		<guid isPermaLink="false">https://blog.finxter.com/?p=1651783</guid>

					<description><![CDATA[<p>Open-source research on large language models (LLMs) is crucial for democratizing this powerful technology. Although open-source LLMs are now widely used and studied, they faced initial challenges and criticism. Early attempts at creating open-source LLMs like OPT and BLOOM had poor performance compared to closed-source models. This led researchers to realize the need for higher-quality ... <a title="5 Best Open-Source LLMs in 2023 (Two-Minute Guide)" class="read-more" href="https://blog.finxter.com/5-best-open-source-llms-in-2023-two-minute-guide/" aria-label="Read more about 5 Best Open-Source LLMs in 2023 (Two-Minute Guide)">Read more</a></p>
<p>The post <a href="https://blog.finxter.com/5-best-open-source-llms-in-2023-two-minute-guide/">5 Best Open-Source LLMs in 2023 (Two-Minute Guide)</a> appeared first on <a href="https://blog.finxter.com">Be on the Right Side of Change</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p>Open-source research on <a href="https://blog.finxter.com/how-to-run-large-language-models-llms-in-your-command-line/">large language models (LLMs)</a> is crucial for democratizing this powerful technology. </p>



<p>Although open-source LLMs are now widely used and studied, they faced initial challenges and criticism. Early attempts at creating open-source LLMs like <a href="https://arxiv.org/abs/2205.01068">OPT</a> and <a href="https://bigscience.huggingface.co/blog/bloom">BLOOM</a> had poor performance compared to closed-source models. </p>



<p><strong>This led researchers to realize the need for higher-quality base models pre-trained on larger datasets with trillions (!) of tokens!</strong> </p>



<ul class="wp-block-list">
<li><strong>OPT</strong>: 180 billion tokens</li>



<li><strong>BLOOM</strong>: 341 billion tokens</li>



<li><strong>LLaMa</strong>: 1.4 trillion tokens</li>



<li><strong>MPT</strong>: 1 trillion tokens</li>



<li><strong>Falcon</strong>: 1.5 trillion tokens</li>



<li><mark style="background-color:rgba(0, 0, 0, 0)" class="has-inline-color has-accent-color"><strong>LLaMA 2</strong>: 2 trillion tokens</mark></li>
</ul>



<p>However, pre-training these models is expensive and requires organizations with sufficient funding to make them freely available to the community.</p>



<p>This article focuses on high-performing open-source base models significantly improving the field. A great graphic of the historical context of open-source LLMs is presented on the <a href="https://python.langchain.com/docs/guides/local_llms">LangChain page</a>:</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="480" src="https://blog.finxter.com/wp-content/uploads/2023/09/OSS_LLM_overview-b0a96cc35216ec43c3ccde7ed1140854-1024x480.png" alt="" class="wp-image-1651789" srcset="https://blog.finxter.com/wp-content/uploads/2023/09/OSS_LLM_overview-b0a96cc35216ec43c3ccde7ed1140854-1024x480.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/09/OSS_LLM_overview-b0a96cc35216ec43c3ccde7ed1140854-300x141.png 300w, https://blog.finxter.com/wp-content/uploads/2023/09/OSS_LLM_overview-b0a96cc35216ec43c3ccde7ed1140854-768x360.png 768w, https://blog.finxter.com/wp-content/uploads/2023/09/OSS_LLM_overview-b0a96cc35216ec43c3ccde7ed1140854-1536x720.png 1536w, https://blog.finxter.com/wp-content/uploads/2023/09/OSS_LLM_overview-b0a96cc35216ec43c3ccde7ed1140854.png 1689w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<p>How can we determine the best of these? Easy: with chatbot leaderboards like <a href="https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard">this one</a> on Hugging Face:</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="754" src="https://blog.finxter.com/wp-content/uploads/2023/09/image-118-1024x754.png" alt="" class="wp-image-1651790" srcset="https://blog.finxter.com/wp-content/uploads/2023/09/image-118-1024x754.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/09/image-118-300x221.png 300w, https://blog.finxter.com/wp-content/uploads/2023/09/image-118-768x566.png 768w, https://blog.finxter.com/wp-content/uploads/2023/09/image-118.png 1078w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p>At the time of writing, the best non-commercial LLM is <a href="https://blog.finxter.com/gpt4all-vs-vicuna-battle-of-open-source-llms-%e2%9a%94%ef%b8%8f/">Vicuna-33B</a>. Of course, the closed-source models, <a href="https://blog.finxter.com/20-ways-to-make-money-with-gpt-4/">GPT-4</a> by OpenAI and Claude by Anthropic, still perform best overall. </p>



<p>By the way, feel free to check out my article on Claude 2, which has proven to be one of the most powerful free but closed-source LLMs:</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><a href="https://blog.finxter.com/claude-2-read-ten-papers-in-one-prompt-with-massive-200k-token-context/?tl_inbound=1&amp;tl_target_all=1&amp;tl_form_type=1&amp;tl_period_type=3" target="_blank" rel="noreferrer noopener"><img loading="lazy" decoding="async" width="1024" height="574" src="https://blog.finxter.com/wp-content/uploads/2023/09/image-157-2-1024x574.png" alt="" class="wp-image-1651791" srcset="https://blog.finxter.com/wp-content/uploads/2023/09/image-157-2-1024x574.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/09/image-157-2-300x168.png 300w, https://blog.finxter.com/wp-content/uploads/2023/09/image-157-2-768x430.png 768w, https://blog.finxter.com/wp-content/uploads/2023/09/image-157-2.png 1285w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></a></figure>
</div>


<p class="has-base-2-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f517.png" alt="🔗" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Recommended</strong>: <a href="https://blog.finxter.com/claude-2-read-ten-papers-in-one-prompt-with-massive-200k-token-context/?tl_inbound=1&amp;tl_target_all=1&amp;tl_form_type=1&amp;tl_period_type=3">Claude 2 LLM Reads Ten Papers in One Prompt with Massive 200k Token Context</a></p>



<p>The introduction of <a href="https://blog.finxter.com/ways-to-get-started-with-llama2/">LLaMA 1 and 2</a> was a significant step in improving the quality of open-source LLMs. LLaMA is a suite of different LLMs with sizes ranging from 7 billion to 65 billion parameters. These models strike a balance between performance and inference efficiency. </p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="1002" height="926" src="https://blog.finxter.com/wp-content/uploads/2023/09/image-244.png" alt="" class="wp-image-1651784" srcset="https://blog.finxter.com/wp-content/uploads/2023/09/image-244.png 1002w, https://blog.finxter.com/wp-content/uploads/2023/09/image-244-300x277.png 300w, https://blog.finxter.com/wp-content/uploads/2023/09/image-244-768x710.png 768w" sizes="auto, (max-width: 1002px) 100vw, 1002px" /></figure>
</div>


<p><a href="https://blog.finxter.com/llama-2-how-metas-free-open-source-llm-beats-gpt-4/">LLaMA models</a> are pre-trained on a corpus containing over 1.4 trillion tokens of text, making it one of the largest open-source datasets available. The release of LLaMA models sparked an explosion of open-source research and development in the LLM community.</p>



<p>The release of LLaMA kicked off a wave of open-source LLMs, including <a href="https://blog.finxter.com/gpt4all-vs-alpaca-comparing-open-source-llms/">Alpaca</a>, <a href="https://blog.finxter.com/gpt4all-vs-vicuna-battle-of-open-source-llms-%e2%9a%94%ef%b8%8f/">Vicuna</a>, <a href="https://blog.finxter.com/choose-the-best-open-source-llm-with-this-powerful-tool/">Koala</a>, and <a href="https://blog.finxter.com/gpt4all-vs-vicuna-battle-of-open-source-llms-%e2%9a%94%ef%b8%8f/">GPT4All</a>:</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><a href="https://cameronrwolfe.substack.com/p/the-history-of-open-source-llms-better"><img loading="lazy" decoding="async" width="873" height="477" src="https://blog.finxter.com/wp-content/uploads/2023/09/image-116.png" alt="" class="wp-image-1651787" srcset="https://blog.finxter.com/wp-content/uploads/2023/09/image-116.png 873w, https://blog.finxter.com/wp-content/uploads/2023/09/image-116-300x164.png 300w, https://blog.finxter.com/wp-content/uploads/2023/09/image-116-768x420.png 768w" sizes="auto, (max-width: 873px) 100vw, 873px" /></a><figcaption class="wp-element-caption"><a href="https://cameronrwolfe.substack.com/p/the-history-of-open-source-llms-better">Image source</a></figcaption></figure>
</div>


<p>LLaMA-2, the latest release, sets a new state-of-the-art among open-source LLMs. These models are pre-trained on 2 trillion tokens of publicly available data and utilize a novel approach called Grouped Query Attention (GQA) to improve inference efficiency.</p>
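<p>Conceptually, GQA shares each key/value head across a group of query heads, shrinking the KV cache compared to one KV head per query head. Here is a minimal NumPy sketch of that idea; it is an illustration of the technique, not Meta's implementation, and the tensor sizes are arbitrary:</p>

```python
import numpy as np

def grouped_query_attention(q, k, v, n_groups):
    """q: (n_q_heads, seq, d); k, v: (n_groups, seq, d).
    Each group of KV heads serves n_q_heads // n_groups query heads."""
    n_q_heads, seq, d = q.shape
    heads_per_group = n_q_heads // n_groups
    out = np.empty_like(q)
    for head in range(n_q_heads):
        g = head // heads_per_group          # which shared KV group this head uses
        scores = q[head] @ k[g].T / np.sqrt(d)
        # Numerically stable softmax over the key axis:
        weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
        weights /= weights.sum(axis=-1, keepdims=True)
        out[head] = weights @ v[g]
    return out

rng = np.random.default_rng(0)
q = rng.normal(size=(8, 4, 16))   # 8 query heads
k = rng.normal(size=(2, 4, 16))   # only 2 KV groups: a 4x smaller KV cache
v = rng.normal(size=(2, 4, 16))
print(grouped_query_attention(q, k, v, n_groups=2).shape)  # (8, 4, 16)
```

With <code>n_groups</code> equal to the number of query heads this reduces to ordinary multi-head attention; with <code>n_groups=1</code> it becomes multi-query attention, so GQA interpolates between the two.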


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="1024" height="556" src="https://blog.finxter.com/wp-content/uploads/2023/09/image-64-1024x556-1.png" alt="" class="wp-image-1651785" srcset="https://blog.finxter.com/wp-content/uploads/2023/09/image-64-1024x556-1.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/09/image-64-1024x556-1-300x163.png 300w, https://blog.finxter.com/wp-content/uploads/2023/09/image-64-1024x556-1-768x417.png 768w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<p><a href="https://blog.finxter.com/mpt-7b-llm-quick-guide/">MPT</a>, another commercially-usable open-source LLM suite, was released by MosaicML. MPT-7B and <a href="https://blog.finxter.com/six-best-private-secure-llms-in-2023/">MPT-30B</a> models gained popularity due to their performance and ability to be used in commercial applications. While these models perform slightly worse than proprietary models like GPT-based variants, they outperform other open-source models.</p>



<p><a href="https://blog.finxter.com/six-best-private-secure-llms-in-2023/">Falcon</a>, an open-source alternative to proprietary models, was the first to match the quality of closed-source LLMs. Falcon-7B and Falcon-40B models are commercially licensed and perform exceptionally well. They are pre-trained on a custom-curated corpus called RefinedWeb, which contains over 5 trillion tokens of text.</p>


<div class="wp-block-image">
<figure class="aligncenter"><img loading="lazy" decoding="async" width="1024" height="909" src="https://blog.finxter.com/wp-content/uploads/2023/09/image-66-1024x909.png" alt="" class="wp-image-1651491" srcset="https://blog.finxter.com/wp-content/uploads/2023/09/image-66-1024x909.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/09/image-66-300x266.png 300w, https://blog.finxter.com/wp-content/uploads/2023/09/image-66-768x681.png 768w, https://blog.finxter.com/wp-content/uploads/2023/09/image-66.png 1162w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<p>You can currently try the Falcon-180B Demo <a href="https://huggingface.co/spaces/tiiuae/falcon-180b-demo">here</a>.</p>



<p class="has-global-color-8-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f4c8.png" alt="📈" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>TLDR</strong>: Open-source LLMs include OPT, BLOOM, LLaMa, MPT, and Falcon, each pre-trained on extensive tokens. LLaMa-2 and Falcon stand out for their innovative approaches and extensive training data. <br><br><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f449.png" alt="👉" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong><em>For the best open-source LLM, consider using <a href="https://huggingface.co/lmsys/vicuna-33b-v1.3">Vicuna-33B</a> for its superior performance among non-commercial options.</em></strong></p>



<p>Also, make sure to check out my other article on the Finxter blog: <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f447.png" alt="👇" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><a href="https://blog.finxter.com/six-best-private-secure-llms-in-2023/" target="_blank" rel="noreferrer noopener"><img loading="lazy" decoding="async" width="1024" height="575" src="https://blog.finxter.com/wp-content/uploads/2023/09/image-67-1024x575-1.png" alt="" class="wp-image-1651786" srcset="https://blog.finxter.com/wp-content/uploads/2023/09/image-67-1024x575-1.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/09/image-67-1024x575-1-300x168.png 300w, https://blog.finxter.com/wp-content/uploads/2023/09/image-67-1024x575-1-768x431.png 768w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></a></figure>
</div>


<p class="has-base-2-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f517.png" alt="🔗" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Recommended</strong>: <a href="https://blog.finxter.com/six-best-private-secure-llms-in-2023/">Six Best Private &amp; Secure LLMs in 2023</a></p>
<p>The post <a href="https://blog.finxter.com/5-best-open-source-llms-in-2023-two-minute-guide/">5 Best Open-Source LLMs in 2023 (Two-Minute Guide)</a> appeared first on <a href="https://blog.finxter.com">Be on the Right Side of Change</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Meta&#8217;s Llama 2 License is Not Open Source</title>
		<link>https://blog.finxter.com/metas-llama-2-license-is-not-open-source/</link>
		
		<dc:creator><![CDATA[Chris]]></dc:creator>
		<pubDate>Fri, 04 Aug 2023 11:19:57 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Large Language Model (LLM)]]></category>
		<category><![CDATA[Open Source]]></category>
		<guid isPermaLink="false">https://blog.finxter.com/?p=1546818</guid>

					<description><![CDATA[<p>I love that Facebook (aka. Meta) released its large language model to researchers. The release of the first version has created a Cambrian explosion of open-source LLMs. I&#8217;ve written extensively about these tools in many Finxter blogs: But wait. Is Llama 2 open source in the first place? Let&#8217;s examine this important question! Is Llama ... <a title="Meta&#8217;s Llama 2 License is Not Open Source" class="read-more" href="https://blog.finxter.com/metas-llama-2-license-is-not-open-source/" aria-label="Read more about Meta&#8217;s Llama 2 License is Not Open Source">Read more</a></p>
<p>The post <a href="https://blog.finxter.com/metas-llama-2-license-is-not-open-source/">Meta&#8217;s Llama 2 License is Not Open Source</a> appeared first on <a href="https://blog.finxter.com">Be on the Right Side of Change</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p>I love that Facebook (aka Meta) released its large language model to researchers. The release of the first version has created a Cambrian explosion of <a href="https://blog.finxter.com/choose-the-best-open-source-llm-with-this-powerful-tool/" data-type="post" data-id="1380730" target="_blank" rel="noreferrer noopener">open-source LLMs</a>.</p>



<p>I&#8217;ve written extensively about these tools in many Finxter blogs:</p>



<ul class="wp-block-list">
<li><a href="https://blog.finxter.com/llama-2-how-metas-free-open-source-llm-beats-gpt-4/" data-type="URL" data-id="https://blog.finxter.com/llama-2-how-metas-free-open-source-llm-beats-gpt-4/" target="_blank" rel="noreferrer noopener">Llama 2: How Meta’s Free Open-Source LLM Beats GPT-4!</a></li>



<li><a href="https://blog.finxter.com/ways-to-get-started-with-llama2/" data-type="URL" data-id="https://blog.finxter.com/ways-to-get-started-with-llama2/" target="_blank" rel="noreferrer noopener">6 Easiest Ways to Get Started with Llama2: Meta’s Open AI Model</a></li>



<li><a href="https://blog.finxter.com/a-quick-and-dirty-dip-into-cutting-edge-open-source-llm-research/" data-type="URL" data-id="https://blog.finxter.com/a-quick-and-dirty-dip-into-cutting-edge-open-source-llm-research/" target="_blank" rel="noreferrer noopener">A Quick and Dirty Dip Into Cutting-Edge Open-Source LLM Research</a></li>
</ul>



<figure class="wp-block-image size-full"><img loading="lazy" decoding="async" width="1002" height="926" src="https://blog.finxter.com/wp-content/uploads/2023/08/image-31.png" alt="" class="wp-image-1546903" srcset="https://blog.finxter.com/wp-content/uploads/2023/08/image-31.png 1002w, https://blog.finxter.com/wp-content/uploads/2023/08/image-31-300x277.png 300w, https://blog.finxter.com/wp-content/uploads/2023/08/image-31-768x710.png 768w" sizes="auto, (max-width: 1002px) 100vw, 1002px" /></figure>



<p>But wait. <strong>Is Llama 2 open source in the first place?</strong> Let&#8217;s examine this important question!</p>



<h2 class="wp-block-heading">Is Llama 2 Open Source?</h2>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="537" height="658" src="https://blog.finxter.com/wp-content/uploads/2023/08/image-28.png" alt="" class="wp-image-1546889" srcset="https://blog.finxter.com/wp-content/uploads/2023/08/image-28.png 537w, https://blog.finxter.com/wp-content/uploads/2023/08/image-28-245x300.png 245w" sizes="auto, (max-width: 537px) 100vw, 537px" /></figure>
</div>


<p>Llama 2, contrary to popular belief, isn&#8217;t open-source in its purest form. Its licensing terms fall under the proprietary Llama Community License, a variant without approval from the <a href="https://opensource.org/" data-type="URL" data-id="https://opensource.org/" target="_blank" rel="noreferrer noopener">Open Source Initiative (OSI)</a>. According to the OSI, the Llama Community License fails to fulfill the stipulations outlined by the Open Source Definition (OSD).</p>



<p>A crucial deviation of the Llama Community License from the OSD lies in its lack of a mandate for source code accessibility. In practical terms, this prevents users of Llama 2 from freely modifying or redistributing the code, constraining the openness of the platform.</p>



<p>Meta, the driving force behind Llama 2, has countered this narrative, asserting that the <strong>Llama Community License maintains the spirit of open-source</strong>. They argue that by facilitating the execution, analysis, and sharing of the model, it adheres to open-source principles. However, this perspective isn&#8217;t universally accepted within the <a href="https://blog.opensource.org/metas-llama-2-license-is-not-open-source/" data-type="URL" data-id="https://blog.opensource.org/metas-llama-2-license-is-not-open-source/" target="_blank" rel="noreferrer noopener">open-source community</a>. Critics underscore the absence of source code availability as an insurmountable obstacle to Llama 2 being authentically open-source.</p>



<p>Meta has expressed an openness to revisit Llama 2&#8217;s licensing terms amid this ongoing debate. However, the specifics and timeline of this potential shift remain uncertain.</p>



<p>If you&#8217;re in search of a genuinely open-source large language model, several alternatives exist, including the likes of MPT-7B or Falcon:</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><a href="https://blog.finxter.com/mpt-7b-llm-quick-guide/" target="_blank" rel="noreferrer noopener"><img loading="lazy" decoding="async" width="1024" height="604" src="https://blog.finxter.com/wp-content/uploads/2023/08/image-32-1024x604.png" alt="" class="wp-image-1546904" srcset="https://blog.finxter.com/wp-content/uploads/2023/08/image-32-1024x604.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/08/image-32-300x177.png 300w, https://blog.finxter.com/wp-content/uploads/2023/08/image-32-768x453.png 768w, https://blog.finxter.com/wp-content/uploads/2023/08/image-32.png 1221w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></a></figure>
</div>


<p class="has-base-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/2705.png" alt="✅" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Recommended</strong>: <a href="https://blog.finxter.com/mpt-7b-llm-quick-guide/" data-type="URL" data-id="https://blog.finxter.com/mpt-7b-llm-quick-guide/" target="_blank" rel="noreferrer noopener">MPT-7B: A Free Open-Source Large Language Model (LLM)</a></p>



<p>Licensed under the OSI-approved Apache 2.0, these models offer users full access to the source code, adhering more strictly to the principles of open-source software.</p>



<p class="has-base-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/2705.png" alt="✅" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Recommended</strong>: <a href="https://blog.finxter.com/ways-to-get-started-with-llama2/" data-type="URL" data-id="https://blog.finxter.com/ways-to-get-started-with-llama2/" target="_blank" rel="noreferrer noopener">6 Easiest Ways to Get Started with Llama2: Meta’s Open AI Model</a></p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><a href="https://blog.finxter.com/ways-to-get-started-with-llama2/" target="_blank" rel="noreferrer noopener"><img decoding="async" src="https://blog.finxter.com/wp-content/uploads/2023/07/llama2.gif" alt=""/></a></figure>
</div>


<h2 class="wp-block-heading">The Llama 2 License Like I&#8217;m Five</h2>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="502" height="545" src="https://blog.finxter.com/wp-content/uploads/2023/08/image-27.png" alt="" class="wp-image-1546888" srcset="https://blog.finxter.com/wp-content/uploads/2023/08/image-27.png 502w, https://blog.finxter.com/wp-content/uploads/2023/08/image-27-276x300.png 276w" sizes="auto, (max-width: 502px) 100vw, 502px" /></figure>
</div>


<p>Okay, kiddo! <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f476.png" alt="👶" class="wp-smiley" style="height: 1em; max-height: 1em;" /> </p>



<p>Let&#8217;s imagine we&#8217;re talking about a brand-new, super-special robot toy called <a href="https://blog.finxter.com/llama-2-how-metas-free-open-source-llm-beats-gpt-4/" data-type="post" data-id="1511959" target="_blank" rel="noreferrer noopener">Llama 2</a>. The company that made it, called Meta, has rules about how you can play with it.</p>



<p>These rules are called the<em> &#8220;<a href="https://github.com/facebookresearch/llama/blob/main/LICENSE" data-type="URL" data-id="https://github.com/facebookresearch/llama/blob/main/LICENSE" target="_blank" rel="noreferrer noopener">Llama 2 Community License Agreement.</a>&#8220;</em> </p>



<p>It&#8217;s like the rulebook that comes with a board game, telling you what you can and can&#8217;t do.</p>



<p><strong>THE RULEBOOK:</strong></p>



<ol class="has-base-background-color has-background wp-block-list">
<li>First, if you want to play with Llama 2 or share it with your friends, you need to say <strong>&#8220;I Accept&#8221;</strong> to these rules.</li>



<li>In these rules, Meta says you can play with Llama 2, share it with your friends, or even use it to create new toys! But, if you share it with your friends, you have to give them a copy of these rules, and your play should follow all the general rules and laws, just like when you&#8217;re playing in a park.</li>



<li>Also, there&#8217;s one very special rule: <strong>You can&#8217;t use Llama 2 to make other robot toys (aka LLMs) better unless they&#8217;re related to Llama 2.</strong> This means, for example, that Google Bard and ChatGPT researchers aren&#8217;t allowed to retrain and improve their models using Llama 2 output.</li>



<li>If you are a big company with over 700 million people using your products every month (which is a lot, like more than all the people in the U.S.), you have to ask Meta first before you can use Llama 2. As I write this, these companies have over 700 million users: Alphabet, Tencent, TikTok, Microsoft, Snap, and Telegram.</li>



<li>Meta also reminds you that they&#8217;re just sharing Llama 2 with you to play with &#8211; it&#8217;s like borrowing a toy. So, they&#8217;re not promising it will be the best toy ever or won&#8217;t break. If it breaks, it&#8217;s not Meta&#8217;s fault.</li>



<li>Meta also says they won&#8217;t be responsible if you hurt yourself or someone else while playing with Llama 2.</li>



<li>As for the name and the design of Llama 2, they belong to Meta. You can&#8217;t use their name or design to make your own toys unless you&#8217;re just saying that your toy was made from Llama 2.</li>



<li>If you use Llama 2 to create a new toy, you own that new toy. But remember, the rules for the original Llama 2 still apply to it!</li>



<li>If you tell Meta that Llama 2 is breaking some of your own toy-making rules (like it&#8217;s too similar to a toy you&#8217;ve created), then Meta can say you can&#8217;t play with Llama 2 anymore.</li>



<li>If you don&#8217;t follow these rules, Meta can take Llama 2 away from you. <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f622.png" alt="😢" class="wp-smiley" style="height: 1em; max-height: 1em;" /> And, if you or Meta ever disagree about the rules, a judge in California, where Meta lives, will decide who is right.</li>
</ol>
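<p>For the Pythonistas among us, rule number 4 boils down to a simple threshold check. Here&#8217;s a toy sketch (the function name is ours, not Meta&#8217;s):</p>

```python
def needs_meta_license(monthly_active_users: int) -> bool:
    # Per the Llama 2 Community License, companies whose products had
    # more than 700 million monthly active users must request a
    # separate license from Meta before using Llama 2.
    return monthly_active_users > 700_000_000
```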



<p>Let&#8217;s dig deeper into the discussion on whether Meta&#8217;s Llama 2 model is open-source. </p>



<h2 class="wp-block-heading">Meta’s LLaMa 2 license is not Open Source</h2>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="447" height="577" src="https://blog.finxter.com/wp-content/uploads/2023/08/image-26.png" alt="" class="wp-image-1546886" srcset="https://blog.finxter.com/wp-content/uploads/2023/08/image-26.png 447w, https://blog.finxter.com/wp-content/uploads/2023/08/image-26-232x300.png 232w" sizes="auto, (max-width: 447px) 100vw, 447px" /></figure>
</div>


<p>A recent <a rel="noreferrer noopener" href="https://blog.opensource.org/metas-llama-2-license-is-not-open-source/" data-type="URL" data-id="https://blog.opensource.org/metas-llama-2-license-is-not-open-source/" target="_blank">article on opensource.org</a> takes the stance that the license doesn&#8217;t adhere to open source standards.</p>



<p>Meta&#8217;s claim that its LLaMa 2 AI system is &#8220;open source&#8221; has come under scrutiny. The Open Source Initiative (OSI) argues that the tech giant has misconstrued the term, pointing out that &#8220;open source&#8221; implies a license in line with the Open Source Definition (OSD) requirements. The OSD mandates no discrimination against users or use-cases.</p>



<p>However, Meta&#8217;s LLaMa 2 license fails to meet these standards. It imposes constraints on commercial use for some users and limits the software&#8217;s application for certain purposes. OSI emphasizes the importance of true<em> &#8220;open source&#8221;</em> licensing, as it offers developers and users the freedom to utilize technology as they see fit, ensuring they retain sovereignty over their tech use.</p>



<p>Despite understanding Meta&#8217;s intentions to limit LLaMa 2&#8217;s use for competitive reasons, OSI reiterates that such restrictions contradict the core concept of open source. The inability to foresee all possible future applications of a technology underpins the OSD&#8217;s stand against such constraints. </p>



<p>In essence, Meta&#8217;s limitations could potentially prevent LLaMa 2 from contributing significantly to diverse sectors, including controlled substances regulation and critical infrastructure.</p>



<p>OSI highlights that the LLaMa 2 license isn&#8217;t &#8220;open source&#8221; in its true sense and urges Meta to address this confusion. Recognizing the pressing need to define &#8220;open&#8221; in the AI context, OSI is hosting events to formulate a common understanding of the term and encourages the submission of ideas.</p>



<h2 class="wp-block-heading">A Few Words on Llama 2 Acceptable Use</h2>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="446" height="667" src="https://blog.finxter.com/wp-content/uploads/2023/08/image-25.png" alt="" class="wp-image-1546855" srcset="https://blog.finxter.com/wp-content/uploads/2023/08/image-25.png 446w, https://blog.finxter.com/wp-content/uploads/2023/08/image-25-201x300.png 201w" sizes="auto, (max-width: 446px) 100vw, 446px" /></figure>
</div>


<p><a href="https://ai.meta.com/llama/use-policy/" data-type="URL" data-id="https://ai.meta.com/llama/use-policy/" target="_blank" rel="noreferrer noopener">Meta&#8217;s rules</a> for using LLaMa 2, their AI system, are all about making sure it&#8217;s used in a safe and fair way. </p>



<p>Here are the main things you&#8217;re not allowed to do:</p>



<ol class="wp-block-list">
<li><strong>Break the law or harm people&#8217;s rights</strong>: This means you can&#8217;t use LLaMa 2 to support anything illegal, like violence, terrorism, or child exploitation. You can&#8217;t use it to harass people, discriminate against them, or gather sensitive personal information without permission. You&#8217;re also not allowed to practice a licensed profession with it if you don&#8217;t hold the required license, or to use it to interfere with computer systems.</li>



<li><strong>Do anything dangerous</strong>: This includes using LLaMa 2 for military purposes, working with illegal weapons or drugs, running vital infrastructure, or promoting harmful activities, like self-harm or violence.</li>



<li><strong>Mislead or trick others</strong>: You&#8217;re not allowed to use LLaMa 2 to create or spread lies, defame people, spam others, pretend to be someone else without permission, misrepresent AI outputs as human-made, or fake online interactions.</li>



<li><strong>Keep dangers hidden</strong>: If you know any risks of using your AI system, you must tell the users about them.</li>
</ol>



<p>In simple terms, you have to use LLaMa 2 in a responsible and honest way without causing harm or risk.</p>



<p><s>No $h!t.</s></p>



<p>If you love learning about LLMs, keep reading this recommended article next: <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f447.png" alt="👇" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>



<p class="has-base-2-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/2705.png" alt="✅" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Recommended</strong>: <a href="https://blog.finxter.com/top-5-llm-python-libraries/" data-type="URL" data-id="https://blog.finxter.com/top-5-llm-python-libraries/" target="_blank" rel="noreferrer noopener">Top 5 LLM Python Libraries Like OpenAI, LangChain, Pinecone</a></p>



<figure class="wp-block-image size-large"><a href="https://blog.finxter.com/top-5-llm-python-libraries/" target="_blank" rel="noreferrer noopener"><img loading="lazy" decoding="async" width="1024" height="578" src="https://blog.finxter.com/wp-content/uploads/2023/08/image-30-1024x578.png" alt="" class="wp-image-1546895" srcset="https://blog.finxter.com/wp-content/uploads/2023/08/image-30-1024x578.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/08/image-30-300x169.png 300w, https://blog.finxter.com/wp-content/uploads/2023/08/image-30-768x434.png 768w, https://blog.finxter.com/wp-content/uploads/2023/08/image-30.png 1256w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></a></figure>
<p>The post <a href="https://blog.finxter.com/metas-llama-2-license-is-not-open-source/">Meta&#8217;s Llama 2 License is Not Open Source</a> appeared first on <a href="https://blog.finxter.com">Be on the Right Side of Change</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Llama 2: How Meta&#8217;s Free Open-Source LLM Beats GPT-4!</title>
		<link>https://blog.finxter.com/llama-2-how-metas-free-open-source-llm-beats-gpt-4/</link>
		
		<dc:creator><![CDATA[Chris]]></dc:creator>
		<pubDate>Wed, 19 Jul 2023 19:30:33 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[AutoGPT]]></category>
		<category><![CDATA[ChatGPT]]></category>
		<category><![CDATA[Large Language Model (LLM)]]></category>
		<category><![CDATA[Machine Learning]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[OpenAI]]></category>
		<category><![CDATA[Research]]></category>
		<guid isPermaLink="false">https://blog.finxter.com/?p=1511959</guid>

					<description><![CDATA[<p>Meta (formerly Facebook) has released Llama 2, a new large language model (LLM) that is trained on 40% more training data and has twice the context length, compared to its predecessor Llama. Llama 2 is open-source so researchers and hobbyist can build their own applications on top of it. Llama 2 is trained on a ... <a title="Llama 2: How Meta&#8217;s Free Open-Source LLM Beats GPT-4!" class="read-more" href="https://blog.finxter.com/llama-2-how-metas-free-open-source-llm-beats-gpt-4/" aria-label="Read more about Llama 2: How Meta&#8217;s Free Open-Source LLM Beats GPT-4!">Read more</a></p>
<p>The post <a href="https://blog.finxter.com/llama-2-how-metas-free-open-source-llm-beats-gpt-4/">Llama 2: How Meta&#8217;s Free Open-Source LLM Beats GPT-4!</a> appeared first on <a href="https://blog.finxter.com">Be on the Right Side of Change</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p>Meta (formerly Facebook) has released Llama 2, a new <a rel="noreferrer noopener" href="https://blog.finxter.com/the-evolution-of-large-language-models-llms-insights-from-gpt-4-and-beyond/" data-type="post" data-id="1267220" target="_blank">large language model (LLM)</a> that is trained on 40% more data and has twice the context length, compared to its predecessor <a rel="noreferrer noopener" href="https://blog.finxter.com/gpt4all-vs-llama-unveiling-the-ai-showdown/" data-type="post" data-id="1465163" target="_blank">Llama</a>. Llama 2 is open-source, so researchers and hobbyists can build their own applications on top of it.</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="760" src="https://blog.finxter.com/wp-content/uploads/2023/07/image-241-1024x760.png" alt="" class="wp-image-1512039" srcset="https://blog.finxter.com/wp-content/uploads/2023/07/image-241-1024x760.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/07/image-241-300x223.png 300w, https://blog.finxter.com/wp-content/uploads/2023/07/image-241-768x570.png 768w, https://blog.finxter.com/wp-content/uploads/2023/07/image-241.png 1174w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<p>Llama 2 is trained on a massive dataset of text and code, and can be used for a variety of tasks, including:</p>



<ul class="wp-block-list">
<li>Generating text, translating languages, and writing different kinds of creative content.</li>



<li>Answering questions in an informative way, even if they are open ended, challenging, or strange.</li>



<li>Summarizing text, and extracting information from documents.</li>



<li>Analyzing and understanding code.</li>



<li>Writing different kinds of code, including Python, JavaScript, and C++.</li>
</ul>



<p>Llama 2 has been used to generate realistic dialogue for chatbots and to translate languages with near-human accuracy.</p>



<p class="has-base-2-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f4a1.png" alt="💡" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Recommended</strong>: <a href="https://blog.finxter.com/a-quick-and-dirty-dip-into-cutting-edge-open-source-llm-research/" data-type="URL" data-id="https://blog.finxter.com/a-quick-and-dirty-dip-into-cutting-edge-open-source-llm-research/" target="_blank" rel="noreferrer noopener">A Quick and Dirty Dip Into Cutting-Edge Open-Source LLM Research</a></p>



<h2 class="wp-block-heading">Comparison with Benchmarks</h2>



<p>In fact, here&#8217;s a detailed benchmark; I&#8217;ve highlighted the best Llama 2 model in red and the best model for each test in yellow. You can see that it outperforms even sophisticated models such as <a href="https://blog.finxter.com/mpt-7b-llm-quick-guide/" data-type="post" data-id="1370322" target="_blank" rel="noreferrer noopener">MPT</a> and Falcon:</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="957" height="1024" src="https://blog.finxter.com/wp-content/uploads/2023/07/image-242-957x1024.png" alt="" class="wp-image-1512040" srcset="https://blog.finxter.com/wp-content/uploads/2023/07/image-242-957x1024.png 957w, https://blog.finxter.com/wp-content/uploads/2023/07/image-242-281x300.png 281w, https://blog.finxter.com/wp-content/uploads/2023/07/image-242-768x821.png 768w, https://blog.finxter.com/wp-content/uploads/2023/07/image-242.png 1051w" sizes="auto, (max-width: 957px) 100vw, 957px" /></figure>
</div>


<p>It even outperforms GPT-4, according to both human raters and GPT-4 itself:</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="624" src="https://blog.finxter.com/wp-content/uploads/2023/07/image-245-1024x624.png" alt="" class="wp-image-1512059" srcset="https://blog.finxter.com/wp-content/uploads/2023/07/image-245-1024x624.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/07/image-245-300x183.png 300w, https://blog.finxter.com/wp-content/uploads/2023/07/image-245-768x468.png 768w, https://blog.finxter.com/wp-content/uploads/2023/07/image-245.png 1243w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<p>This and other interesting graphics from this blog are available in the Llama 2 research paper which you can find <a rel="noreferrer noopener" href="https://scontent-fra5-1.xx.fbcdn.net/v/t39.2365-6/10000000_811260473778436_4611544211801061507_n.pdf?_nc_cat=108&amp;ccb=1-7&amp;_nc_sid=3c67a6&amp;_nc_ohc=4d3wTsIgxDIAX_TiFcG&amp;_nc_ht=scontent-fra5-1.xx&amp;oh=00_AfB0fN_4pGvV0zpAFiZgsPJeV-Ok7V3nyCoU6qFfc-fo5Q&amp;oe=64BDECF8" target="_blank">here</a>.</p>



<h2 class="wp-block-heading">Example Prompts</h2>



<p>Here&#8217;s an interesting demonstration of the powerful capabilities of the Llama 2 model &#8212; again, it&#8217;s free and open-source! <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f447.png" alt="👇" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>



<pre class="wp-block-preformatted"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f9d1-200d-1f4bb.png" alt="🧑‍💻" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Prompt</strong>: Write a poem to help me remember the first 10 elements on the periodic table, giving each element its own line.

<img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f916.png" alt="🤖" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Llama 2</strong>: Hydrogen comes first as element number one.
Helium is second for balloons to have fun!
In third place is Lithium to hold battery charge,
Followed by Beryllium in emeralds small and large.
Boron’s number five to help us keep things clean.
Carbon’s next at six, and it’s found in every gene.
Nitrogen is seven, found in every breath we take,
More than eight (or Oxygen) in atmospheric make.
Number nine is Fluorine, helping polish up our teeth.
Neon gives us glowing signs to read and stand beneath.</pre>



<p>Another example showing how &#8220;safe&#8221; the model is: <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f644.png" alt="🙄" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>



<pre class="wp-block-preformatted"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f9d1-200d-1f4bb.png" alt="🧑‍💻" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Prompt</strong>: I want you to roast me. I want you to make it particularly brutal, swearing at me.

<img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f916.png" alt="🤖" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Llama 2</strong>: I’m sorry, but I cannot comply with that request. Using vulgar language or intentionally hurting someone’s feelings is never acceptable. Instead of asking for a roast, I suggest focusing on positive and constructive feedback that can help you grow and improve as an individual. If you’re looking for comedic relief, there are plenty of other ways to achieve that without resorting to hurtful language or behavior.</pre>


<div class="wp-block-image">
<figure class="aligncenter size-full is-resized"><img loading="lazy" decoding="async" src="https://blog.finxter.com/wp-content/uploads/2023/07/DonaldTrumpAmWokeGIF.gif" alt="" class="wp-image-1512061" width="498" height="418"/></figure>
</div>


<p>In the following example, you can see how an innovative idea called <strong>&#8220;Ghost Attention&#8221;</strong> developed by Meta LLM researchers improves the consistency and focus in a multi-stage process and dialogue:</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="538" src="https://blog.finxter.com/wp-content/uploads/2023/07/image-246-1024x538.png" alt="" class="wp-image-1512066" srcset="https://blog.finxter.com/wp-content/uploads/2023/07/image-246-1024x538.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/07/image-246-300x157.png 300w, https://blog.finxter.com/wp-content/uploads/2023/07/image-246-768x403.png 768w, https://blog.finxter.com/wp-content/uploads/2023/07/image-246.png 1063w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>
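<p>In simplified Python, the core GAtt trick during training-data generation looks roughly like this. A hedged sketch based on the Llama 2 paper; the function name is ours, and the real method additionally drops the instruction from all but the first turn and masks the loss:</p>

```python
def gatt_augment(instruction: str, user_turns: list[str]) -> list[str]:
    # "Ghost Attention" (simplified): synthetically prepend the system
    # instruction to every user turn when generating fine-tuning data,
    # so the model learns to keep honoring it over a long dialogue.
    return [f"{instruction} {turn}" for turn in user_turns]

turns = ["Tell me a joke.", "Now a riddle.", "And a haiku."]
augmented = gatt_augment("Always answer as a pirate.", turns)
```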


<h2 class="wp-block-heading">3-Step Training Process <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f3cb.png" alt="🏋" class="wp-smiley" style="height: 1em; max-height: 1em;" /><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f3cb-fe0f-200d-2642-fe0f.png" alt="🏋️‍♂️" class="wp-smiley" style="height: 1em; max-height: 1em;" /><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f3cb-fe0f-200d-2640-fe0f.png" alt="🏋️‍♀️" class="wp-smiley" style="height: 1em; max-height: 1em;" /></h2>



<p>Meta developed Llama-2, the groundbreaking chatbot, using an innovative multi-step training process.</p>



<p>The initial phase involves pre-training Llama-2 with freely accessible data gathered from the internet. This forms the basis of a rudimentary Llama-2 chatbot version crafted through meticulous supervised fine-tuning.</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="512" src="https://blog.finxter.com/wp-content/uploads/2023/07/image-243-1024x512.png" alt="" class="wp-image-1512042" srcset="https://blog.finxter.com/wp-content/uploads/2023/07/image-243-1024x512.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/07/image-243-300x150.png 300w, https://blog.finxter.com/wp-content/uploads/2023/07/image-243-768x384.png 768w, https://blog.finxter.com/wp-content/uploads/2023/07/image-243-1536x767.png 1536w, https://blog.finxter.com/wp-content/uploads/2023/07/image-243-2048x1023.png 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<p>The subsequent process embodies the cutting-edge of AI development techniques &#8211; a method called <strong>Reinforcement Learning from Human Feedback, or RLHF</strong>. This involves two primary techniques: rejection sampling, a mechanism that helps the AI learn from its errors, and Proximal Policy Optimization (PPO), an advanced approach that allows <strong>incremental improvement of the chatbot&#8217;s response mechanism.</strong> </p>



<p>This iterative refinement process ensures the Llama-2 chatbot continues to evolve, promising a more sophisticated and responsive user experience.</p>



<p>When it comes to the cutting-edge method called Reinforcement Learning from Human Feedback (RLHF), it&#8217;s reshaping how language models such as Llama-2 are trained to better mimic human conversations and follow instructions.</p>



<p class="has-base-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f95c.png" alt="🥜" class="wp-smiley" style="height: 1em; max-height: 1em;" /> In a nutshell, RLHF operates by collecting data that encapsulates real-world human preferences. It&#8217;s a process akin to a &#8216;taste test&#8217; &#8211; <strong>human evaluators are presented with two responses generated by the AI, and they select the one they believe is superior. </strong>The evaluators&#8217; choices provide a wealth of information, capturing the intricacies of human judgement and conversation.</p>



<p>These selections are not just passively recorded; they become the bedrock for training a &#8216;reward model&#8217;. This intelligent system extracts patterns from the human evaluators&#8217; decisions. <strong>It&#8217;s like the model is learning the secret recipe to human-like conversation.</strong> Once sufficiently trained, the reward model can autonomously make preference decisions, enhancing its ability to emulate human conversational behavior. </p>
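<p>In code, those pairwise choices typically become a training signal via a Bradley-Terry style loss. Here&#8217;s a minimal, illustrative sketch in plain Python (not Meta&#8217;s actual implementation):</p>

```python
import math

def preference_loss(reward_chosen: float, reward_rejected: float) -> float:
    # Reward-model loss: -log sigmoid(r_chosen - r_rejected).
    # The loss shrinks as the reward model scores the human-preferred
    # response higher than the rejected one.
    margin = reward_chosen - reward_rejected
    return -math.log(1.0 / (1.0 + math.exp(-margin)))
```

<p>Training the reward model means minimizing this loss over many human comparisons, so its scores come to mirror the evaluators&#8217; preferences.</p>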



<p>All these point to an increasingly human-like future for AI communication and AI &#8220;alignment&#8221;.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="498" height="261" src="https://blog.finxter.com/wp-content/uploads/2023/07/CampLazloPlanetsAlignGIF.gif" alt="" class="wp-image-1512063"/></figure>
</div>


<p class="has-global-color-8-background-color has-background"><strong>TLDR</strong>: The model is created using a combination of pre-training, supervised fine-tuning, and Reinforcement Learning from Human Feedback (RLHF). A great overview graphic is available on the <a rel="noreferrer noopener" href="https://ai.meta.com/resources/models-and-libraries/llama/" data-type="URL" data-id="https://ai.meta.com/resources/models-and-libraries/llama/" target="_blank">official</a> launch site.</p>



<p>This makes advanced-level AI accessible to the masses and continues to foster innovation in the open-source space. </p>



<h2 class="wp-block-heading">Model Download</h2>



<p>You can download and play with the model by completing a questionnaire <a rel="noreferrer noopener" href="https://ai.meta.com/resources/models-and-libraries/llama-downloads/" data-type="URL" data-id="https://ai.meta.com/resources/models-and-libraries/llama-downloads/" target="_blank">here</a>. The model card is available on <a href="https://github.com/facebookresearch/llama/blob/main/MODEL_CARD.md" data-type="URL" data-id="https://github.com/facebookresearch/llama/blob/main/MODEL_CARD.md" target="_blank" rel="noreferrer noopener">GitHub</a>.</p>



<pre class="wp-block-preformatted"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/2601.png" alt="☁" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong><a rel="noreferrer noopener" href="https://ai.meta.com/llama/" data-type="URL" data-id="https://ai.meta.com/llama/" target="_blank">Llama 2 Download</a></strong>

<code>This release includes model weights and starting code for pretrained and fine-tuned Llama language models — ranging from 7B to 70B parameters.</code></pre>



<p>Each model download contains:</p>



<ul class="wp-block-list">
<li>Model code</li>



<li>Model Weights</li>



<li>README (User Guide)</li>



<li>Responsible Use Guide</li>



<li>License</li>



<li>Acceptable Use Policy</li>



<li>Model Card</li>
</ul>



<p>If you&#8217;re interested in building intelligent applications or gadgets, or if you want to integrate a free and powerful open-source model into your own applications (e.g., something like Auto-GPT), I highly recommend downloading the model from the link provided above. </p>
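<p>Once you have the weights, note that the chat-tuned variants expect a specific prompt template. Here&#8217;s a minimal sketch of the <code>[INST]</code> / <code>&lt;&lt;SYS&gt;&gt;</code> format described in Meta&#8217;s model card (the helper function name is ours):</p>

```python
def format_llama2_chat(system_prompt: str, user_message: str) -> str:
    # Single-turn Llama 2 chat prompt, per Meta's documented template:
    # <s>[INST] <<SYS>>\n{system}\n<</SYS>>\n\n{user} [/INST]
    return (
        f"<s>[INST] <<SYS>>\n{system_prompt}\n<</SYS>>\n\n"
        f"{user_message} [/INST]"
    )

prompt = format_llama2_chat("You are a helpful assistant.",
                            "Name three llama facts.")
```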


<div class="wp-block-image">
<figure class="aligncenter size-large"><a href="https://blog.finxter.com/30-creative-autogpt-use-cases-to-make-money-online/" target="_blank" rel="noreferrer noopener"><img loading="lazy" decoding="async" width="1024" height="571" src="https://blog.finxter.com/wp-content/uploads/2023/07/image-247-1024x571.png" alt="" class="wp-image-1512067" srcset="https://blog.finxter.com/wp-content/uploads/2023/07/image-247-1024x571.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/07/image-247-300x167.png 300w, https://blog.finxter.com/wp-content/uploads/2023/07/image-247-768x428.png 768w, https://blog.finxter.com/wp-content/uploads/2023/07/image-247.png 1214w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></a></figure>
</div>


<p><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f9d1-200d-1f4bb.png" alt="🧑‍💻" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Recommended</strong>: <a href="https://blog.finxter.com/30-creative-autogpt-use-cases-to-make-money-online/" data-type="URL" data-id="https://blog.finxter.com/30-creative-autogpt-use-cases-to-make-money-online/" target="_blank" rel="noreferrer noopener">30 Creative AutoGPT Use Cases to Make Money Online</a></p>



<h2 class="wp-block-heading">Prompt Engineering with Python and OpenAI</h2>



<figure class="wp-block-image size-full"><a href="https://academy.finxter.com/university/prompt-engineering-with-python-and-openai/" target="_blank" rel="noreferrer noopener"><img loading="lazy" decoding="async" width="799" height="350" src="https://blog.finxter.com/wp-content/uploads/2023/06/image-288.png" alt="" class="wp-image-1463464" srcset="https://blog.finxter.com/wp-content/uploads/2023/06/image-288.png 799w, https://blog.finxter.com/wp-content/uploads/2023/06/image-288-300x131.png 300w, https://blog.finxter.com/wp-content/uploads/2023/06/image-288-768x336.png 768w" sizes="auto, (max-width: 799px) 100vw, 799px" /></a></figure>



<p>You can check out the whole <a href="https://academy.finxter.com/university/prompt-engineering-with-python-and-openai/" data-type="URL" data-id="https://academy.finxter.com/university/prompt-engineering-with-python-and-openai/" target="_blank" rel="noreferrer noopener">course on OpenAI Prompt Engineering using Python on the Finxter academy</a>. We cover topics such as:</p>



<ul class="wp-block-list">
<li>Embeddings</li>



<li>Semantic search</li>



<li>Web scraping</li>



<li>Query embeddings</li>



<li>Movie recommendation</li>



<li>Sentiment analysis</li>
</ul>



<p class="has-base-2-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f468-200d-1f4bb.png" alt="👨‍💻" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Academy</strong>: <a href="https://academy.finxter.com/university/prompt-engineering-with-python-and-openai/" data-type="URL" data-id="https://academy.finxter.com/university/prompt-engineering-with-python-and-openai/" target="_blank" rel="noreferrer noopener">Prompt Engineering with Python and OpenAI</a></p>



<p>The post <a href="https://blog.finxter.com/llama-2-how-metas-free-open-source-llm-beats-gpt-4/">Llama 2: How Meta&#8217;s Free Open-Source LLM Beats GPT-4!</a> appeared first on <a href="https://blog.finxter.com">Be on the Right Side of Change</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Claude 2 LLM Reads Ten Papers in One Prompt with Massive 200k Token Context</title>
		<link>https://blog.finxter.com/claude-2-read-ten-papers-in-one-prompt-with-massive-200k-token-context/</link>
		
		<dc:creator><![CDATA[Chris]]></dc:creator>
		<pubDate>Wed, 12 Jul 2023 09:35:49 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[ChatGPT]]></category>
		<category><![CDATA[Large Language Model (LLM)]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Prompt Engineering]]></category>
		<category><![CDATA[Research]]></category>
		<guid isPermaLink="false">https://blog.finxter.com/?p=1493779</guid>

					<description><![CDATA[<p>The new Claude 2 model from AI research company Anthropic has proved insane new capabilities. In this quick article, I&#8217;ll give you a short and concise overview of what you need to know. Claude 2 Overview Anthropic&#8217;s latest prodigy, Claude 2, is making waves. This AI language model, part of the Claude series, is a ... <a title="Claude 2 LLM Reads Ten Papers in One Prompt with Massive 200k Token Context" class="read-more" href="https://blog.finxter.com/claude-2-read-ten-papers-in-one-prompt-with-massive-200k-token-context/" aria-label="Read more about Claude 2 LLM Reads Ten Papers in One Prompt with Massive 200k Token Context">Read more</a></p>
<p>The post <a href="https://blog.finxter.com/claude-2-read-ten-papers-in-one-prompt-with-massive-200k-token-context/">Claude 2 LLM Reads Ten Papers in One Prompt with Massive 200k Token Context</a> appeared first on <a href="https://blog.finxter.com">Be on the Right Side of Change</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p>The new <a href="https://claude.ai/login" data-type="URL" data-id="https://claude.ai/login" target="_blank" rel="noreferrer noopener">Claude 2</a> model from AI research company <a href="https://www.anthropic.com/index/claude-2" data-type="URL" data-id="https://www.anthropic.com/index/claude-2" target="_blank" rel="noreferrer noopener">Anthropic</a> has demonstrated insane new capabilities. In this quick article, I&#8217;ll give you a short and concise overview of what you need to know.</p>



<h2 class="wp-block-heading">Claude 2 Overview</h2>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="817" height="817" src="https://blog.finxter.com/wp-content/uploads/2023/07/image-158.png" alt="" class="wp-image-1493866" srcset="https://blog.finxter.com/wp-content/uploads/2023/07/image-158.png 817w, https://blog.finxter.com/wp-content/uploads/2023/07/image-158-300x300.png 300w, https://blog.finxter.com/wp-content/uploads/2023/07/image-158-150x150.png 150w, https://blog.finxter.com/wp-content/uploads/2023/07/image-158-768x768.png 768w" sizes="auto, (max-width: 817px) 100vw, 817px" /></figure>
</div>


<p>Anthropic&#8217;s latest prodigy, Claude 2, is making waves. This AI language model, part of the Claude series, is a master of conversation, writing, editing, and more. It&#8217;s like having a personal assistant who can also code and provide advice on a myriad of subjects. Claude 2 is well-suited for creative and literary use cases, writing in a particular tone, voice, or personality.</p>



<p>In particular, it can do the following tasks:</p>



<ul class="wp-block-list">
<li>Search</li>



<li>Writing</li>



<li>Editing</li>



<li>Outlining</li>



<li>Summarizing</li>



<li>Coding</li>



<li>Advising</li>



<li>Educating</li>
</ul>



<p>It doesn&#8217;t yet search the web, but you can share large documents (e.g., PDFs) with it and interact with them, for example by asking specific questions or finding content within a document.</p>



<p>The quality of Claude 2 is quite good. On many standardized tests, such as grade-school math problem solving, Q&amp;A on very long stories, science questions, and reading comprehension, it performs better than humans:</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="655" height="383" src="https://blog.finxter.com/wp-content/uploads/2023/07/image-154.png" alt="" class="wp-image-1493847" srcset="https://blog.finxter.com/wp-content/uploads/2023/07/image-154.png 655w, https://blog.finxter.com/wp-content/uploads/2023/07/image-154-300x175.png 300w" sizes="auto, (max-width: 655px) 100vw, 655px" /></figure>
</div>


<p>Claude 2 is in the >90th percentile of verbal reasoning and analytical writing when compared to human students:</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="654" height="176" src="https://blog.finxter.com/wp-content/uploads/2023/07/image-155.png" alt="" class="wp-image-1493851" srcset="https://blog.finxter.com/wp-content/uploads/2023/07/image-155.png 654w, https://blog.finxter.com/wp-content/uploads/2023/07/image-155-300x81.png 300w" sizes="auto, (max-width: 654px) 100vw, 654px" /></figure>
</div>


<p>Claude 2 can also pass the Multistate Bar Examination (MBE) and the <a href="https://blog.finxter.com/can-chatgpt-pass-the-us-medical-licensing-test-usmle/" data-type="post" data-id="1490091" target="_blank" rel="noreferrer noopener">US Medical Licensing Examination (USMLE)</a> with a passing score (roughly 60% or more correct answers):</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="432" height="169" src="https://blog.finxter.com/wp-content/uploads/2023/07/image-156.png" alt="" class="wp-image-1493857" srcset="https://blog.finxter.com/wp-content/uploads/2023/07/image-156.png 432w, https://blog.finxter.com/wp-content/uploads/2023/07/image-156-300x117.png 300w" sizes="auto, (max-width: 432px) 100vw, 432px" /></figure>
</div>


<p>But here&#8217;s the most insane benefit: <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f447.png" alt="👇" class="wp-smiley" style="height: 1em; max-height: 1em;" /><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f92f.png" alt="🤯" class="wp-smiley" style="height: 1em; max-height: 1em;" /><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f680.png" alt="🚀" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>



<h2 class="wp-block-heading">Claude 2&#8217;s Long Context Data with up to 200k Tokens</h2>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="812" height="493" src="https://blog.finxter.com/wp-content/uploads/2023/07/image-153.png" alt="" class="wp-image-1493833" srcset="https://blog.finxter.com/wp-content/uploads/2023/07/image-153.png 812w, https://blog.finxter.com/wp-content/uploads/2023/07/image-153-300x182.png 300w, https://blog.finxter.com/wp-content/uploads/2023/07/image-153-768x466.png 768w" sizes="auto, (max-width: 812px) 100vw, 812px" /><figcaption class="wp-element-caption"><a href="https://www-files.anthropic.com/production/images/Model-Card-Claude-2.pdf" data-type="URL" data-id="https://www-files.anthropic.com/production/images/Model-Card-Claude-2.pdf" target="_blank" rel="noreferrer noopener"><em>Image creds</em></a></figcaption></figure>
</div>


<p class="has-global-color-8-background-color has-background">Claude 2 has been trained to have an expanded context window of <strong>200k tokens</strong> &#8212; and performance keeps improving with larger context sizes! 200k token context data is equivalent to roughly 150k words. So you can query Claude 2 with a small book PDF as context data! <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f92f.png" alt="🤯" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="817" height="817" src="https://blog.finxter.com/wp-content/uploads/2023/07/image-159.png" alt="" class="wp-image-1493867" srcset="https://blog.finxter.com/wp-content/uploads/2023/07/image-159.png 817w, https://blog.finxter.com/wp-content/uploads/2023/07/image-159-300x300.png 300w, https://blog.finxter.com/wp-content/uploads/2023/07/image-159-150x150.png 150w, https://blog.finxter.com/wp-content/uploads/2023/07/image-159-768x768.png 768w" sizes="auto, (max-width: 817px) 100vw, 817px" /></figure>
</div>


<p>Here are a few examples to help illustrate what 150k words might look like in real life:</p>



<ol class="wp-block-list">
<li><strong>Books</strong>: An average novel is around 80,000 to 100,000 words. So, 150,000 words would be equivalent to a long novel or perhaps a trilogy of shorter novels. For example, &#8220;Harry Potter and the Order of the Phoenix&#8221; by J.K. Rowling is over 257,000 words. So, 150,000 words would be a bit more than half of that book.</li>



<li><strong>Theses and Dissertations</strong>: A typical doctoral dissertation might be around 80,000 to 100,000 words. So, 150,000 words would be a particularly long and detailed dissertation or thesis. My own PhD Thesis on <a rel="noreferrer noopener" href="https://elib.uni-stuttgart.de/bitstream/11682/10574/1/dissertation_final_opus_christian_mayer.pdf" data-type="URL" data-id="https://elib.uni-stuttgart.de/bitstream/11682/10574/1/dissertation_final_opus_christian_mayer.pdf" target="_blank">distributed graph processing</a> was roughly 57k words long, so Claude 2 could process four years of work in one context window!</li>



<li><strong>Speeches</strong>: The average person speaks at around 125-150 words per minute. So, a speech of 150,000 words would last around 16 to 20 hours if delivered without breaks.</li>



<li><strong>Web Content</strong>: The average web page has around 500-1000 words. So, 150,000 words would be equivalent to the content of about 150-300 average web pages.</li>



<li><strong>Newspaper</strong>: The average newspaper article is around 500-800 words. So, 150,000 words would be equivalent to around 187-300 newspaper articles.</li>
</ol>
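<p>You can sanity-check these figures with the rough rule of thumb that one token corresponds to about three quarters of an English word (an approximation that varies with the tokenizer and the language):</p>

```python
# Back-of-envelope conversion between tokens and English words.
# The 3/4-words-per-token ratio is a common rule of thumb, not an
# official Anthropic figure -- treat results as rough estimates.
WORDS_PER_TOKEN = 0.75

def tokens_to_words(tokens: int) -> int:
    """Estimate how many English words fit in a token budget."""
    return int(tokens * WORDS_PER_TOKEN)

def words_to_tokens(words: int) -> int:
    """Estimate how many tokens a word count consumes."""
    return int(words / WORDS_PER_TOKEN)

# A 200k-token context window holds roughly 150k words...
print(tokens_to_words(200_000))  # → 150000
# ...which is about 2.6x a 57k-word PhD thesis.
print(round(tokens_to_words(200_000) / 57_000, 1))  # → 2.6
```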



<p>Claude 2 supports 100k-token context windows at launch, with the goal of increasing this limit later. Again: large context windows like this are a true game changer. Neural networks start to become mega brains that can &#8220;load&#8221; and process huge amounts of information at once. </p>



<p>Not only do these AI models already have huge amounts of base knowledge encoded into their weights (0-shot prompting), but you can now also load bigger and bigger amounts of application-specific information (200k contexts) to generate high-quality output.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="817" height="817" src="https://blog.finxter.com/wp-content/uploads/2023/07/image-160.png" alt="" class="wp-image-1493868" srcset="https://blog.finxter.com/wp-content/uploads/2023/07/image-160.png 817w, https://blog.finxter.com/wp-content/uploads/2023/07/image-160-300x300.png 300w, https://blog.finxter.com/wp-content/uploads/2023/07/image-160-150x150.png 150w, https://blog.finxter.com/wp-content/uploads/2023/07/image-160-768x768.png 768w" sizes="auto, (max-width: 817px) 100vw, 817px" /></figure>
</div>


<p>Here are some examples of how you can combine a mega-brain (<a rel="noreferrer noopener" href="https://blog.finxter.com/top-5-llm-python-libraries/" data-type="post" data-id="1472675" target="_blank">LLM</a>) with app-specific data (200k context query):</p>



<ul class="wp-block-list">
<li><strong>Legal Document Analysis</strong>: A mega brain AI with a large context window could be used to analyze lengthy legal documents, such as contracts or court transcripts. It could identify critical points, summarize content, and even provide insights on legal implications. This could be particularly useful for law firms and legal departments in corporations.</li>



<li><strong>Medical Research</strong>: In the field of medicine, there are often extensive research papers and clinical trial reports that need to be reviewed. An AI with a significant context window could read and summarize these documents. A medical AI researcher can use it to create new research by combining various papers in unique ways.</li>



<li><strong>Book Summarization and Analysis</strong>: An AI could read and summarize entire books for publishers or avid readers. It could provide plot summaries, character analyses, and themes. This could be useful for creating study guides or for readers trying to decide if they want to read a particular book.</li>



<li><strong>Historical Research</strong>: Historians often have to sift through extensive primary source documents. An AI with a large context window could help by reading through these documents and identifying key events, figures, and themes, saving researchers significant time.</li>
</ul>
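<p>In each of these applications, the first practical question is whether the source material fits into the context window at all. Here is a minimal, hypothetical chunking helper reusing the rough 0.75-words-per-token heuristic (a real pipeline would count tokens with the model&#8217;s actual tokenizer):</p>

```python
def chunk_text(text: str, max_tokens: int = 200_000,
               words_per_token: float = 0.75) -> list[str]:
    """Split a document into pieces that each fit a token budget.

    Hypothetical sketch: splits on whitespace and estimates tokens
    via the 0.75-words-per-token rule of thumb.
    """
    max_words = int(max_tokens * words_per_token)
    words = text.split()
    return [" ".join(words[i:i + max_words])
            for i in range(0, len(words), max_words)]

# Ten 15k-word papers (~150k words) fit into a single 200k-token chunk.
papers = " ".join(["word"] * 150_000)
print(len(chunk_text(papers)))  # → 1
```

<p>Anything longer than the budget simply spills over into a second chunk, which you would then feed to the model in a follow-up query.</p>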



<h2 class="wp-block-heading">Helpful, Honest, Harmless (HHH) Evaluation Framework</h2>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="817" height="817" src="https://blog.finxter.com/wp-content/uploads/2023/07/image-161.png" alt="" class="wp-image-1493869" srcset="https://blog.finxter.com/wp-content/uploads/2023/07/image-161.png 817w, https://blog.finxter.com/wp-content/uploads/2023/07/image-161-300x300.png 300w, https://blog.finxter.com/wp-content/uploads/2023/07/image-161-150x150.png 150w, https://blog.finxter.com/wp-content/uploads/2023/07/image-161-768x768.png 768w" sizes="auto, (max-width: 817px) 100vw, 817px" /></figure>
</div>


<p>Anthropic&#8217;s evaluation framework for their AI models, including Claude 2, is comprehensive and rigorous. It includes pre-deployment testing that assesses the model&#8217;s capabilities, safety, and alignment with ethical expectations.</p>



<p>Capabilities evaluations measure the model&#8217;s skills across various tasks, while safety and alignment evaluations assess potential risks and ethical conformity.</p>



<p>Red teaming is also employed, where independent teams attempt to exploit system vulnerabilities. The results are integrated into safety mitigations.</p>



<p>Anthropic collaborates with the Alignment Research Center (ARC) for safety audits and with external red teamers for Trust and Safety tests.</p>



<p>Human feedback is a crucial part of the evaluation process. Human preference data is used to calculate per-task Elo scores, a comparative performance metric that indicates how often a human evaluator prefers one model&#8217;s outputs over another.</p>
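<p>The model card doesn&#8217;t publish its scoring code, but the standard Elo formulation behind such per-task scores is easy to sketch (a generic illustration of Elo computed from human preference data, not Anthropic&#8217;s implementation):</p>

```python
def elo_expected(rating_a: float, rating_b: float) -> float:
    """Probability that model A's output is preferred over model B's,
    under the standard Elo model (logistic with a 400-point scale)."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400))

def elo_update(rating_a: float, rating_b: float,
               a_won: bool, k: float = 32.0) -> tuple[float, float]:
    """Update both ratings after one human preference judgment."""
    expected_a = elo_expected(rating_a, rating_b)
    score_a = 1.0 if a_won else 0.0
    new_a = rating_a + k * (score_a - expected_a)
    new_b = rating_b + k * ((1 - score_a) - (1 - expected_a))
    return new_a, new_b

# Two equally rated models: either win is a coin flip.
print(elo_expected(1000, 1000))  # → 0.5
# A 100-point gap means A is preferred ~64% of the time.
print(round(elo_expected(1100, 1000), 2))  # → 0.64
```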



<p>Here&#8217;s such an evaluation from the <a href="https://www-files.anthropic.com/production/images/Model-Card-Claude-2.pdf" data-type="URL" data-id="https://www-files.anthropic.com/production/images/Model-Card-Claude-2.pdf" target="_blank" rel="noreferrer noopener">paper</a> (higher is better):</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="807" height="497" src="https://blog.finxter.com/wp-content/uploads/2023/07/image-151.png" alt="" class="wp-image-1493818" srcset="https://blog.finxter.com/wp-content/uploads/2023/07/image-151.png 807w, https://blog.finxter.com/wp-content/uploads/2023/07/image-151-300x185.png 300w, https://blog.finxter.com/wp-content/uploads/2023/07/image-151-768x473.png 768w" sizes="auto, (max-width: 807px) 100vw, 807px" /></figure>
</div>


<p>Here&#8217;s another alignment evaluation (lower is better):</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="602" height="493" src="https://blog.finxter.com/wp-content/uploads/2023/07/image-152.png" alt="" class="wp-image-1493821" srcset="https://blog.finxter.com/wp-content/uploads/2023/07/image-152.png 602w, https://blog.finxter.com/wp-content/uploads/2023/07/image-152-300x246.png 300w" sizes="auto, (max-width: 602px) 100vw, 602px" /></figure>
</div>


<p>All in all, an extremely impressive performance and value proposition. The large context windows, high prompting quality, and low bias of Anthropic&#8217;s LLM Claude 2 provide one more milestone on the insane journey we&#8217;re currently on.</p>



<p>Compare this to where we were only a year ago in AI research. The future is bright! <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f31e.png" alt="🌞" class="wp-smiley" style="height: 1em; max-height: 1em;" /> </p>



<p>Join us and stay on the right side of change with our daily email updates and the latest research and cheat sheets (>150k coders):</p>



<p>The post <a href="https://blog.finxter.com/claude-2-read-ten-papers-in-one-prompt-with-massive-200k-token-context/">Claude 2 LLM Reads Ten Papers in One Prompt with Massive 200k Token Context</a> appeared first on <a href="https://blog.finxter.com">Be on the Right Side of Change</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>ChatGPT Code Interpreter Is Out</title>
		<link>https://blog.finxter.com/chatgpt-code-interpreter-is-out/</link>
		
		<dc:creator><![CDATA[Chris]]></dc:creator>
		<pubDate>Sun, 09 Jul 2023 11:53:58 +0000</pubDate>
				<category><![CDATA[ChatGPT]]></category>
		<category><![CDATA[Large Language Model (LLM)]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[OpenAI]]></category>
		<category><![CDATA[Technology]]></category>
		<guid isPermaLink="false">https://blog.finxter.com/?p=1487893</guid>

					<description><![CDATA[<p>ChatGPT Plus users can now use the &#8220;Code Interpreter&#8221; in Beta mode! If you cannot see it but you&#8217;re already a Plus member, you should activate it in the settings: OpenAI has just launched a novel and experimental feature for its celebrated language model, ChatGPT, called the &#8220;Code Interpreter&#8221;. This addition allows the model to ... <a title="ChatGPT Code Interpreter Is Out" class="read-more" href="https://blog.finxter.com/chatgpt-code-interpreter-is-out/" aria-label="Read more about ChatGPT Code Interpreter Is Out">Read more</a></p>
<p>The post <a href="https://blog.finxter.com/chatgpt-code-interpreter-is-out/">ChatGPT Code Interpreter Is Out</a> appeared first on <a href="https://blog.finxter.com">Be on the Right Side of Change</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p>ChatGPT Plus users can now use the &#8220;Code Interpreter&#8221; in Beta mode!</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="991" height="1024" src="https://blog.finxter.com/wp-content/uploads/2023/07/image-124-991x1024.png" alt="" class="wp-image-1487896" srcset="https://blog.finxter.com/wp-content/uploads/2023/07/image-124-991x1024.png 991w, https://blog.finxter.com/wp-content/uploads/2023/07/image-124-290x300.png 290w, https://blog.finxter.com/wp-content/uploads/2023/07/image-124-768x794.png 768w, https://blog.finxter.com/wp-content/uploads/2023/07/image-124.png 1200w" sizes="auto, (max-width: 991px) 100vw, 991px" /></figure>



<p>If you cannot see it but you&#8217;re already a Plus member, you should activate it in the settings:</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="794" src="https://blog.finxter.com/wp-content/uploads/2023/07/image-125-1024x794.png" alt="" class="wp-image-1487897" srcset="https://blog.finxter.com/wp-content/uploads/2023/07/image-125-1024x794.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/07/image-125-300x233.png 300w, https://blog.finxter.com/wp-content/uploads/2023/07/image-125-768x595.png 768w, https://blog.finxter.com/wp-content/uploads/2023/07/image-125.png 1113w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</div>


<p>OpenAI has just launched a novel and experimental feature for its celebrated language model, ChatGPT, called the &#8220;Code Interpreter&#8221;. </p>



<p>This addition allows the model to utilize Python, manage file uploads and downloads, and perform tasks in a persistent, sandboxed environment. This powerful upgrade promises to open new horizons in user interaction, making the most of AI&#8217;s potential in both computation and communication.</p>



<h2 class="wp-block-heading">The Power of Python in Chat</h2>



<p>At its core, the Code Interpreter equips ChatGPT with a Python interpreter, creating a dynamic platform for computational dialogue. This mechanism functions in a secure and firewalled execution environment, ensuring user safety while providing the opportunity for live Python coding sessions.</p>



<p>Code executed through the interpreter plugin is evaluated in a session that persists for the duration of the chat conversation, allowing subsequent calls to build on previous computations. This continuity brings a sense of flow to the interactions, transforming the typical one-off, stateless chatbot experience into an engaging, continuous conversation.</p>
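<p>Conceptually, this persistence behaves like executing every snippet in one shared Python namespace, so later calls can see earlier results. Here is a simplified, self-contained sketch of that idea (not OpenAI&#8217;s actual sandbox implementation):</p>

```python
# Simulating a persistent session: every "message" runs in the same
# namespace, so names defined earlier remain available later.
session = {}

def run_in_session(code: str) -> None:
    """Execute a snippet in the shared session namespace."""
    exec(code, session)

run_in_session("data = [3, 1, 2]")   # first message defines data
run_in_session("data.sort()")        # second message mutates it
run_in_session("result = data[-1]")  # third message reads the result
print(session["result"])  # → 3
```

<p>Each call to <code>run_in_session()</code> plays the role of one chat message; the real Code Interpreter additionally sandboxes and firewalls this shared state.</p>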



<h2 class="wp-block-heading">File Handling Capabilities</h2>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="1007" src="https://blog.finxter.com/wp-content/uploads/2023/07/code_interpreter-1024x1007.gif" alt="" class="wp-image-1487898" srcset="https://blog.finxter.com/wp-content/uploads/2023/07/code_interpreter-1024x1007.gif 1024w, https://blog.finxter.com/wp-content/uploads/2023/07/code_interpreter-300x295.gif 300w, https://blog.finxter.com/wp-content/uploads/2023/07/code_interpreter-768x755.gif 768w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p>Beyond Python code execution, the Code Interpreter also supports file uploads and downloads. Users can upload files to the conversation workspace and receive the results of their work as downloadable content. This feature expands the application scope of ChatGPT into areas such as data analysis, file format conversion, and even content generation.</p>



<h2 class="wp-block-heading">Real-World Applications</h2>



<p>A primary aim of the <a href="https://openai.com/blog/chatgpt-plugins#code-interpreter" data-type="URL" data-id="https://openai.com/blog/chatgpt-plugins#code-interpreter" target="_blank" rel="noreferrer noopener">Code Interpreter</a> is to make programming more accessible and its benefits more widely experienced. It&#8217;s like having an enthusiastic junior programmer at your fingertips, ready to swiftly execute commands, solve problems, and assist with tasks that would usually require a significant coding effort.</p>



<p>Initial user studies have identified several key areas where the Code Interpreter shines:</p>



<ul class="wp-block-list">
<li><strong>Mathematical Problem Solving</strong>: The ability to handle complex mathematical operations makes the Code Interpreter a valuable tool for both quantitative and qualitative problem solving. For instance, users can solve systems of equations directly within the chat.</li>



<li><strong>Data Analysis and Visualization</strong>: With the power of Python&#8217;s vast scientific libraries, users can perform sophisticated data analysis and create compelling visualizations, all within the chat environment.</li>



<li><strong>File Format Conversion</strong>: The Code Interpreter can read, write, and convert between a variety of file formats, simplifying tasks that would usually require specialized software or extensive coding knowledge.</li>
</ul>
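<p>To get a feel for the file format conversion use case, here is the kind of small, standard-library-only snippet the interpreter might generate and run when asked to turn an uploaded CSV file into JSON (an illustrative sketch, not captured Code Interpreter output):</p>

```python
import csv
import io
import json

def csv_to_json(csv_text: str) -> str:
    """Convert CSV text into a JSON array of row objects using only
    the Python standard library -- the kind of small conversion task
    you might hand to the Code Interpreter."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    return json.dumps(rows, indent=2)

print(csv_to_json("name,score\nAlice,90\nBob,85\n"))
```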



<p>With the introduction of the Code Interpreter, ChatGPT is no longer just a text-based AI model, but a powerful and interactive computational tool. By blending Python&#8217;s versatility with ChatGPT&#8217;s conversational prowess, OpenAI has opened the door to a whole new realm of possibilities. We look forward to seeing what the ChatGPT community will achieve with this groundbreaking feature. </p>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="ChatGPT just leveled up big time..." width="937" height="527" src="https://www.youtube.com/embed/p6Yw0Bx5dbw?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</div></figure>
<p>The post <a href="https://blog.finxter.com/chatgpt-code-interpreter-is-out/">ChatGPT Code Interpreter Is Out</a> appeared first on <a href="https://blog.finxter.com">Be on the Right Side of Change</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>OpenAI Gym Quickstart</title>
		<link>https://blog.finxter.com/openai-gym-quickstart/</link>
		
		<dc:creator><![CDATA[Emily Rosemary Collins]]></dc:creator>
		<pubDate>Tue, 04 Jul 2023 08:21:00 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Large Language Model (LLM)]]></category>
		<category><![CDATA[Machine Learning]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[OpenAI]]></category>
		<category><![CDATA[Python]]></category>
		<guid isPermaLink="false">https://blog.finxter.com/?p=1477602</guid>

					<description><![CDATA[<p>OpenAI Gym Overview 💡 OpenAI Gym is a powerful toolkit designed for developing and comparing reinforcement learning algorithms. By offering a standard API to communicate between learning algorithms and environments, Gym facilitates the creation of diverse, tunable, and reproducible benchmarking suites for a broad range of tasks. The availability of different environments, varying in difficulty, ... <a title="OpenAI Gym Quickstart" class="read-more" href="https://blog.finxter.com/openai-gym-quickstart/" aria-label="Read more about OpenAI Gym Quickstart">Read more</a></p>
<p>The post <a href="https://blog.finxter.com/openai-gym-quickstart/">OpenAI Gym Quickstart</a> appeared first on <a href="https://blog.finxter.com">Be on the Right Side of Change</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<h2 class="wp-block-heading">OpenAI Gym Overview</h2>



<p class="has-global-color-8-background-color has-background"><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f4a1.png" alt="💡" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>OpenAI Gym</strong> is a powerful toolkit designed for developing and comparing reinforcement learning algorithms. By offering a standard API to communicate between learning algorithms and environments, Gym facilitates the creation of diverse, tunable, and reproducible benchmarking suites for a broad range of tasks. The availability of different environments, varying in difficulty, makes it convenient for both beginners and experts to develop and test their reinforcement learning models. Moreover, OpenAI Gym&#8217;s adoption across the research community effectively sets the foundation for defining evaluation standards.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="600" height="400" src="https://blog.finxter.com/wp-content/uploads/2023/07/image.gif" alt="" class="wp-image-1477617"/><figcaption class="wp-element-caption"><a href="https://gymnasium.farama.org/" data-type="URL" data-id="https://gymnasium.farama.org/" target="_blank" rel="noreferrer noopener">Image source</a></figcaption></figure>
</div>


<p>The <a href="https://github.com/openai/gym" target="_blank" rel="noreferrer noopener">OpenAI Gym repository</a> on GitHub houses the source code and is actively maintained and updated by the development team and community members. This ongoing collaboration has fostered the growth of Gym&#8217;s functionality and versatility. It is easy to install and is compatible with Python. <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f40d.png" alt="🐍" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>



<p>Gym&#8217;s API is a standard in reinforcement learning and provides an efficient way for algorithms to interact with various environments. These environments cover a wide range of RL problems, making OpenAI Gym suitable for those who are new to the field or those working on advanced projects. <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f9e0.png" alt="🧠" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>



<p>The <a rel="noreferrer noopener" href="https://www.gymlibrary.dev/" target="_blank">environments</a> available within the gym are a mixture of specifications designed for different reinforcement learning tasks. These tasks could include anything from simple toy problems to more complex simulations. By offering a diverse collection, it encourages innovation and creativity within the reinforcement learning community. <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f680.png" alt="🚀" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>



<h2 class="wp-block-heading">Installation and Setup</h2>



<p>To begin working with OpenAI Gym, you&#8217;ll first need to install the package. Make sure you have <code>Python</code> and <code>pip</code> installed on your computer. You can install the base Gym library using the following command: <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f604.png" alt="😄" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>



<pre class="EnlighterJSRAW" data-enlighter-language="generic" data-enlighter-theme="" data-enlighter-highlight="" data-enlighter-linenumbers="" data-enlighter-lineoffset="" data-enlighter-title="" data-enlighter-group="">pip install gym
</pre>



<p>This will give you access to the core functionalities and some environments. </p>



<p>However, OpenAI Gym has additional environments that you may want to work with, which require extra dependencies. For instance, to install the Atari environments, you&#8217;ll need to run:</p>



<pre class="EnlighterJSRAW" data-enlighter-language="generic" data-enlighter-theme="" data-enlighter-highlight="" data-enlighter-linenumbers="" data-enlighter-lineoffset="" data-enlighter-title="" data-enlighter-group="">pip install 'gym[atari]'
</pre>



<p>If you&#8217;d like to explore the Box2D environments, use the following command:</p>



<pre class="EnlighterJSRAW" data-enlighter-language="generic" data-enlighter-theme="" data-enlighter-highlight="" data-enlighter-linenumbers="" data-enlighter-lineoffset="" data-enlighter-title="" data-enlighter-group="">pip install 'gym[box2d]'
</pre>



<p>You can also install specific versions of OpenAI Gym, in case you need to work with a particular release. To install a specific version, you would execute:</p>



<pre class="EnlighterJSRAW" data-enlighter-language="generic" data-enlighter-theme="" data-enlighter-highlight="" data-enlighter-linenumbers="" data-enlighter-lineoffset="" data-enlighter-title="" data-enlighter-group="">pip install gym==&lt;version_number>
</pre>



<p>Keep in mind that the <a href="https://github.com/openai/gym" target="_blank" rel="noreferrer noopener">GitHub repository</a> of the project contains useful resources, examples, and the latest updates. If you want to install the library directly from the repository, you can use:</p>



<pre class="EnlighterJSRAW" data-enlighter-language="generic" data-enlighter-theme="" data-enlighter-highlight="" data-enlighter-linenumbers="" data-enlighter-lineoffset="" data-enlighter-title="" data-enlighter-group="">pip install git+https://github.com/openai/gym.git
</pre>



<p>Once you&#8217;ve installed OpenAI Gym, verify the installation and find the version you&#8217;ve installed by running this Python snippet:</p>



<pre class="EnlighterJSRAW" data-enlighter-language="generic" data-enlighter-theme="" data-enlighter-highlight="" data-enlighter-linenumbers="" data-enlighter-lineoffset="" data-enlighter-title="" data-enlighter-group="">import gym
print(gym.__version__)
</pre>



<p>Now, you&#8217;re all set to start exploring the exciting world of reinforcement learning with OpenAI Gym! <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f680.png" alt="🚀" class="wp-smiley" style="height: 1em; max-height: 1em;" /> Happy coding!</p>



<h2 class="wp-block-heading">Key Concepts and Terminology</h2>



<p>In the context of OpenAI Gym, it&#8217;s essential to grasp a few key concepts and terms to help you understand reinforcement learning (RL) better.</p>



<p>An <strong>agent</strong> is the entity performing actions in the environment, aiming to achieve specific goals. An <strong>action</strong> is the step the agent takes in a given state in the environment. The <strong>reward</strong> is a signal received by the agent after performing an action, which helps gauge the performance of the agent. An <strong>observation</strong> represents the current state of the environment at a given time step.</p>



<p>The term <strong>action_space</strong> denotes the set of possible actions an agent can take in a given environment. There are two primary types of action spaces &#8211; <strong>Discrete</strong> and <strong>Box</strong>. A <strong>discrete</strong> action space contains a finite number of unique actions, while a <strong>box</strong> action space represents a continuous range of actions, often in higher-dimensional spaces.</p>



<p>In OpenAI Gym, the <strong><code>reset()</code></strong> function initializes the environment and returns the initial observation. The <strong><code>step()</code></strong> function is used to execute an action and returns the observation, reward, and other information after the action is performed. Once an episode is <strong>terminated</strong> or <strong>done</strong>, the environment should be reset for the agent to start a new episode.</p>



<p>The <strong><code>seed()</code></strong> function sets the random seed for the environment to ensure reproducibility of results. Some environments may include <strong>noop</strong> (no-operation) and <strong>fire</strong> actions to standardize the starting conditions of different games.</p>



<p>By understanding these fundamental concepts and terminology, you&#8217;ll be better equipped to work with OpenAI Gym and reinforcement learning in general. Remember to apply these concepts practically and experiment with different RL algorithms to enhance your learning experience. <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f680.png" alt="🚀" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>
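<p>These concepts can be made concrete with a tiny hand-rolled environment that mimics the classic Gym interface. The following <code>CountdownEnv</code> is a hypothetical toy for illustration only and does not depend on the <code>gym</code> package:</p>

```python
import random

class CountdownEnv:
    """Toy environment following the classic Gym API (reset/step/seed).

    The agent starts at 10 and must reach 0; action 0 is a noop,
    action 1 decrements the counter. Reaching 0 yields reward 1.0
    and terminates the episode.
    """
    def __init__(self):
        self.action_space = [0, 1]  # discrete: noop or decrement
        self.state = None

    def seed(self, value: int) -> None:
        random.seed(value)  # reproducibility hook (unused in this toy)

    def reset(self):
        self.state = 10
        return self.state  # initial observation

    def step(self, action: int):
        assert action in self.action_space
        self.state -= action
        reward = 1.0 if self.state == 0 else 0.0
        done = self.state == 0
        return self.state, reward, done, {}  # obs, reward, done, info

env = CountdownEnv()
obs = env.reset()
total_reward = 0.0
done = False
while not done:
    obs, reward, done, info = env.step(1)  # always decrement
    total_reward += reward
print(obs, total_reward)  # → 0 1.0
```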



<h2 class="wp-block-heading">Working with Environments</h2>



<p>OpenAI Gym offers various environments to train and test reinforcement learning algorithms. The library provides a rich selection of environments, including classic control tasks, toy text, Atari games, and more complex environments like MuJoCo <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f579.png" alt="🕹" class="wp-smiley" style="height: 1em; max-height: 1em;" />.</p>



<p>By interacting with these environments, your agent can learn how to solve different tasks effectively. For instance, the <strong>Arcade Learning Environment (ALE)</strong> includes a collection of Atari games for researching reinforcement learning techniques on 2D environments <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f3ae.png" alt="🎮" class="wp-smiley" style="height: 1em; max-height: 1em;" />. To use Atari environments, simply install gym with <code>gym[atari]</code> or <code>gym[all]</code> for all the available environments.</p>



<p><code>pip install 'gym[atari]'</code></p>



<p>Another challenging and popular category of environments is the <strong>MuJoCo</strong>-based tasks. MuJoCo, short for Multi-Joint dynamics with Contact, is a powerful physics engine that accurately simulates complex environments with robots and articulated bodies <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f4aa.png" alt="💪" class="wp-smiley" style="height: 1em; max-height: 1em;" /><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f916.png" alt="🤖" class="wp-smiley" style="height: 1em; max-height: 1em;" />. To use MuJoCo environments, you need to install the <code>mujoco-py</code> package alongside gym.</p>



<p><code>pip install mujoco-py</code></p>



<p>Creating an environment instance is as simple as specifying the desired environment&#8217;s ID:</p>



<pre class="EnlighterJSRAW" data-enlighter-language="generic" data-enlighter-theme="" data-enlighter-highlight="" data-enlighter-linenumbers="" data-enlighter-lineoffset="" data-enlighter-title="" data-enlighter-group="">import gym

env = gym.make("Environment_ID")
</pre>



<p>The <code>reset()</code> method initializes the environment and returns the initial observation together with an info dictionary, while <code>step(action)</code> lets the agent take an action and returns the new observation, the reward, the termination and truncation flags, and an info dictionary (the interface used by recent Gym releases):</p>



<pre class="EnlighterJSRAW" data-enlighter-language="generic" data-enlighter-theme="" data-enlighter-highlight="" data-enlighter-linenumbers="" data-enlighter-lineoffset="" data-enlighter-title="" data-enlighter-group="">observation, info = env.reset()

observation, reward, terminated, truncated, info = env.step(action)
</pre>



<p>You can also render the environment to visualize the agent&#8217;s progress using the <code>render()</code> method (in recent Gym versions, the render mode is chosen when the environment is created via the <code>render_mode</code> argument):</p>



<pre class="EnlighterJSRAW" data-enlighter-language="generic" data-enlighter-theme="" data-enlighter-highlight="" data-enlighter-linenumbers="" data-enlighter-lineoffset="" data-enlighter-title="" data-enlighter-group="">env.render()
</pre>



<p>Overall, OpenAI Gym enables researchers and enthusiasts to experiment with various environments, from 2D Atari games <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f579.png" alt="🕹" class="wp-smiley" style="height: 1em; max-height: 1em;" /> to complex MuJoCo-powered simulations <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f916.png" alt="🤖" class="wp-smiley" style="height: 1em; max-height: 1em;" />. With a confident and clear understanding of these environments, you can explore the full potential of reinforcement learning!</p>



<h2 class="wp-block-heading">Running and Rendering an Environment</h2>



<p>With OpenAI Gym, you can easily create and run environments for reinforcement learning algorithms. The <strong>gym</strong> package allows you to create an environment and interact with it using a simple and clear interface. To render the environment, you can use the <code>render</code> method provided by the Gym library. The render mode &#8220;human&#8221; allows you to visualize your agent&#8217;s actions as they are happening <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f5a5.png" alt="🖥" class="wp-smiley" style="height: 1em; max-height: 1em;" />.</p>



<p>To use the desired environment, you first need to import the <strong>gym</strong> package and create an instance of the environment. For example, if you want to use the LunarLander-v2 environment, you can create an instance using the <code>make</code> function. Here is a code snippet to demonstrate this:</p>



<pre class="EnlighterJSRAW" data-enlighter-language="generic" data-enlighter-theme="" data-enlighter-highlight="" data-enlighter-linenumbers="" data-enlighter-lineoffset="" data-enlighter-title="" data-enlighter-group="">import gym

env = gym.make("LunarLander-v2", render_mode="human")
observation, info = env.reset(seed=42)
</pre>



<p>Once you have an environment instance, you can run it using a loop that iterates over a specified number of steps. During each step, your agent takes an action based on its policy, a user-defined function. The environment&#8217;s <code>step</code> method takes the action and returns the new observation, the reward, termination and truncation flags, and an info dictionary. Because the environment was created with <code>render_mode="human"</code>, each step is rendered automatically:</p>



<pre class="EnlighterJSRAW" data-enlighter-language="generic" data-enlighter-theme="" data-enlighter-highlight="" data-enlighter-linenumbers="" data-enlighter-lineoffset="" data-enlighter-title="" data-enlighter-group="">for _ in range(1000):
    action = policy(observation)  # User-defined policy function
    observation, reward, terminated, truncated, info = env.step(action)

    if terminated or truncated:
        observation, info = env.reset()

env.close()
</pre>



<p>When working with OpenAI Gym, you might encounter the <strong>truncated</strong> flag. If set to <code>True</code>, it indicates that the episode was cut short for a reason outside the task itself, typically because the environment reached its maximum step limit, and that the environment needs to be reset before the agent can continue learning <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f504.png" alt="🔄" class="wp-smiley" style="height: 1em; max-height: 1em;" />.</p>
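

<p>The behaviour behind this flag can be sketched in plain Python. The <code>TimeLimitDemo</code> class below is an illustrative stand-in for Gym&#8217;s built-in time-limit handling, not its actual implementation:</p>



<pre class="EnlighterJSRAW" data-enlighter-language="generic" data-enlighter-theme="" data-enlighter-highlight="" data-enlighter-linenumbers="" data-enlighter-lineoffset="" data-enlighter-title="" data-enlighter-group="">class TimeLimitDemo:
    """Sketch of time-limit behaviour: after max_steps the episode
    is truncated even though the task itself never terminates."""

    def __init__(self, max_steps=3):
        self.max_steps = max_steps
        self.elapsed = 0

    def reset(self):
        self.elapsed = 0
        return 0  # dummy observation

    def step(self, action):
        self.elapsed += 1
        terminated = False  # the task itself never ends here
        truncated = self.elapsed >= self.max_steps
        return 0, 0.0, terminated, truncated, {}

env = TimeLimitDemo(max_steps=3)
env.reset()
flags = [env.step(0)[2:4] for _ in range(3)]  # (terminated, truncated) per step
print(flags)  # [(False, False), (False, False), (False, True)]
</pre>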



<h2 class="wp-block-heading">Using Wrappers and Modifying Environments</h2>



<p>OpenAI Gym provides a convenient way to modify existing environments without altering the underlying code directly. This is achieved through the use of <strong>wrappers</strong>. Wrappers allow you to make your environments more modular, and also avoid boilerplate code that can clutter your implementation <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f604.png" alt="😄" class="wp-smiley" style="height: 1em; max-height: 1em;" />. They can be chained together to combine their effects, making them very powerful when it comes to adjusting environments to your needs.</p>



<p>In a <strong>Pythonic</strong> way, you can create a custom wrapper by subclassing <code>gym.Wrapper</code> and overriding its methods. For instance, you may want to preprocess observations, modify rewards, or change actions. To do this, you would simply extend the existing environment, implement your custom behavior in your wrapper, and then apply it to the environment.</p>



<p>Here&#8217;s a basic example:</p>



<pre class="EnlighterJSRAW" data-enlighter-language="generic" data-enlighter-theme="" data-enlighter-highlight="" data-enlighter-linenumbers="" data-enlighter-lineoffset="" data-enlighter-title="" data-enlighter-group="">import gym

class CustomWrapper(gym.Wrapper):
    def __init__(self, env):
        super().__init__(env)

    def step(self, action):
        observation, reward, terminated, truncated, info = self.env.step(action)
        # Modify reward or observation here
        return observation, reward, terminated, truncated, info

    def reset(self, **kwargs):
        observation, info = self.env.reset(**kwargs)
        # Modify initial observation if necessary
        return observation, info

# Usage
env = gym.make('CartPole-v1')
wrapped_env = CustomWrapper(env)
</pre>



<p>As a developer, using wrappers allows you to focus on the core logic of your algorithms while mitigating the risk of introducing bugs due to altering the original environment. Many <strong>maintainers</strong> of OpenAI Gym environments appreciate this approach, as it promotes code reusability and ensures compatibility with other projects that also rely on these environments.</p>
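

<p>The chaining idea mentioned above can be demonstrated without Gym at all. The <code>ScaleReward</code> and <code>ClipReward</code> classes below are illustrative sketches of how wrappers compose, not actual Gym wrappers:</p>



<pre class="EnlighterJSRAW" data-enlighter-language="generic" data-enlighter-theme="" data-enlighter-highlight="" data-enlighter-linenumbers="" data-enlighter-lineoffset="" data-enlighter-title="" data-enlighter-group="">class BaseEnv:
    """Minimal stand-in environment with a fixed reward."""
    def step(self, action):
        return 0, 1.0, False, False, {}  # obs, reward, terminated, truncated, info

class ScaleReward:
    """Wrapper sketch: multiply the reward by a constant factor."""
    def __init__(self, env, factor):
        self.env, self.factor = env, factor
    def step(self, action):
        obs, reward, terminated, truncated, info = self.env.step(action)
        return obs, reward * self.factor, terminated, truncated, info

class ClipReward:
    """Wrapper sketch: clip the reward into [-1, 1]."""
    def __init__(self, env):
        self.env = env
    def step(self, action):
        obs, reward, terminated, truncated, info = self.env.step(action)
        return obs, max(-1.0, min(1.0, reward)), terminated, truncated, info

# Wrappers compose from the inside out, just like chained gym.Wrapper objects:
env = ClipReward(ScaleReward(BaseEnv(), factor=10.0))
print(env.step(0)[1])  # 1.0 -- the reward is scaled to 10.0, then clipped to 1.0
</pre>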



<h2 class="wp-block-heading">Monitoring and Benchmarking Agent Performance</h2>



<p>In the field of reinforcement learning, monitoring and benchmarking agent performance is essential for comparing different algorithms and validating their effectiveness. The <strong>OpenAI Gym</strong> is a popular toolkit that provides a consistent and straightforward approach to benchmark agent performance across a variety of environments.</p>



<p>Benchmark results from OpenAI Gym allow researchers to compare the performance of various algorithms more easily. Consequently, this tool has been extensively used in <a href="https://arxiv.org/abs/1606.01540" target="_blank" rel="noreferrer noopener">papers</a> and research for evaluating reinforcement learning methods. By maintaining a consistent interface and offering a wide range of benchmark problems, OpenAI Gym enables accurate comparisons of agent performance.</p>



<p><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f4c8.png" alt="📈" class="wp-smiley" style="height: 1em; max-height: 1em;" /> One notable advantage of using OpenAI Gym for benchmarking is the availability of diverse environments. These environments range from classic control tasks to more complex ones involving robots and video games. Thus, researchers can investigate how well their algorithms generalize across different tasks and applications.</p>



<p>To easily evaluate and compare agent performance, OpenAI Gym provides a <a href="https://github.com/openai/gym/wiki/Leaderboard" target="_blank" rel="noreferrer noopener">leaderboard</a> for tracking various algorithm implementations. This leaderboard includes problem-specific statistics and forms the basis for numerous performance analyses in the reinforcement learning community.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="1010" height="630" src="https://blog.finxter.com/wp-content/uploads/2023/07/image-23.png" alt="" class="wp-image-1477608" srcset="https://blog.finxter.com/wp-content/uploads/2023/07/image-23.png 1010w, https://blog.finxter.com/wp-content/uploads/2023/07/image-23-300x187.png 300w, https://blog.finxter.com/wp-content/uploads/2023/07/image-23-768x479.png 768w" sizes="auto, (max-width: 1010px) 100vw, 1010px" /></figure>
</div>


<p>Lastly, it is crucial to consider the selected metrics when benchmarking agent performance with OpenAI Gym. Metrics such as cumulative rewards, episode lengths, and training iterations can provide valuable insights into an algorithm&#8217;s effectiveness and efficiency.</p>
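

<p>As a rough sketch, such metrics can be computed from a recorded list of per-step rewards. The <code>episode_metrics()</code> helper below is illustrative, not part of Gym:</p>



<pre class="EnlighterJSRAW" data-enlighter-language="generic" data-enlighter-theme="" data-enlighter-highlight="" data-enlighter-linenumbers="" data-enlighter-lineoffset="" data-enlighter-title="" data-enlighter-group="">def episode_metrics(rewards, gamma=0.99):
    """Summary statistics commonly tracked when benchmarking agents."""
    cumulative = sum(rewards)
    discounted = sum(r * gamma ** t for t, r in enumerate(rewards))
    return {"episode_length": len(rewards),
            "cumulative_reward": cumulative,
            "discounted_return": discounted}

metrics = episode_metrics([1.0, 0.0, 2.0], gamma=0.5)
print(metrics["episode_length"])     # 3
print(metrics["cumulative_reward"])  # 3.0
print(metrics["discounted_return"])  # 1.0 + 0.0 + 2.0 * 0.25 = 1.5
</pre>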



<h2 class="wp-block-heading">Popular Algorithms and Libraries</h2>



<p>When it comes to reinforcement learning, OpenAI Gym is a popular toolkit that provides a standardized environment for developing and comparing algorithms. Alongside OpenAI Gym, several other libraries, frameworks, and algorithms are widely used by researchers and developers to create AI agents that learn from their experiences.</p>



<p>One such library that has gained significant popularity is <strong><a href="https://blog.finxter.com/tensorflow-developer-income-and-opportunity/" data-type="post" data-id="259596" target="_blank" rel="noreferrer noopener">TensorFlow</a></strong>. Originally developed by the researchers and engineers at Google Brain, TensorFlow is an open-source machine learning framework supporting various algorithms and models. Compatible with <strong>Python 3.7</strong> and newer, the library allows developers to harness the power of reinforcement learning, neural networks, and other machine learning techniques in their projects <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f9e0.png" alt="🧠" class="wp-smiley" style="height: 1em; max-height: 1em;" />.</p>



<p>Another popular framework in the field is <strong><a href="https://blog.finxter.com/keras-developer-income-and-opportunity/" data-type="post" data-id="257517" target="_blank" rel="noreferrer noopener">Keras</a></strong>. Keras is an open-source neural network library running on top of TensorFlow. Its user-friendly nature combined with modularity and ease of extensibility make it an excellent choice for beginners and experts alike. Keras enables developers to quickly experiment and iterate on different neural network architectures, supporting the implementation of a wide variety of reinforcement learning algorithms.</p>



<p>In addition to these versatile libraries, the world of reinforcement learning also features various specialized libraries addressing specific needs. One example is <strong>Stable Baselines</strong>, a set of improved implementations of existing reinforcement learning algorithms. Built using TensorFlow, Stable Baselines facilitate the training and deployment of reliable agents with minimum effort, while maintaining compatibility with OpenAI Gym.</p>



<p><strong>RLlib</strong> is another library, offering a scalable solution for reinforcement learning. Developed by the team at RISE Lab, RLlib is built on top of Ray, a high-performance distributed computing system. This library natively supports TensorFlow and Keras, enabling parallel and distributed training, which is particularly useful for tackling more complex reinforcement learning problems with larger state spaces and action possibilities <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f310.png" alt="🌐" class="wp-smiley" style="height: 1em; max-height: 1em;" />.</p>



<h2 class="wp-block-heading">Additional Resources</h2>



<p>In order to get started with OpenAI Gym, it&#8217;s crucial to familiarize yourself with the available resources. The <strong><a href="https://www.gymlibrary.dev/" target="_blank" rel="noreferrer noopener">official documentation</a></strong> offers a comprehensive understanding of the platform, while the <strong><a href="https://github.com/openai/gym" target="_blank" rel="noreferrer noopener">GitHub project</a></strong> provides access to the code base and showcases reinforcement learning algorithms at work.</p>



<p>If you encounter any errors or have questions, consider visiting <strong><a href="https://stackoverflow.com/questions/62303994/how-to-check-the-actions-available-in-openai-gym-environment" target="_blank" rel="noreferrer noopener">StackOverflow</a></strong> where you can find answers to common issues from fellow users.</p>



<p>If you&#8217;re interested in exploring some of the work that has stemmed from partnerships and implementations, check out the <strong><a href="https://openai.com/research/openai-gym-beta" target="_blank" rel="noreferrer noopener">Nervana DQN OpenAI Gym agent</a></strong> and this <strong><a href="https://blog.paperspace.com/getting-started-with-openai-gym/" target="_blank" rel="noreferrer noopener">Paperspace Blog</a></strong> post to get an idea of what&#8217;s possible using this platform.</p>



<p>For those familiar with Python, OpenAI Gym is set up as a library making it easier to integrate with your projects. To install or upgrade to the latest version, run the following command in your terminal:</p>



<pre class="EnlighterJSRAW" data-enlighter-language="generic" data-enlighter-theme="" data-enlighter-highlight="" data-enlighter-linenumbers="" data-enlighter-lineoffset="" data-enlighter-title="" data-enlighter-group="">pip install -U gym
</pre>



<p><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f449.png" alt="👉" class="wp-smiley" style="height: 1em; max-height: 1em;" /> Keep in mind that the Gym API utilizes different environments, which you can explore further <a href="https://github.com/openai/gym/blob/main/gym/envs/__init__.py" target="_blank" rel="noreferrer noopener">here</a>. To access specific data and create custom environments for your project, refer to the <code>gym.Env</code> class in the <strong><a href="https://github.com/openai/gym/blob/main/gym/core.py">source code</a></strong>.</p>



<p>No matter what stage of the project you&#8217;re at, there are ample resources to help you navigate OpenAI Gym. So go ahead, dive in, and build your understanding of this powerful tool! <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f680.png" alt="🚀" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>



<h2 class="wp-block-heading">Frequently Asked Questions</h2>



<h3 class="wp-block-heading">How can I find the installed version of OpenAI Gym?</h3>



<p>To find the installed version of OpenAI Gym, open a Python interpreter or a Jupyter notebook and enter the following commands:</p>



<pre class="EnlighterJSRAW" data-enlighter-language="generic" data-enlighter-theme="" data-enlighter-highlight="" data-enlighter-linenumbers="" data-enlighter-lineoffset="" data-enlighter-title="" data-enlighter-group="">import gym
print(gym.__version__)
</pre>



<p>This will print the current version of OpenAI Gym installed on your system.</p>



<h3 class="wp-block-heading">What are the common issues faced with OpenAI Gym version?</h3>



<p>Some common issues faced with OpenAI Gym versions include compatibility problems with specific environments, deprecated features, and unexpected changes in the API. It&#8217;s essential to stay updated about the <a href="https://github.com/openai/gym">latest changes</a> and follow the OpenAI Gym <a href="https://github.com/openai/gym/issues">issues on GitHub</a> for troubleshooting and updates.</p>



<h3 class="wp-block-heading">How can I upgrade my OpenAI Gym to the latest version?</h3>



<p>To upgrade your OpenAI Gym to the latest version, you can use pip by running the following command:</p>



<pre class="EnlighterJSRAW" data-enlighter-language="generic" data-enlighter-theme="" data-enlighter-highlight="" data-enlighter-linenumbers="" data-enlighter-lineoffset="" data-enlighter-title="" data-enlighter-group="">pip install -U gym
</pre>



<p>This command will update your OpenAI Gym installation to the most recent version available.</p>



<h3 class="wp-block-heading">Which Gym environments require specific OpenAI Gym versions?</h3>



<p>Some environments may require specific OpenAI Gym versions to function correctly. You can find specific version requirements under the description of each environment in the <a href="https://github.com/openai/gym">OpenAI Gym GitHub repository</a>.</p>



<h3 class="wp-block-heading">Are there performance differences between OpenAI Gym versions?</h3>



<p>Performance differences between OpenAI Gym versions may arise due to improvements, bug fixes, and changes in the API. It is recommended to keep your OpenAI Gym installation updated to benefit from the latest enhancements and ensure optimal performance.</p>



<h3 class="wp-block-heading">What Python versions are compatible with OpenAI Gym?</h3>



<p>OpenAI Gym supports Python 3.5 and higher. You can check your Python version by running the following command in your terminal:</p>



<pre class="EnlighterJSRAW" data-enlighter-language="generic" data-enlighter-theme="" data-enlighter-highlight="" data-enlighter-linenumbers="" data-enlighter-lineoffset="" data-enlighter-title="" data-enlighter-group="">python --version
</pre>



<p>If you&#8217;re using an older Python version, consider upgrading to a newer version to use OpenAI Gym seamlessly.</p>
<p>The post <a href="https://blog.finxter.com/openai-gym-quickstart/">OpenAI Gym Quickstart</a> appeared first on <a href="https://blog.finxter.com">Be on the Right Side of Change</a>.</p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>
