Battle of the Bots: A Guide to the Best Large Language Model (LLM) Chatbots

Confused by the dozens, if not hundreds of chatbots flooding the market? Worry not because we have done the legwork of assessing the top chatbots to identify the best ones for each use case.

Jul 26, 2023

*The Generative AI race is on*! *Buckle up because we have done the legwork of assessing the top chatbots to identify the best ones for each of your use cases. Image: Bing Image Creator*

Just a year ago, I would have said that Artificial Intelligence (AI) was a domain that was largely accessible only to Big Tech, deep tech start-ups, and the data scientists and machine learning engineers that dwelled there.

OpenAI’s launch of ChatGPT-3.5 in November 2022 changed all of that. For the first time, AI, in the form of Large Language Models (LLMs) became available to “layman” users (i.e., non-technical / non-developer users like me!).

Fast forward just 8 months later and there are now dozens if not hundreds of such chatbots flooding the market. The emergence of these powerful and versatile chatbots promise to change the way we work and live for the better. Yet it can be challenging to navigate the sea of options, especially in the face of such rapid development.

Many of you will therefore be asking these questions: Which chatbot truly delivers on its promise? Which one is best for my specific needs?

Best of Breed

I've undertaken a detailed assessment of the leading chatbots on the market. By examining factors such as pricing / cost, performance, versatility, ease of use, we've identified the top performers in a variety of categories.

Whether you're a business looking to jumpstart productivity, an educator seeking a study aid, or simply an AI enthusiast curious about the latest developments, we've got you covered.

So, without further ado, let's dive into our findings in the Battle of the Bots!

N.B. As this assessment has been developed for non-technical / non-developer users, the comparative coding capabilities of each chatbot have not been factored into the evaluations.

Overall Winner: ChatGPT

The first Generative AI entrant remains the clear market leader, with key strengths in terms of versatility, analytical and logical reasoning capabilities, and for the paid version, integration with third-party tools (known as plugins) and Chrome extensions.

Overall Runner Up: Google Bard

Google Bard got off to a rocky start, but has returned stronger with cool features such as integration with Google Docs, image recognition, providing images as part of its responses, and improved accuracy. Many other features have also been announced.

Research Winner: Perplexity AI

For researchers, students, consultants and other research-focused professionals, Perplexity AI leads the pack with its ability to conduct AI-enabled Internet searches, provide data sources and citations, and high degree of accuracy.

Data Analytics Winner: ChatGPT-4

All LLMs have taken big steps forward on this front, but none more so than ChatGPT-4. OpenAI’s recently ChatGPT-4 Code Interpreter plugin is exceptional, and handle everything from exploratory data analysis through to complex data science techniques.

Longform Content Winner: Claude 2

Claude 2 is miles ahead when it comes to longform content creation and analysis. The LLM can process extremely long prompts and is able to analyse hefty research papers, lengthy articles, even short books, and can compare up to five uploaded documents.

Internet Search Winner: Bing AI

Bing AI is our winner for general AI-enabled Internet searches because of the AI’s direct integration with Bing (only for the Microsoft Edge browser), easy-to-use interface, and it being powered by the most “intelligent” LLM (for now), ChatGPT-4.

Marketing Winner: Jasper AI

Jasper AI is the go-to for marketers - blog posts, product descriptions, marketing copy, It has features such as the ability to train the AI with your brand voice, and integration with Grammarly for spellchecks etc. The catch - it doesn’t come cheap!

Companion & Support Winner: Pi

I like many other chatbot tools, but I simply adore Pi. It has the unique distinction of being designed to act as a coach, confidante, creative partner, and sounding board, and to be emotionally intelligent and human-like in its conversations with users.

The Deep Dives

*We’ve investigated and assessed each chatbot along seven dimensions including availability, price, performance, versatility, accuracy / safety, ease of use, and how current their information is.* *Image: Bing Image Creator*

Now that we’ve identified the category winners, let’s deep dive into each chatbot to better understand their strengths, weaknesses, and unique features.

This assessment has been based on my own testing and evaluation of each tool, supplemented by company announcements and reviews by other analysts. With the focus being on usability and relevant for non-technical users, I have looked at each of the chatbots along the following dimensions and have rated them on a scale of 1 to 5 (with 1 being worst and 5 being the best):

Availability: Extent to which the chatbot is availability across countries
Price: Cost to the individual user
Performance: Efficacy of the chatbot in performing its designated tasks, with a focus on its core function. For instance, Jasper AI is targeted at marketers and its performance has been assessed only for this specific domain
Versatility: Extent to which the chatbot can perform well at a variety of tasks
Accuracy / Safety: Measure of the chatbot’s responses being accurate and unbiased, and is protected from attempts to “jailbreak” it (using various methods to get the chatbot to do something was not designed to do)
Ease of Use: Extent to which the user interface is intuitive and easy-to-navigate, and enables the user to organise previous conversations.

Current: Extent to which the chatbot’s data is current and up-to-date.

It is important to note that because the world of Generative AI is moving so quickly, I’d expect the assessment and rankings to change within a few months tops. My plan is to keep this evaluation fresh and to update this article regularly.

ChatGPT

There are two versions of ChatGPT, including the free model, ChatGPT-3.5, and the paid model, ChatGPT-4.

ChatGPT-3.5

The Generative AI war was kickstarted by OpenAI in November 2022 with ChatGPT-3.5. This model remains in use today - albeit with significant updates to improve its capabilities and accuracy - and is the free-to-use version most users are familiar with.