Open-sourcing 4 solutions from the Enterprise RAG Challenge

We’ve written earlier about the first round of our Enterprise RAG Challenge. In this friendly challenge different AI Assistants competed in answering questions based on the annual reports of public companies. Teams from the different countries competed in building such assistants. There were even a few commercial solutions in the mix.

We are open sourcing code of four solutions from this leaderboard, including the winning one.

You can find code for the solutions marked with “TTA” (TimeToAct) on Github. Solutions include description of the approach, code itself, sometimes even the log of failed experiments.

Code is real, without any cleanups and beautification.

In short:

Daniel - winning and surprisingly simple solution that used checklist pattern with structured outputs. First place.
→ Check out Daniel's solution
Felix - multi-agent solution with ChatGPT-4o. 12th place.
→ Check out Felix' solution
Maria - solution using OpenAI Assistants API. 13th place.
→ Check out Maria's solution
Pedro - locally-capable solution using openchat-3.5-0106. ninth place.
→ Check out Pedro's solution

What's next?

Next round of Enterprise RAG Challenge will take place later this fall, with a bigger audience. The exact time depends on the organisation process within the TIMEOTACT GROUP. Perhaps, around November.

In the next round, question generator will be rebalanced, so that:

There are less questions that don't have an answer (agents must respond N/A to these)
There is more variability in the questions, so that “brute force” approach with checklists+structured outputs will not be able to win the competition so easily.

Questionnaires for the participants will also be reworked, so that we all together could learn more about approaches for Enterprise AI that work well in practice.

Insights

These are the proud winners of the Enterprise RAG Challenge

Discover the winners of the Enterprise RAG Challenge! Explore top RAG solutions, watch the official announcement, and see how AI-driven retrieval and LLMs shaped the best-performing models.

Blog

Team-Leaderboard of the Enterprise RAG Challenge

The team-leaderboard includes all submitted entries – including those submitted after the Ground Truth was released. Therefore, we consider this ranking an unofficial overview.

Blog 3/11/25

Answering Business Questions with LLMs

8th place in Enterprise RAG Challenge 2025: Answering Business Questions with LLMs

Blog 1/21/25

AI Contest - Enterprise RAG Challenge

TIMETOACT GROUP Austria demonstrates how RAG technologies can revolutionize processes with the Enterprise RAG Challenge.

Blog

How I Won the Enterprise RAG Challenge

In this article, Ilia Ris describes the approach that helped him achieve first place in both prize categories and the overall SotA leaderboard.

Blog 7/22/24

So You are Building an AI Assistant?

So you are building an AI assistant for the business? This is a popular topic in the companies these days. Everybody seems to be doing that. While running AI Research in the last months, I have discovered that many companies in the USA and Europe are building some sort of AI assistant these days, mostly around enterprise workflow automation and knowledge bases. There are common patterns in how such projects work most of the time. So let me tell you a story...

Blog 11/27/23

Part 4: Save Time and Analyze the Database File

ChatGPT-4 enables you to analyze database contents with just two simple steps (copy and paste), facilitating well-informed decision-making.

Blog 2/21/22

The Power of Event Sourcing

This is how we used Event Sourcing to maintain flexibility, handle changes, and ensure efficient error resolution in application development.

Blog 11/12/24

ChatGPT & Co: LLM Benchmarks for October

Find out which large language models outperformed in the October 2024 benchmarks. Stay informed on the latest AI developments and performance metrics.

Blog 12/4/24

ChatGPT & Co: LLM Benchmarks for November

Find out which large language models outperformed in the November 2024 benchmarks. Stay informed on the latest AI developments and performance metrics.

Blog 1/7/25

ChatGPT & Co: LLM Benchmarks for December

Find out which large language models outperformed in the December 2024 benchmarks. Stay informed on the latest AI developments and performance metrics.

Blog 10/1/24

ChatGPT & Co: LLM Benchmarks for September

Find out which large language models outperformed in the September 2024 benchmarks. Stay informed on the latest AI developments and performance metrics.

Blog 2/3/25

ChatGPT & Co: LLM Benchmarks for January

Find out which large language models outperformed in the January 2025 benchmarks. Stay informed on the latest AI developments and performance metrics.

Blog 5/25/21

From the idea to the product: The genesis of Skwill

We strongly believe in the benefits of continuous learning at work; this has led us to developing products that we also enjoy using ourselves. Meet Skwill.

Blog 11/12/24

ChatGPT & Co: LLM Benchmarks for October

Find out which large language models outperformed in the October 2024 benchmarks. Stay informed on the latest AI developments and performance metrics.

Blog 1/7/25

ChatGPT & Co: LLM Benchmarks for December

Find out which large language models outperformed in the December 2024 benchmarks. Stay informed on the latest AI developments and performance metrics.

Blog 2/3/25

ChatGPT & Co: LLM Benchmarks for January

Find out which large language models outperformed in the January 2025 benchmarks. Stay informed on the latest AI developments and performance metrics.

Blog 12/4/24

ChatGPT & Co: LLM Benchmarks for November

Find out which large language models outperformed in the November 2024 benchmarks. Stay informed on the latest AI developments and performance metrics.

Blog 10/1/24

ChatGPT & Co: LLM Benchmarks for September

Find out which large language models outperformed in the September 2024 benchmarks. Stay informed on the latest AI developments and performance metrics.

Insights

LLM Benchmarks March 2025

What's new in the world of LLMs? Find out and read why Google DeepMind managed to surprise us more than once last month.

Open-sourcing 4 solutions from the Enterprise RAG Challenge

We are open sourcing code of four solutions from this leaderboard, including the winning one.

What's next?

More on this topic

These are the proud winners of the Enterprise RAG Challenge

Team-Leaderboard of the Enterprise RAG Challenge

Answering Business Questions with LLMs

AI Contest - Enterprise RAG Challenge

How I Won the Enterprise RAG Challenge

So You are Building an AI Assistant?

Part 4: Save Time and Analyze the Database File

The Power of Event Sourcing

ChatGPT & Co: LLM Benchmarks for October

ChatGPT & Co: LLM Benchmarks for November

ChatGPT & Co: LLM Benchmarks for December

ChatGPT & Co: LLM Benchmarks for September

ChatGPT & Co: LLM Benchmarks for January

From the idea to the product: The genesis of Skwill

ChatGPT & Co: LLM Benchmarks for October

ChatGPT & Co: LLM Benchmarks for December

ChatGPT & Co: LLM Benchmarks for January

ChatGPT & Co: LLM Benchmarks for November

ChatGPT & Co: LLM Benchmarks for September

LLM Benchmarks March 2025