AIssert

A lightweight testing suite for AI applications to ensure your generative outputs behave as expected.

This repo contains a sample app (chatbot) that you can ask questions about the following context:

On a warm spring afternoon, the city park was alive with activity. 
Alice, a literature enthusiast, was sitting on a brightly colored bench under the shade of a centuries-old oak tree, deeply immersed in a thick, worn-out novel. 
Nearby, Bob, a longtime friend, was engaged in an animated conversation with another visitor about recent community events and local happenings. 
The park was a melting pot of cultures—families picnicking on the grass, joggers pacing along winding paths, and street performers entertaining passersby with music and dance. 
As the day progressed, a small group of musicians arrived and began playing a gentle melody, blending harmoniously with the ambient chatter and the rustling of leaves. 
People spoke in various languages including English, Spanish, and French, reflecting the vibrant diversity of the neighborhood. 
This dynamic environment not only provided a scenic escape from the bustle of city life but also served as a microcosm of the city's rich cultural tapestry.

The idea behind this project is to give an example of what AIssert could be, a way of testing LLMs integrations with product apps.

Installation

Create the virtual environment

python3 -m venv .venv source .venv/bin/activate

Install dependencies

pip install -r requirements.txt

Run the app

python backend/main.py

Open the file index.html in your browser and then you can ask the chatbot anything about the context.

To the important bit, using Aissert

Running tests

python -m backend/test_llm

One of the AIssert tests we have will fail, as we are checking whether the prompt language and output language match (And this won't be true for anything that's not english)

======================================================================
FAIL: test_language_match_spanish (backend.test_llm.TestLLMQuery.test_language_match_spanish)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/Users/agonzalez/repos/aissert/example_app/backend/test_llm.py", line 31, in test_language_match_spanish
    self.assertTrue(result["passed"], msg=result["message"])
AssertionError: False is not true : Language mismatch: question is 'es', answer is 'en'.

----------------------------------------------------------------------

There are different ways of fixing this test (maybe using a model that would do that automatically, injecting instructions in the prompt, etc.)

We could have hundreds of small tests like this, that any developer can run when they iterate their model or their prompts. Think about PII leakage, gender bias, Prompt injection, token optimization etc.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

AIssert

Installation

To the important bit, using Aissert

Files

README.md

Latest commit

History

README.md

File metadata and controls

AIssert

Installation

To the important bit, using Aissert