basejump-ai/basejump: Chat with your data. Reliably. A "batteries-included" approach to building your own AI application to talk with your database.

Basejump indexes a database and connects it with an AI data agent to chat with your data.

Key Features

✅ Accuracy: Uses SQLGlot to parse and validate queries, preventing hallucinated tables, columns, or filters
🔒 Security: Role-based access control ensures users and AI agents only access provisioned data
⚡ Fast Indexing: Redis vector database integration for rapid semantic search
🗄️ Full Tracking: Pre-configured schema tracks chat history, clients, teams, users, and query results
💾 Smart Caching: Supports semantic caching to retrieve stored datasets when a similar question is asked again
📦 Result Storage: Saves data results for later reference and auditing
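
To make the semantic caching idea concrete, here is a minimal sketch of a cache that returns a stored result when a new question is close enough to an earlier one. It is not Basejump's implementation: the `SemanticCache` class, the bag-of-words `embed` function, and the 0.7 threshold are all hypothetical stand-ins for a real embedding model and vector store.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy embedding: word counts. A stand-in for a real embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[token] * b[token] for token in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class SemanticCache:
    """Hypothetical cache keyed by question similarity instead of exact text."""

    def __init__(self, threshold=0.7):
        self.threshold = threshold
        self.entries = []  # list of (embedding, result) pairs

    def put(self, question, result):
        self.entries.append((embed(question), result))

    def get(self, question):
        """Return the result of the most similar cached question, or None."""
        query = embed(question)
        best_score, best_result = 0.0, None
        for vector, result in self.entries:
            score = cosine(query, vector)
            if score > best_score:
                best_score, best_result = score, result
        return best_result if best_score >= self.threshold else None


cache = SemanticCache()
cache.put("Provide a report of all clients.", "clients_report_v1")
hit = cache.get("Give me a report of all clients")   # similar wording
miss = cache.get("How many teams exist?")            # unrelated question
```

A production setup would replace `embed` with a real embedding model and the linear scan with a vector index, but the lookup logic stays the same: embed the question, find the nearest stored question, and reuse its result only above a similarity threshold.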

Why Basejump?

Reliability, Reproducibility, and Robustness (yes, we forced the alliteration).

We provide a data agent with features designed for robustness:

Deterministic validation using SQLGlot to parse and verify every query
Multi-level access control supporting RBAC through clients, teams, and users
Complete audit trail with queries and results saved in a pre-configured database schema
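
The idea behind deterministic validation can be illustrated without any AI in the loop: compile the generated SQL against the known schema and reject it if it references something that does not exist. The sketch below uses Python's built-in `sqlite3` module as a stand-in for SQLGlot, so it is not how Basejump does it. The schema and the `validate_query` helper are hypothetical, and SQLite's dialect differs from other databases.

```python
import sqlite3

# Hypothetical schema for illustration only; not Basejump's actual tables.
SCHEMA = """
CREATE TABLE clients (id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE teams (id INTEGER PRIMARY KEY, client_id INTEGER, name TEXT);
"""


def validate_query(sql, schema=SCHEMA):
    """Return (True, None) if sql compiles against schema, else (False, reason)."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.executescript(schema)
        # EXPLAIN compiles the statement without running the query itself,
        # so an unknown table or column raises an error every time.
        conn.execute(f"EXPLAIN {sql}")
        return True, None
    except sqlite3.Error as exc:
        return False, str(exc)
    finally:
        conn.close()


ok_valid, _ = validate_query("SELECT name FROM clients")
ok_table, table_reason = validate_query("SELECT name FROM invoices")
ok_column, column_reason = validate_query("SELECT revenue FROM clients")
```

Because the check is a compile step rather than a model judgment, the same query always gets the same verdict, and the failure reason can be passed back to the agent so it can correct the query.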

Installation

Create a virtual environment and then install from PyPI:

pip install basejump

This installs the core basejump library and common packages.

Getting Started

A demo starter project has been created under basejump-demo to help you get started quickly.

Dependencies

To run the demo, you will need the following:

An Azure account (required for embeddings) and, optionally, an AWS account, each with access to LLM models
Docker installed on your computer

Note

Azure is required since only Azure embeddings are currently supported. However, Claude Sonnet can still be used as the primary agent LLM. We hope to add more embedding options soon!

Steps

Follow these steps to set up the demo:

Clone this repo
Copy .env.example to .env and fill in your credentials
Run the demo:

cd basejump-demo
docker compose up -d
docker compose exec app python main.py

That’s it! After completing these steps, you should see the AI respond to your question based on the basejump database schema. You should also be able to run code outside the container, using localhost to reach the Postgres and Redis instances running in Docker.
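
Before running code outside the container, it can help to confirm that the services are reachable from the host. The sketch below assumes the demo publishes Postgres and Redis on their default ports (5432 and 6379). That is an assumption, so check the port mappings in the demo's compose file. The `port_open` helper is hypothetical and only tests that a TCP connection can be opened.

```python
import socket

# Assumed default ports; the demo's compose file may map different ones.
SERVICES = {"postgres": 5432, "redis": 6379}


def port_open(host, port, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    for name, port in SERVICES.items():
        status = "reachable" if port_open("localhost", port) else "not reachable"
        print(f"{name} on localhost:{port} is {status}")
```

If a service shows as not reachable, check that the containers are up and that the ports are published to the host.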

Example usage

A complete working example can be found in basejump-demo/main.py. Here’s the core functionality in just 10 lines:

async with service.run_session() as (core_session, db):
    service_context = service.create_service_context(core_session)
    user_info = await service.create_internal_user_info(db, service_context)
    connection = await service.setup_database(db, service_context, user_info, client_conn_params)
    await service.chat(
        db,
        "Provide a report of all clients.",
        service_context,
        user_info,
        connection,
    )

Contributing

Interested in contributing? See our Contribution Guide for more details.

Documentation

Full documentation can be found here. Please check it out for the most up-to-date information!

Next steps

Index your own database

Modify the client_conn_params in basejump-demo/settings to your own test database to explore how Basejump AI responds to your data.

Try Basejump Cloud

If you want to see the basejump open source project in action, you can check out https://basejump.ai/ to see how we implemented it. Docs on using the web interface can be found here: https://docs.basejump.ai/

The Basejump API docs can be found here: https://docs.basejump.ai/api/api-reference

Related Projects

Basejump would not be possible without all of the open source projects it is built on. The following are vital to the success of Basejump 👏

Llama Index: For AI data agents
SQLAlchemy: For database inspection and connections
SQLGlot: SQL query parsing

Supported Databases

The following databases are currently supported. If yours is missing, submit a PR to add it:

Postgres
Snowflake
Athena
MySQL
Redshift
SQL Server

Supported AI Models

Basejump is built on Llama Index and can theoretically support any AI model that Llama Index supports. Adding support for a new model is relatively straightforward, so please request one if you don’t see it.

Most of the LLMs from OpenAI and Anthropic are available via Azure and AWS, respectively. Supported Claude models can be found here. Supported OpenAI models can be found here.

To add a new model to Basejump, just update the AIModelSchema in basejump.core.models.schemas and submit a PR.
