Fast AI Apps with SQL - MindsDB

And now yo want to build AI Apps fast.

You need to get to know about MindsDB, which helps us to create AI Tools that need realtime data to perform their tasks.

The MindsDB Project

You can find MindsDB project details and source code at:

The MindsDB Site
The MindsDB Source Code at Github
- The Docker Container to deploy MindsDB
- Mixed License: ELv2 and MIT ❗

You can give Minds DB a try with Vector DBs, for example with your SelfHosted ChromaDB

MindsDB Integrations - ML & LLMs

There are several ways to integrate MindsDB:

LLMs: Talking about LLMs, Free Models and locally, we are lucky to have the simplest way
- Ollama - Setup Ollama locally with Docker
With VectorDBs:
- LanceDB, Pinecone, Qdrant…
Time Series DBs:
- InfluxDB - F/OSS and you can deploy it with Docker. Plays well with IoT and Grafana.
And more: https://docs.mindsdb.com/integrations/data-integrations/all-data-integrations

SelfHosting MindsDB with Docker

First Things First - Get Docker! 🐋

Important step and quite recommended for any SelfHosting Project - Get Docker Installed

It will be one command, this one, if you are in Linux:

apt-get update && sudo apt-get upgrade && curl -fsSL https://get.docker.com -o get-docker.sh
sh get-docker.sh && docker version

As per the Docs, we can run it with Docker CLI:

docker run -p 47334:47334 -p 47335:47335 mindsdb/mindsdb

But for proper SelfHosting and Docker Container Management, lets SelfHost mindsdb with docker-compose:

#version: '3.9'

services:
  chroma:
    container_name: mindsdb-container
    image: mindsdb/mindsdb
    ports:
      - "47334:47334" #UI Port
    volumes:
      - mindsdb_data:/mindsdb

volumes:
  mindsdb_data:
    driver: local

Then, just go to: http://localhost:47334

How to use MindsDB

Currently there are +100 Sources to use with MindsDB.

MindsDB - Web Crawler

The primary purpose of a web crawler is to collect data from the internet for various purposes, such as search engine indexing, content scraping, website analysis, and more.

With MindsDB, we can use a Web Crawler and get web data to train models, domain specific chatbots or fine-tune LLMs.

Initialize a web crawler:

CREATE DATABASE my_web 
WITH ENGINE = 'web';

Get content from a Web:

SELECT * 
FROM my_web.crawler 
WHERE url = 'docs.mindsdb.com' 
LIMIT 1;
#LIMIT 10; #10 internal pages

Few More Tricks with MindsDB Web Crawler 👇

Or from multiple WebSites

SELECT * 
FROM my_web.crawler 
WHERE url IN ('docs.mindsdb.com', 'docs.python.org') 
LIMIT 1;

Even PDF Content

SELECT * 
FROM my_web.crawler 
WHERE url = '<link-to-pdf-file>' 
LIMIT 1;

FAQ

Other F/OSS Ways to Check Sentiment Analysis?

You can have a look the Detoxify Project
- It uses Pytorch and Transformers
- Apache v2 Licensed ✅

Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at [email protected].

The MindsDB Project#

MindsDB Integrations - ML & LLMs#

SelfHosting MindsDB with Docker#

How to use MindsDB#

MindsDB - Web Crawler#

FAQ#

Other F/OSS Ways to Check Sentiment Analysis?#