So you know some SQL already. And want to build AI Apps fast.

You need to get to know about MindsDB, which helps us to create AI Tools that need realtime data to perform their tasks.

The MindsDB Project

You can find project details and source code at:

You can give Minds DB a try with Vector DBs, for example with your SelfHosted ChromaDB

MindsDB Integrations - ML & LLMs

There are several ways to integrate MindsDB:

SelfHosting MindsDB with Docker

First Things First - Get Docker! 🐋

Important step and quite recommended for any SelfHosting Project - Get Docker Installed

It will be one command, this one, if you are in Linux:

apt-get update && sudo apt-get upgrade && curl -fsSL https://get.docker.com -o get-docker.sh
sh get-docker.sh && docker version

As per the Docs, we can run it with Docker CLI:

docker run -p 47334:47334 -p 47335:47335 mindsdb/mindsdb

But for proper SelfHosting and Docker Container Management, lets SelfHost mindsdb with docker-compose:

version: '3.9'

services:
  chroma:
    container_name: mindsdb-container
    image: mindsdb/mindsdb
    ports:
      - "47334:47334"
    volumes:
      - mindsdb_data:/mindsdb

volumes:
  mindsdb_data:
    driver: local

Then, just go to: http://localhost:47334

How to use MindsDB

Currently there are +100 Sources to use with MindsDB.

MindsDB - Web Crawler

The primary purpose of a web crawler is to collect data from the internet for various purposes, such as search engine indexing, content scraping, website analysis, and more.

With MindsDB, we can use a Web Crawler and get web data to train models, domain specific chatbots or fine-tune LLMs.

Initialize a web crawler:

CREATE DATABASE my_web 
WITH ENGINE = 'web';

Get content from a Web:

SELECT * 
FROM my_web.crawler 
WHERE url = 'docs.mindsdb.com' 
LIMIT 1;
#LIMIT 10; #10 internal pages
Few More Tricks with MindsDB Web Crawler 👇
  • Or from multiple WebSites
SELECT * 
FROM my_web.crawler 
WHERE url IN ('docs.mindsdb.com', 'docs.python.org') 
LIMIT 1;
  • Even PDF Content
SELECT * 
FROM my_web.crawler 
WHERE url = '<link-to-pdf-file>' 
LIMIT 1;

FAQ

Other F/OSS Ways to Check Sentiment Analysis?