There are few AI Scrapping Tools out there already.

Lets have a look to repo reader

The RepoReader Project

Explore and ask questions about a GitHub code repository using OpenAI’s GPT.

Star History Chart

RepoReader Setup

git clone https://github.com/cmooredev/RepoReader
cd RepoReader

We need to provide OpenAI API key (which is not OSS):

#source .env
export OPENAI_API_KEY="sk-..." #linux

And make a proper python setup:

#sudo apt install python3.12-venv
#sudo apt install libxml2-dev libxslt1-dev

python3 -m venv RepoReader_venv

source RepoReader_venv/bin/activate

pip install -r requirements.txt
python3 app.py

More AI Powered Scrapping

Star History Chart

Crawl4AI

Crawl4AI

ScrapeGraph

ScrapeGraph

FireCrawl

Conclusions

There will always be space for beautiful soup.

But we have to recognize the power of these AI powered scrappers.