Skip to content

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

License

Notifications You must be signed in to change notification settings

fr0ziii/frumenta

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Frumenta

Description

This project provides a set of tools for web scraping and data extraction. It includes features for crawling websites, extracting data from HTML content, and storing the extracted data in a structured format.

Features

  • URL crawling
  • Data extraction from HTML
  • Data storage in file-based storage

Getting Started

  1. Clone the repository: git clone [repository URL]
  2. Install the dependencies: pip install -r backend/requirements.txt (for the backend) and cd frontend && npm install (for the frontend)
  3. Run the application: python backend/main.py (for the backend) and cd frontend && npm run dev (for the frontend)

Usage

  1. Enter the URL to crawl in the form.
  2. Click the "Crawl" button.
  3. View the extracted data in the results display.

Contributing

Please read the CONTRIBUTING.md file for information on how to contribute to this project.

License

This project is licensed under the MIT License - see the LICENSE file for details.

↑ Back to Top ↑

About

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published