Skip to content

An open-source alternative to OpenAI Deep Research, combining web searching, content crawling, and AI-driven synthesis for in-depth topic exploration

Notifications You must be signed in to change notification settings

chihebnabil/open-deep-search

Repository files navigation

Open Deep Search

An AI-powered web research tool that performs deep research on any topic by combining web searching, content crawling, and AI synthesis capabilities.

Note: I'm using my own affordable Google Search API , which is currently hosted on RapidAPI.

Features

  • Automated multi-step research process
  • Web content crawling and analysis
  • AI-powered synthesis of findings
  • Automatic follow-up query generation
  • Research paper generation
  • Detailed progress logging

Requirements

  • Node.js 16+
  • OpenAI API key
  • RapidAPI key (for Google Search API)
  • Firecrawl API key

Installation

  1. Clone the repository
  2. Install dependencies:
npm install
  1. Create a .env file with your API keys:
OPENAI_API_KEY=your_openai_key
RAPIDAPI_KEY=your_rapidapi_key
FIRECRAWL_API_KEY=your_firecrawl_key
MAX_RESEARCH_STEPS=5
SEARCH_COUNTRY=us
SEARCH_LANG=en
SEARCH_DATE_RANGE=lastYear

Usage

Run the research agent with a topic:

npm start "Your research topic"

The research paper will be saved in the research folder.

Research Process Flow

flowchart TB
    Start[Start Research] --> InitQuery[Initialize Query]
    InitQuery --> SearchLoop{Research Loop}
    
    SearchLoop --> WebSearch[Search Web APIs]
    WebSearch --> Crawl[Crawl Web Content]
    Crawl --> Synthesis[AI Synthesis]
    Synthesis --> SaveStep[Save Research Step]
    
    SaveStep --> CheckSteps{More Steps?}
    CheckSteps -->|Yes| GenQueries[Generate Follow-up Queries]
    GenQueries --> SearchLoop
    
    CheckSteps -->|No| GenPaper[Generate Research Paper]
    GenPaper --> SavePaper[Save Paper to File]
    SavePaper --> End[End Research]
Loading

Architecture

  1. Web Research Agent

    • Manages the research process
    • Coordinates between different services
    • Handles error recovery and logging
  2. Services Used

    • OpenAI API (GPT-4) for content analysis and synthesis
    • RapidAPI's Google Search API for web searches
    • Firecrawl for web content extraction
  3. Output

    • Structured research paper in Markdown format
    • Detailed research steps and findings
    • Progress logs with emoji indicators

Limitations

  • Depends on third-party API availability
  • Web crawling may be blocked by some websites

License

MIT

About

An open-source alternative to OpenAI Deep Research, combining web searching, content crawling, and AI-driven synthesis for in-depth topic exploration

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published