Quick Start

Get up and running with Memorious in minutes.

Install

pip install memorious

Create a Crawler

Create a file called my_crawler.yml:

name: my_crawler
pipeline:
  init:
    method: seed
    params:
      url: https://example.com
    handle:
      pass: fetch

  fetch:
    method: fetch
    handle:
      pass: parse

  parse:
    method: parse
    params:
      store:
        mime_group: documents
    handle:
      fetch: fetch
      store: store

  store:
    method: directory
    params:
      path: ./output

This crawler:

Starts with a seed URL
Fetches the page
Parses it for links and documents
Recursively fetches linked pages
Stores documents to the ./output directory

Run It

memorious run my_crawler.yml

What's Next?

Installation - Installation options and environment setup
Crawlers - Learn how to configure crawlers
Operations - Explore available operations
CLI Reference - All CLI commands