nakiri

Simple cli tool written in Crystal to take a url (or html doc via stdin) and select elements from it.

Nakiri

A command-line tool for extracting data from web pages using CSS selectors. Nakiri can fetch content from URLs or read HTML from standard input, making it useful for web scraping and HTML parsing tasks.

Installation

shards install
crystal build src/nakiri.cr

Usage

nakiri -u URL -s SELECTOR [-a ATTRIBUTE]

Options

  • -u, --url=URL: URL to scrape (optional, reads from stdin if not provided)
  • -s, --selector=SELECTOR: CSS selector (required)
  • -a, --attribute=ATTR: Attribute to extract (optional)
  • -h, --help: Show help message

Examples

Extract all links from a webpage:

nakiri -u https://example.com -s "a" -a href

Extract all image sources:

nakiri -u https://example.com -s "img" -a src

Extract text content from specific elements:

nakiri -u https://example.com -s ".article-content p"

Process HTML from stdin:

curl https://example.com | nakiri -s "h1"

Requirements

  • Crystal >= 1.0.0

License

This project is open source and available under the MIT License.

Repository

nakiri

Owner
Statistic
  • 0
  • 0
  • 0
  • 0
  • 4
  • 6 months ago
  • November 20, 2024
License

MIT License

Links
Synced at

Sun, 08 Jun 2025 21:26:35 GMT

Languages