1 unstable release

Uses new Rust 2024

new 0.1.3 Mar 21, 2025

#1189 in Text processing

Download history 79/week @ 2025-03-16

79 downloads per month

MIT license

5.5MB
524 lines

Parser CLI

Command-line interface for the parser-core library, enabling text extraction from various document formats.

Features

  • Extract text from multiple files in a single command
  • Support for all formats handled by parser-core
  • Stream results to stdout for piping to other tools

Installation

# From source
cargo install --path .

# Or within the workspace
cargo build -p parser-cli

Usage

Parse one or more files and extract their text content to stdout:

parser-cli <FILES>...

Example:

parser-cli document.pdf presentation.pptx report.docx

Integration

Useful in shell pipelines:

# Count words in a document
parser-cli document.pdf | wc -w

# Search for text in multiple documents
parser-cli *.pdf | grep "search term"

Dependencies

~35–48MB
~748K SLoC