#readability #port #content #updated #liveboat #arc90

readability-liveboat

Port of arc90's readability project to rust, updated for use with liveboat

2 releases

Uses old Rust 2015

new 0.3.1 Jan 13, 2025
0.3.0 Jan 11, 2025

#4 in #updated

Download history 129/week @ 2025-01-06

129 downloads per month

MIT license

26KB
695 lines

readability-rs

NOTE:

This is fork of readability-rs which contains changes required by the Liveboat project. I'm not original author of the project

Build Status

readability-rs is a library for extracting the primary readable content of a webpage. This is a rust port of arc90's readability project. inspired by kingwkb/readability.

Hot to use

  • Add readability to dependencies in Cargo.toml
[dependencies]
readability = "^0"
  • Then, use it as below

extern crate readability;
use readability::extractor;

fn main() {
  match extractor::scrape("https://spincoaster.com/chromeo-juice") {
      Ok(product) => {
          println!("------- html ------");
          println!("{}", product.content);
          println!("---- plain text ---");
          println!("{}", product.text);
      },
      Err(_) => println!("error occured"),
  }
}

Demo

Visit demo page.

License

MIT

Dependencies

~9–22MB
~322K SLoC