8 releases

Uses old Rust 2015

0.3.0 Dec 20, 2023
0.2.0 Apr 1, 2021
0.1.5 May 7, 2019
0.1.4 Feb 17, 2018
0.1.2 Nov 13, 2017

#552 in Text processing

10,737 downloads per month
Used in 21 crates (11 directly)

MIT license

26KB
697 lines

readability-rs

readability-rs is a library for extracting the primary readable content of a webpage. It is a Rust port of arc90's Readability project, inspired by kingwkb/readability.

How to use

  • Add readability to dependencies in Cargo.toml
[dependencies]
readability = "^0"
  • Then, use it as below

extern crate readability;
use readability::extractor;

fn main() {
    // Fetch the page and extract its main content.
    match extractor::scrape("https://spincoaster.com/chromeo-juice") {
        Ok(product) => {
            println!("------- html ------");
            println!("{}", product.content);
            println!("---- plain text ---");
            println!("{}", product.text);
        }
        Err(_) => println!("error occurred"),
    }
}
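
Once you have the plain text, you may want a short one-line summary rather than the whole article. The sketch below shows one way to do that with the standard library only. `preview` is a hypothetical helper and not part of the readability crate; it assumes the input is an ordinary string such as `product.text` from the example above, which may contain runs of whitespace.

```rust
/// Collapse whitespace runs and keep at most `max_chars` characters.
/// Hypothetical helper for illustration; not provided by the readability crate.
fn preview(text: &str, max_chars: usize) -> String {
    // split_whitespace trims both ends and splits on any run of whitespace.
    let collapsed = text.split_whitespace().collect::<Vec<_>>().join(" ");
    if collapsed.chars().count() <= max_chars {
        return collapsed;
    }
    // Truncate by characters, not bytes, so multi-byte UTF-8 is never split.
    let mut short: String = collapsed.chars().take(max_chars).collect();
    short.push_str("...");
    short
}

fn main() {
    // Stand-in for `product.text`; a real value would come from `extractor::scrape`.
    let text = "  Chromeo \n\n  released   a new single.  ";
    println!("{}", preview(text, 20));
}
```

Counting characters instead of bytes keeps the truncation safe for non-ASCII pages, at the cost of one extra pass over the string.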

Demo

Visit demo page.

License

MIT

Dependencies

~9–21MB
~318K SLoC