3 unstable releases
Version | Released |
---|---|
0.2.0 | Oct 5, 2024 |
0.1.1 | Sep 17, 2024 |
0.1.0 | Sep 16, 2024 |
Page Meta Extractor
Page Meta Extractor is an HTTP service that takes a URL as input and returns the meta tags extracted from that page as JSON.
Features
- Extract page title, description, favicon, and web app manifest information
- Support for HTTP and HTTPS URLs
- JSON output for easy integration with other services
Installation
From Binary (Release Page)
- Go to the Releases page
- Download the latest binary for your platform
- Make the binary executable:
chmod +x pages-meta
- Run the binary:
./pages-meta
Using cargo-binstall
If you have cargo-binstall installed:
cargo binstall pages-meta
Using Cargo Install
If you have Rust and Cargo installed:
cargo install pages-meta
Using Docker
docker pull henek/pages-meta
docker run -p 3000:3000 henek/pages-meta
Configuration
The application can be configured using the following environment variables:
- HOST: The host address to bind the server to (default: 127.0.0.1)
- PORT: The port number to listen on (default: 3000)
Example:
HOST=0.0.0.0 PORT=8080 ./pages-meta
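A client script that runs in the same environment as the server may want to derive the service address from the same two variables. The following is a minimal sketch under that assumption; the helper name `service_base_url` is hypothetical and not part of pages-meta. It applies the defaults documented above and maps the wildcard bind address 0.0.0.0 to the loopback address, since 0.0.0.0 is an address to listen on rather than one to connect to.

```python
import os


def service_base_url(env=None) -> str:
    """Build a base URL from HOST and PORT, using the documented defaults.

    `env` is any mapping of variable names to values; it defaults to the
    process environment. This helper is illustrative only.
    """
    env = os.environ if env is None else env
    host = env.get("HOST", "127.0.0.1")  # documented default
    port = int(env.get("PORT", "3000"))  # documented default
    if host == "0.0.0.0":
        # The server listens on all interfaces; connect through loopback.
        host = "127.0.0.1"
    return f"http://{host}:{port}"
```

With no variables set this yields the default address, and with the example values above it points at port 8080 on the local machine.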
Usage
Web Frontend
You can interact with the service through the web frontend at https://pages-meta.henek.ru/.
Example
- Open the web frontend in your browser: https://pages-meta.henek.ru/.
- Enter the URL https://example.com in the input field.
- Click the "Extract" button.
- The extracted meta information will be displayed in JSON format.
API Usage
Send a GET request to the service with the URL you want to extract meta information from:
http://localhost:3000/https://example.com
The service will return a JSON response with the extracted meta information.
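The request above can also be made from a script. The following is a minimal sketch using only the Python standard library; the function names `build_request_url` and `fetch_meta` are hypothetical, and the base URL assumes the service is running locally on the default port. The target URL is appended to the path exactly as in the example above. The field names of the JSON response are not documented here, so the sketch returns the decoded body without assuming its structure.

```python
import json
import urllib.request


def build_request_url(target_url: str, base_url: str = "http://localhost:3000") -> str:
    """Append the target URL to the service base URL, as in the example above."""
    return base_url.rstrip("/") + "/" + target_url


def fetch_meta(target_url: str, base_url: str = "http://localhost:3000",
               timeout: float = 10.0):
    """Send a GET request to the service and return the decoded JSON body."""
    request_url = build_request_url(target_url, base_url)
    with urllib.request.urlopen(request_url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_meta("https://example.com")` requires a running instance of the service. How the service treats target URLs that carry their own query string is not specified here, so test such URLs before relying on them.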
License
This project is licensed under the MIT License - see the LICENSE file for details.