14 releases

Version | Released |
---|---|
0.2.1 (new) | Nov 3, 2024 |
0.2.0 | Nov 3, 2024 |
0.1.46 | Oct 26, 2024 |
0.1.44 | Sep 27, 2024 |
0.1.35 | May 23, 2023 |
Upstream Ontologist
The upstream ontologist provides a common interface for finding metadata about upstream software projects.
It gathers information from any available sources, prioritizes data it has higher confidence in, and reports the confidence for each piece of metadata.
The ontologist originated in Debian and the currently reported metadata fields are loosely based on DEP-12, but it is meant to be distribution-agnostic.
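The prioritization by confidence described above can be pictured with a small sketch. This is not the ontologist's actual implementation: the `Certainty` levels, the `Guess` type, and `merge_guesses` are hypothetical names chosen for illustration, and the example URLs are placeholders.

```python
from dataclasses import dataclass
from enum import IntEnum


class Certainty(IntEnum):
    # Hypothetical confidence levels, ordered from weakest to strongest.
    POSSIBLE = 1
    LIKELY = 2
    CONFIDENT = 3
    CERTAIN = 4


@dataclass(frozen=True)
class Guess:
    field: str            # metadata field name, e.g. "Repository"
    value: str            # the guessed value
    certainty: Certainty  # how much the source is trusted
    origin: str           # where the value was found, e.g. "package.json"


def merge_guesses(guesses):
    """Keep, for each field, the guess with the highest certainty.

    On a tie, the guess that was seen first wins.
    """
    best = {}
    for guess in guesses:
        current = best.get(guess.field)
        if current is None or guess.certainty > current.certainty:
            best[guess.field] = guess
    return best


guesses = [
    Guess("Repository", "https://example.com/readme-link", Certainty.POSSIBLE, "README"),
    Guess("Repository", "https://example.com/project.git", Certainty.CERTAIN, ".git/config"),
    Guess("Name", "project", Certainty.LIKELY, "package.json"),
]
merged = merge_guesses(guesses)
for field, guess in sorted(merged.items()):
    # Report the winning value together with its confidence and origin.
    print(f"{field}: {guess.value} ({guess.certainty.name.lower()}, from {guess.origin})")
```

Keeping the certainty next to each value lets a consumer decide for itself how much to trust a field, instead of receiving only the winning value.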
Provided Fields
Standard fields:
- Homepage: homepage URL
- Name: human name of the upstream project
- Contact: contact address of some sort of the upstream (e-mail, mailing list URL)
- Repository: VCS URL
- Repository-Browse: Web URL for viewing the VCS
- Bug-Database: Bug database URL (for web viewing, generally)
- Bug-Submit: URL to use to submit new bugs (either on the web or an e-mail address)
- Screenshots: List of URLs with screenshots
- Archive: Archive used - e.g. SourceForge
- Security-Contact: e-mail or URL with instructions for reporting security issues
- Documentation: Link to documentation on the web
Extensions for upstream-ontologist, not defined in DEP-12:
- SourceForge-Project: SourceForge project name
- Wiki: Wiki URL
- Summary: one-line description of the project
- Description: longer description of the project
- License: Single line license (e.g. "GPL 2.0")
- Copyright: List of copyright holders
- Version: Current upstream version
- Security-MD: URL to markdown file with security policy
- Author: List of people who contributed to the project
- Maintainer: The maintainer of the project
- Funding: URL to more information about funding
Supported Data Sources
At the moment, the ontologist can read metadata from the following upstream data sources:
- Python package metadata (PKG-INFO, setup.py, setup.cfg, pyproject.toml)
- package.json
- composer.json
- package.xml
- Perl package metadata (dist.ini, META.json, META.yml, Makefile.PL)
- Perl POD files
- GNU configure files
- R DESCRIPTION files
- Rust Cargo.toml
- Maven pom.xml
- metainfo.xml
- .git/config
- SECURITY.md
- DOAP
- Haskell cabal files
- go.mod
- Ruby gemspec files
- nuspec files
- OPAM files
- Debian packaging metadata (debian/watch, debian/control, debian/rules, debian/get-orig-source.sh, debian/copyright, debian/patches)
- Dart's pubspec.yaml
- meson.build
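To give a feel for what reading one of these sources involves, here is a minimal sketch that maps a few package.json keys onto the field names listed earlier. The mapping and the function name `fields_from_package_json` are assumptions made for illustration; the ontologist's real mapping may differ and covers many more cases.

```python
import json


def fields_from_package_json(text):
    """Map a few package.json keys onto metadata field names (illustrative only)."""
    data = json.loads(text)
    fields = {}

    # Keys whose string value can be copied over directly.
    simple = {
        "name": "Name",
        "homepage": "Homepage",
        "description": "Summary",
        "license": "License",
        "version": "Version",
    }
    for key, field in simple.items():
        value = data.get(key)
        if isinstance(value, str):
            fields[field] = value

    # "repository" may be a plain string or an object with a "url" key.
    repo = data.get("repository")
    if isinstance(repo, dict):
        repo = repo.get("url")
    if isinstance(repo, str):
        fields["Repository"] = repo

    # "bugs" may be a plain URL string or an object with a "url" key.
    bugs = data.get("bugs")
    if isinstance(bugs, dict):
        bugs = bugs.get("url")
    if isinstance(bugs, str):
        fields["Bug-Database"] = bugs

    return fields


sample = """
{
  "name": "example-pkg",
  "version": "1.2.3",
  "description": "An example package",
  "homepage": "https://example.com/",
  "repository": {"type": "git", "url": "https://example.com/example-pkg.git"},
  "bugs": {"url": "https://example.com/example-pkg/issues"}
}
"""
for field, value in fields_from_package_json(sample).items():
    print(f"{field}: {value}")
```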
It will also scan README and INSTALL for possible upstream repository URLs (and will attempt to verify that those match the local repository).
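The README scan and verification step might look roughly like the sketch below. It is a simplification under stated assumptions: the regular expression only recognizes three well-known hosting sites, and `candidate_repo_urls` and `matches_local_repo` are hypothetical helpers, not functions provided by the ontologist.

```python
import re

# Assumption: only a few well-known hosting sites are recognized here.
REPO_URL_RE = re.compile(
    r"https?://(?:github\.com|gitlab\.com|codeberg\.org)/[\w.-]+/[\w.-]+"
)


def candidate_repo_urls(readme_text):
    """Return unique repository-looking URLs in order of first appearance."""
    seen = []
    for match in REPO_URL_RE.finditer(readme_text):
        url = match.group(0).rstrip(".")  # drop sentence-ending periods
        if url.endswith(".git"):
            url = url[: -len(".git")]
        if url not in seen:
            seen.append(url)
    return seen


def _normalize(url):
    # Compare URLs case-insensitively, ignoring a trailing slash or ".git".
    url = url.strip().rstrip("/")
    if url.endswith(".git"):
        url = url[: -len(".git")]
    return url.lower()


def matches_local_repo(candidate, local_url):
    """Check a candidate URL against the URL configured for the local checkout."""
    return _normalize(candidate) == _normalize(local_url)


readme = (
    "Clone from https://github.com/dulwich/dulwich.git or report bugs at "
    "https://github.com/dulwich/dulwich/issues. Docs: https://www.dulwich.io/"
)
candidates = candidate_repo_urls(readme)
print(candidates)
print(matches_local_repo(candidates[0], "https://github.com/dulwich/dulwich.git"))
```

A URL found in free text is only a weak hint, which is why comparing it with the repository the checkout actually points at is a useful sanity check.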
In addition to local files, it can also consult external directories using their APIs.
Example Usage
The easiest way to use the upstream ontologist is by invoking the guess-upstream-metadata command in a software project:
$ guess-upstream-metadata ~/src/dulwich
Security-MD: https://github.com/dulwich/dulwich/tree/HEAD/SECURITY.md
Name: dulwich
Version: 0.20.15
Bug-Database: https://github.com/dulwich/dulwich/issues
Repository: https://www.dulwich.io/code/
Summary: Python Git Library
Bug-Submit: https://github.com/dulwich/dulwich/issues/new
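Output in this shape is easy to consume from a script. The following sketch assumes that every field appears on a single `Field: value` line, as in the sample above; multi-valued fields may be printed differently and are not handled. `parse_metadata` and `guess_upstream_metadata` are hypothetical helper names, and the second one assumes the command is available on `PATH`.

```python
import subprocess


def parse_metadata(output):
    """Turn 'Field: value' lines into a dict, skipping anything else."""
    fields = {}
    for line in output.splitlines():
        # Split on the first ": " only, so colons inside URLs are preserved.
        key, sep, value = line.partition(": ")
        if not sep:
            continue
        fields[key.strip()] = value.strip()
    return fields


def guess_upstream_metadata(path):
    """Run guess-upstream-metadata on a project directory and parse its output."""
    result = subprocess.run(
        ["guess-upstream-metadata", path],
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit status
    )
    return parse_metadata(result.stdout)


sample_output = """\
Security-MD: https://github.com/dulwich/dulwich/tree/HEAD/SECURITY.md
Name: dulwich
Version: 0.20.15
Bug-Database: https://github.com/dulwich/dulwich/issues
Repository: https://www.dulwich.io/code/
Summary: Python Git Library
Bug-Submit: https://github.com/dulwich/dulwich/issues/new
"""
metadata = parse_metadata(sample_output)
print(metadata["Name"], metadata["Version"])
```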
Alternatively, there is a Python API as part of the upstream_ontologist Python package. There are also autocodemeta and autodoap commands that can generate output in the codemeta and DOAP formats, respectively.