34 stable releases (5 major)

6.0.1 Feb 27, 2025
5.0.0 Jan 21, 2025
4.2.0 Jan 21, 2025
4.0.5 Oct 29, 2024
1.4.5 Oct 8, 2024

#273 in Text processing

Download history 1/week @ 2024-11-13 1/week @ 2024-11-20 32/week @ 2024-11-27 268/week @ 2024-12-04 220/week @ 2024-12-11 31/week @ 2024-12-18 51/week @ 2025-01-01 302/week @ 2025-01-08 141/week @ 2025-01-15 74/week @ 2025-01-22 7/week @ 2025-02-05 126/week @ 2025-02-19 151/week @ 2025-02-26

284 downloads per month

Apache-2.0

41KB
847 lines

UAST: Unicode Aware Saṃskṛta Transliteration

[!IMPORTANT] For implementation in more than 10 scripts, visit https://uast.dev for web-version or https://github.com/aneri0x4f/uast-cli for CLI version. This Rust implementation does the following conversions:

  • [d]: UAST-IO/IAST to देवनागरी
  • [i]: देवनागरी to IAST
  • [g]: देवनागरी to ગુજરાતી
  • [s]: SLP1 to IAST

UAST - A tool for transliterating and typing Saṃskṛta in the easiest computer, language, and human-friendly way.

Direct installation

To directly install with Cargo,

cargo install -v uast

This is the preferred method as it handles everything for you.

Manual installation

To install this program, you will need to install Rust.

After that, simply run:

cargo install -v --git https://github.com/dhruvildave/uast.rs

To build locally,

make

To remove,

make clean

If you use this repository, please cite the following paper:

@misc{uast_2022,
  doi = {10.48550/ARXIV.2203.14277},
  url = {https://arxiv.org/abs/2203.14277},
  author = {Dalwadi, Aneri and Dave, Dhruvil},
  keywords = {Human-Computer Interaction (cs.HC), FOS: Computer and information sciences, H.5.2},
  title = {UAST: Unicode Aware Saṃskṛta Transliteration},
  publisher = {arXiv},
  year = {2022},
  copyright = {Creative Commons Attribution 4.0 International}
}

poster

Devanāgarī is the writing system that is adapted by various languages like Sanskrit (IAST: saṃskṛta). International Alphabet of Sanskrit Transliteration (IAST) is a transliteration scheme for romanisation of Sanskrit language. IAST makes use of diacritics to represent various characters. On a computer, these are represented using Unicode standard which differs from how the Sanskrit language behaves at a very fundamental level. This results in an issue that is encountered while designing typesetting software for devanāgarī and IAST. We hereby discuss the problems and provide a solution that solves the issue of incompatibilities between various transliteration and encoding schemes.

Web version URL: https://uast.dev

Click on the encoding name for available options.

No runtime deps