#google-cloud #speech-recognition #google

google-cognitive-apis-custom

Library wrapping Google speech-to-text, text-to-speech and dialogflow APIs. Provides high level API layer wrapping the underlying complexity of GRPC.

1 unstable release

0.1.0 Aug 13, 2024

#411 in Multimedia

25 downloads per month

MIT/Apache

1.5MB
23K SLoC

Rust 18K SLoC // 0.0% comments Bazel 4.5K SLoC // 0.1% comments

Google Cognitive APIs


CI License License: MIT Crates.io rustdoc

Asynchronous Rust bindings for Google Cloud Platform cognitive gRPC APIs. Provides high level interfaces wrapping complexity of low-level GRPC implementation. Bidirectional gRPC streaming is supported with two alternative approaches:

  • tokio.rs channels
  • asynchronous streams facilitated by crate async-stream

Following APIs are currently supported:

Cognitive API Feature name Status
Dialogflow ES dialogflow Complete
Speech-to-text speech-to-text Complete
Text-to-speech text-to-speech Complete

IMPORTANT: Version 0.2.0 is upgrading underlying GRPC stack (tonic, prost and prost-build libraries). Please keep in mind that prost-build requires now external Protocol Buffer compiler (protoc). Details can be found here. You can download protoc from github.

Google API proto definitions

Google proto definitions have been taken from this repo.

Limitations

  • Only limited subset of Google cognitive APIs is supported. Feel free to raise PR with new additions!
  • Dialogflow CX is not yet supported.
  • For Dialogflow we currently support only SessionClient (The purpose of this library is not support different DialogFlow management APIs).
  • REST APIs are supported with single purpose: to define structs that will enable deserialization of JSON config structures and their conversion into GRPC counterparts. Full support for REST APIs will be not introduced.

Examples

You can find all examples here.

License

Licensed under either Apache-2.0 or MIT license.

Dependencies

~19–32MB
~583K SLoC