4 releases
0.1.3 | Jan 5, 2024 |
---|---|
0.1.2 | Oct 22, 2023 |
0.1.1 | Oct 22, 2023 |
0.1.0 | Oct 22, 2023 |
#402 in Audio
34KB
624 lines
desk-talk
Transcription for your desktop.
A software that records what you say when you press a button down, and types what you said when you release it.
https://github.com/sloganking/desk-talk/assets/16965931/e5da605b-3a9d-4394-b4ec-a3de65605a65
Setup
Make sure ffmpeg is installed and added to your PATH
Quickstart
Assign your OpenAI API key to the OPENAI_API_KEY
environment variable and run:
desk-talk --ptt-key scroll-lock
Or pass your OpenAI API key as a flag like so:
desk-talk --ptt-key scroll-lock --api-key [YOUR_API_KEY]
desk-talk
will now record every time you hold down the ptt-key, and type what you spoke every time you release it.
Other Usage
Special Keys
To find the name of a key by pressing it, run:
desk-talk show-key-presses
If your key shows as Unknown(number)
, pass number
to the --special-ptt-key
flag like so:
desk-talk --special-ptt-key 125
Non-default recording device
To use a microphone other than the system default, run
desk-talk list-devices
to get a list of system microphone names. And pass the desired microphone name to --device
like so:
desk-talk --ptt-key scroll-lock --device "Microphone (3- USB Audio Device)"
Dependencies
~15–53MB
~820K SLoC