#desktop #transcription #down #button #records #dictation #press

app desk-talk

A software that records when you press a button down, and types what you said when you release it

4 releases

0.1.3 Jan 5, 2024
0.1.2 Oct 22, 2023
0.1.1 Oct 22, 2023
0.1.0 Oct 22, 2023

#402 in Audio

MIT license

34KB
624 lines

desk-talk

Transcription for your desktop.

A software that records what you say when you press a button down, and types what you said when you release it.

https://github.com/sloganking/desk-talk/assets/16965931/e5da605b-3a9d-4394-b4ec-a3de65605a65

Setup

Make sure ffmpeg is installed and added to your PATH

Quickstart

Assign your OpenAI API key to the OPENAI_API_KEY environment variable and run:

desk-talk --ptt-key scroll-lock

Or pass your OpenAI API key as a flag like so:

desk-talk --ptt-key scroll-lock --api-key [YOUR_API_KEY]

desk-talk will now record every time you hold down the ptt-key, and type what you spoke every time you release it.

Other Usage

Special Keys

To find the name of a key by pressing it, run:

desk-talk show-key-presses

If your key shows as Unknown(number), pass number to the --special-ptt-key flag like so:

desk-talk --special-ptt-key 125

Non-default recording device

To use a microphone other than the system default, run

desk-talk list-devices    

to get a list of system microphone names. And pass the desired microphone name to --device like so:

desk-talk --ptt-key scroll-lock --device "Microphone (3- USB Audio Device)"

Dependencies

~15–53MB
~820K SLoC