โ† Back to Speech & Transcription

deepgram

โ€” command-line interface for Deepgram speech-to-text

0
Source Code

Deepgram CLI Skills Guide

Tool

@deepgram/cli โ€” command-line interface for Deepgram speech-to-text.

Auth

deepgram login

Uses your Deepgram API key (stored locally).


Core Skill: Speech โ†’ Text

Transcribe a Local Audio File

deepgram listen prerecorded audio.wav

Transcribe with Options

deepgram listen prerecorded audio.wav \
  --model nova-2 \
  --language en \
  --punctuate \
  --diarize

Core Skill: Read / Reach Content

From URL (remote audio)

deepgram listen prerecorded https://example.com/audio.mp3

From STDIN (pipes)

cat audio.wav | deepgram listen prerecorded -

From Microphone (live)

deepgram listen microphone

Stop with Ctrl+C. Congrats, you just dictated reality.


Output Handling

Save Transcript

deepgram listen prerecorded audio.wav > transcript.json

Plain Text Output

deepgram listen prerecorded audio.wav --format text

Useful Flags (Memorize These)

  • --model โ€“ nova-2, general, etc.
  • --language โ€“ en, tr, de, โ€ฆ
  • --punctuate โ€“ adds punctuation
  • --diarize โ€“ speaker separation
  • --format โ€“ json, text, srt, vtt

Typical Workflow

  1. Reach content (file / URL / mic)
  2. Run deepgram listen
  3. Capture output (JSON or text)
  4. Post-process (search, summarize, subtitle)

Skill Summary

  • CLI-based speech-to-text
  • Local, remote, and live audio
  • Scriptable, pipe-friendly
  • Fast, accurate, no UI nonsense

Deepgram CLI: because keyboards are overrated.