speech-mine
Speech diarization and transcript analysis toolkit. Extract speaker-labeled transcripts from audio using WhisperX, then format and search them, or split the audio into segments.
Installation
Claude Code / MCP
speech-mine includes an MCP server that can be added to Claude Code.
See MCP Server for all available tools and usage.
Modules
| Module | Description |
|---|---|
| extract | Transcribe audio with speaker diarization |
| format | Format CSV transcripts into readable scripts |
| chunk | Split audio into segments via YAML config |
| search | Fuzzy search transcripts by word or phrase |
Quick Start
# 1. Extract a transcript
speech-mine extract interview.mp3 output.csv \
    --hf-token YOUR_TOKEN \
    --num-speakers 2 \
    --compute-type float32
# 2. Format it into a readable script
speech-mine format output.csv script.txt
# 3. Search it
speech-mine search "topic of interest" output.csv --pretty
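To run the extract and format steps over several recordings, the commands above can be wrapped in a small script. The following is a minimal sketch, not part of speech-mine: the helper names `run` and `pipeline` and the `DRY_RUN` and `HF_TOKEN` environment variables are hypothetical. It assumes every recording has two speakers and uses only the flags shown in the Quick Start.

```shell
#!/bin/sh
# Hypothetical wrapper around the Quick Start commands; the helper names and
# environment variables here are illustrative, not part of speech-mine.

# Print the command when DRY_RUN=1, otherwise execute it.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Extract and format one audio file, naming the outputs after the input.
pipeline() {
    audio=$1
    base=${audio%.*}    # strip the extension: interview.mp3 -> interview
    run speech-mine extract "$audio" "$base.csv" \
        --hf-token "${HF_TOKEN:?set HF_TOKEN first}" \
        --num-speakers 2 \
        --compute-type float32 || return 1
    run speech-mine format "$base.csv" "$base.txt"
}

# Process every file given on the command line.
for file in "$@"; do
    pipeline "$file" || exit 1
done
```

Saved under a name of your choice, the script can be invoked with `DRY_RUN=1` to print the commands it would run without executing them, which is a cheap way to check the derived file names before starting a long transcription.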
See Installation to get started.