Skills

All Skills

speech

Skills tagged with #speech

@kangig94

bid

Submit a bid or speech in an active --user discuss session

kangig94/coral+4 more
18d ago
50
@xvirobotics

Doubao TTS — 豆包语音合成

Generate high-quality speech audio from text using Volcengine's Doubao TTS API. Supports short-form (real-time) and long-form (async, up to 100K characters) synthesis.

xvirobotics/metabot+5 more
18d ago
4580
@second-state

Qwen3 ASR — Voice Transcription

Transcribe speech from audio files to text.

second-state/qwen3_asr_rs
18d ago
1880
@cinience

alicloud-ai-audio-asr

Transcribe non-realtime speech with Alibaba Cloud Model Studio Qwen ASR models (`qwen3-asr-flash`, `qwen-audio-asr`, `qwen3-asr-flash-filetrans`). Use when converting recorded audio files to text, generating transcripts with timestamps, or documenting DashScope/OpenAI-compatible ASR request and response fields.

cinience/alicloud-skills+61 more
19d ago
3530
@tenequm

audio-quality-check

Analyze audio recording quality - echo detection, loudness, speech intelligibility, SNR, spectral analysis. Use when the user wants to check a recording's quality, detect echo or duplication in audio files, measure speech clarity, compare original vs processed audio, diagnose why a recording sounds bad, or analyze audio tracks from Blackbox or any call recording app. Triggers on audio quality, recording analysis, echo detection, check recording, sound quality, analyze audio, speech quality, PESQ, STOI, loudness, SNR, audio diagnostics, recording sounds bad, echo in recording, audio duplication.

tenequm/skills+25 more
9d ago
180