| summary |
read_when |
title |
| SenseAudio batch speech-to-text for inbound voice notes |
| You want SenseAudio speech-to-text for audio attachments |
| You need the SenseAudio API key env var or audio config path |
|
SenseAudio |
SenseAudio can transcribe inbound audio and voice-note attachments through OpenClaw's shared tools.media.audio pipeline. OpenClaw posts multipart audio to the OpenAI-compatible transcription endpoint and injects the returned text as {{Transcript}} plus an [Audio] block.
| Property |
Value |
| Provider id |
senseaudio |
| Plugin |
bundled, enabledByDefault: true |
| Contract |
mediaUnderstandingProviders (audio) |
| Auth env var |
SENSEAUDIO_API_KEY |
| Default model |
senseaudio-asr-pro-1.5-260319 |
| Default URL |
https://api.senseaudio.cn/v1 |
| Website |
senseaudio.cn |
| Docs |
senseaudio.cn/docs |
Getting started
```bash
export SENSEAUDIO_API_KEY="..."
```
```json5
{
tools: {
media: {
audio: {
enabled: true,
models: [{ provider: "senseaudio", model: "senseaudio-asr-pro-1.5-260319" }],
},
},
},
}
```
Send an audio message through any connected channel. OpenClaw uploads the
audio to SenseAudio and uses the transcript in the reply pipeline.
Options
| Option |
Path |
Description |
model |
tools.media.audio.models[].model |
SenseAudio ASR model id |
language |
tools.media.audio.models[].language |
Optional language hint |
prompt |
tools.media.audio.prompt |
Optional transcription prompt |
baseUrl |
tools.media.audio.baseUrl or model |
Override the OpenAI-compatible base |
headers |
tools.media.audio.request.headers |
Extra request headers |
SenseAudio is batch STT only in OpenClaw. Voice Call realtime transcription
continues to use providers with streaming STT support.
Related