Callmac Skill
Remote voice control for your Mac from mobile devices. Broadcast announcements, play alarms, tell stories, wake up kids - all triggered from messaging apps like Telegram or WhatsApp. Uses edge-tts with mixed Chinese/English support.
从移动设备远程控制 Mac 语音功能。广播公告、播放闹钟、讲故事、叫孩子起床 - 全部通过 Telegram 或 WhatsApp 触发。使用 edge-tts 支持中英文混合。
Quick Start | 快速开始
快速开始使用语音技能,生成和播放中英文混合的 TTS 音频。
Basic Usage | 基本用法
# Generate and play a simple announcement
python3 scripts/generate_tts.py --text "Hello world" --play
# Generate mixed Chinese/English content
python3 scripts/generate_tts.py --text "Hello 你好" --play
# Save to file
python3 scripts/generate_tts.py --text "Your message" --output announcement.mp3
Voice Selection
Edge TTS provides high-quality neural voices:
- English (US):
en-US-JennyNeural(friendly),en-US-AriaNeural(confident) - Chinese (Mandarin):
zh-CN-XiaoxiaoNeural(warm),zh-CN-XiaoyiNeural(lively) - Other languages: See VOICES.md for complete list
Features
1. Mixed Language Support
Automatically detects language segments and uses appropriate voices:
- English segments → English neural voice
- Chinese segments → Chinese neural voice
- Other languages → Default or specified voice
2. Playback Control
- Local playback on Mac using
afplay - Loop playback support (1-∞ times)
- Volume control (0-100%)
- Background/foreground playback options
3. File Management
- Save as MP3 files
- Concatenate multiple audio segments
- Batch processing support
4. Advanced Features
- Custom voice selection per segment
- Speech rate adjustment
- Pitch modification
- SSML support for advanced control
Workflows
Workflow 1: Simple Announcement Playback
- User provides text to speak
- System detects language(s)
- Generates appropriate TTS audio
- Plays locally on Mac
Workflow 2: Mixed Language Audio Creation
- User provides Chinese/English mixed text
- System splits by language segments
- Generates separate audio for each language
- Concatenates into single MP3 file
- Optionally plays or saves
Workflow 3: Scheduled/Repeated Announcements
- User provides text and playback schedule
- System creates cron job or loop
- Plays at specified intervals
- Can be stopped on demand
Scripts
scripts/generate_tts.py
Main script for TTS generation and playback.
Usage:
# Basic generation and playback
python3 generate_tts.py --text "Message" --play
# Save to file
python3 generate_tts.py --text "Message" --output file.mp3
# Mixed language with custom voices
python3 generate_tts.py --text "Hello 你好" --voice-en "en-US-AriaNeural" --voice-zh "zh-CN-XiaoyiNeural"
# Loop playback
python3 generate_tts.py --text "Message" --play --loops 5
# Volume control
python3 generate_tts.py --text "Message" --play --volume 80
# SSML support
python3 generate_tts.py --ssml "<speak>Hello <break time='500ms'/> world</speak>"
Parameters:
--text: Text to convert to speech--ssml: SSML markup for advanced control--output: Save to MP3 file--play: Play immediately after generation--voice: Default voice (overrides auto-detection)--voice-en: English voice override--voice-zh: Chinese voice override--loops: Number of times to play (default: 1)--volume: Playback volume 0-100 (default: system volume)--rate: Speech rate adjustment (+/- %)--pitch: Pitch adjustment (+/- Hz)
scripts/play_audio.py
Audio playback control utilities.
Usage:
# Play existing audio file
python3 play_audio.py --file audio.mp3
# Loop playback
python3 play_audio.py --file audio.mp3 --loops 3
# Volume control
python3 play_audio.py --file audio.mp3 --volume 75
# Stop all playback
python3 play_audio.py --stop
scripts/merge_audio.py
Merge multiple audio files.
Usage:
# Merge two files
python3 merge_audio.py --files part1.mp3 part2.mp3 --output combined.mp3
# Create playlist and merge
python3 merge_audio.py --playlist playlist.txt --output combined.mp3
References
VOICES.md
Complete list of available Edge TTS voices with language, gender, and style information.
INSTALLATION.md
Setup instructions for edge-tts and ffmpeg dependencies.
EXAMPLES.md
Example usage patterns and common scenarios.
Dependencies
- edge-tts:
pip3 install edge-tts - ffmpeg:
brew install ffmpeg(for audio merging) - macOS tools:
afplay(built-in),osascript(volume control)
Common Patterns
Pattern 1: Welcome Message
python3 scripts/generate_tts.py --text "Welcome to the system. 系统欢迎您。" --play --voice-en "en-US-JennyNeural" --voice-zh "zh-CN-XiaoxiaoNeural"
Pattern 2: Scheduled Reminder
# Create audio file
python3 scripts/generate_tts.py --text "Time for your meeting. 会议时间到了。" --output reminder.mp3
# Schedule playback (using cron)
echo "*/30 * * * * afplay /path/to/reminder.mp3" | crontab -
Pattern 3: Multi-language Announcement
python3 scripts/generate_tts.py --text "Alert: System update in progress. 警报:系统更新进行中。" --play --loops 3 --volume 90
Troubleshooting
No Audio Output
- Check system volume:
osascript -e "output volume of (get volume settings)" - Check mute status:
osascript -e "output muted of (get volume settings)" - Test with simple audio:
afplay /System/Library/Sounds/Ping.aiff
edge-tts Not Working
- Verify installation:
python3 -m edge_tts --list-voices - Check internet connection (edge-tts requires online access)
- Update package:
pip3 install --upgrade edge-tts
Audio Quality Issues
- Use neural voices for best quality
- Adjust speech rate if too fast/slow
- Consider SSML for precise control
Notes
- Edge TTS requires internet connection for voice synthesis
- Local playback uses macOS built-in
afplaycommand - Volume control uses
osascriptto adjust system volume - For offline use, consider alternative TTS solutions
- Large texts may be split into multiple segments automatically