AzureSpeechProject Code

AzureSpeechProject

Brought to you by: stranger90

Tree [a34524] master /

History

HTTPS access

File	Date	Author	Commit
.github	2025-06-27	Bohdan Harabadzhyu	[0493e8] Create dotnet-desktop.yml
AzureSpeechProject	2025-06-27	Bohdan Harabadzhyu	[d1fa9b] SettingsView and converters...
Screenshots	2025-06-27	Bohdan Harabadzhyu	[320085] Added screens and updated README.md...
.gitignore	2025-06-27	Bohdan Harabadzhyu	[9291a6] ...
AzureSpeechProject.sln	2025-06-25	Bohdan Harabadzhyu	[0d13cb] Initial commit
LICENSE	2025-06-25	Bohdan Harabadzhyu	[0c4d79] Init...
README.md	2025-06-28	Bohdan Harabadzhyu	[a34524] added YouTube Review...

Read Me

AzureSpeechProject

Azure Speech Recognition and Translation

A desktop application built with Avalonia UI that provides real-time speech recognition and translation using Azure Speech Services. Convert spoken words into text and translate them into multiple languages with professional-grade accuracy.

Features

🎤 Real-time Speech Recognition
- High-quality speech-to-text conversion
- Support for multiple audio formats and sample rates

🌍 Multi-language Translation
- Real-time translation to 9+ languages
- Support for Russian, Spanish, French, German, Italian, Portuguese, Japanese, Korean, and Chinese.
- Side-by-side original and translated text display

💾 Flexible Export Options
- Save transcripts in multiple formats (TXT, JSON, SRT)
- Automatic timestamping for all segments
- Organized file management with custom output directories

🔒 Secure Credential Management
- Encrypted storage of Azure Speech Service keys

⚙️ Advanced Audio Configuration
- Customizable sample rates (8kHz to 48kHz)
- Bit depth options (8-bit to 32-bit)
- Mono/stereo channel selection
- Optimized for Azure Speech Services (16kHz, 16-bit, mono recommended)

Prerequisites

Azure Account Setup

Create an Azure Account
- Visit Azure Portal
- Sign up for a free account or use your existing subscription
Create a Speech Service Resource
- Use the Azure Portal:
- Navigate to "Create a resource"
- Search for "Speech" and select "Speech Services"
- Fill in the required information:
- Subscription: Your Azure subscription
- Resource Group: Create new or select existing
- Region: Choose your preferred region (e.g., East US, West Europe)
- Name: Unique name for your service
- Pricing tier: Select appropriate tier (F0 for free tier, S0 for standard)
Get Your Credentials
- After deployment, go to your Speech Service resource
- Navigate to "Keys and Endpoint"
- Copy one of the keys and the region
- You'll need these for the application configuration

System Requirements

Operating System: Windows 11
.NET Runtime: .NET 8.0 or later
Microphone: Any compatible audio input device
Internet Connection: Required for Azure Speech Services

Configuration

Initial Setup

Launch the Application
- Open the application
- Navigate to the "Settings" tab
Configure Azure Speech Service
- Region: Enter your Azure Speech Service region (e.g., eastus, westeurope)
- Key: Enter your Azure Speech Service key
- Use the 🔒 Show/Hide button to verify your key entry
Audio Settings (Optional)
- Sample Rate: 16000 Hz (recommended for Azure)
- Bits Per Sample: 16 bits (recommended)
- Channels: 1 (mono, recommended)
Output Directory
- Click "Browse..." to select where transcripts will be saved
- Default: Documents/Azure Speech Services/Transcripts
Save Settings
- Click "Save Settings" to store your configuration

Advanced Configuration

Audio Quality Settings

For optimal Azure Speech Services performance:
- Sample Rate: 16 kHz (16000 Hz)
- Bit Depth: 16-bit
- Channels: Mono (1 channel)

Higher quality settings may not improve accuracy and will use more bandwidth.

Supported Languages

Speech Recognition: English (en-US)
Translation Targets: Russian (ru), Spanish (es), French (fr), German (de), Italian (it), Portuguese (pt), Japanese (ja), Korean (ko), Chinese Simplified (zh-Hans)

Usage

Basic Transcription

Start Recording
- Click the "Start Recording" button
- Grant microphone permissions if prompted
- Speak clearly into your microphone
Monitor Progress
- Real-time transcription appears in the left panel
- Status bar shows recording progress
Stop Recording
- Click "Stop Recording" when finished
- Review the transcription results
Save Results
- Click "Save" to export your transcript
- Choose from TXT, JSON, or SRT formats

Translation Workflow

Enable Translation
- Check "Enable Translation" in the control panel
- Select your target language from the dropdown
Start Recording
- Begin recording as normal
- Translation appears in real-time in the right panel
Export Results
- Save both original and translated versions
- Files are automatically named with language suffixes

Troubleshooting

Common Issues

Authentication Errors
- Verify your Azure Speech Service key and region
- Ensure your Azure subscription is active
- Check that your Speech Service resource is properly configured

Audio Issues
- Confirm microphone permissions are granted
- Test microphone functionality in system settings
- Try different audio quality settings

Network Connectivity
- Ensure stable internet connection
- Check firewall settings for outbound HTTPS traffic
- Verify Azure service availability in your region

Performance Optimization

For Best Results:
- Use a high-quality USB microphone
- Record in a quiet environment
- Maintain consistent speaking volume
- Speak clearly and at a moderate pace

Resource Management:
- Close unnecessary applications during recording
- Ensure sufficient disk space for transcripts
- Monitor Azure Speech Service usage quotas

Security Best Practices

🔐 Credential Security
- Never share your Azure Speech Service keys
- Use the show/hide feature to verify key entry
- Rotate keys periodically in Azure Portal
- Don't commit keys to version control

🛡️ Data Privacy
- Audio data is processed by Azure Speech Services
- Review Azure's data handling policies
- Consider data residency requirements
- Implement network security measures

Contributing

Contributions are welcome. Please fork the repository and create a pull request with your changes.

Author

Bohdan Harabadzhyu

License

MIT

This application is not affiliated with Microsoft Corporation. Azure and Azure Speech Services are registered trademarks of Microsoft Corporation.

YouTube Review

📺 Watch Video Review

[![YouTube](https://img.youtube.com/vi/m6Bu3btp9uw/maxresdefault.jpg)](https://youtu.be/m6Bu3btp9uw)

AzureSpeechProject Code