Healthcare AI

VocalWell - Voice Pathology Diagnosis

AI-powered speech analysis system for early detection of vocal disorders using deep learning and acoustic feature analysis.

Project Overview

VocalWell addresses the critical issue of late-stage diagnosis of vocal disorders by engineering a sophisticated speech analysis AI using TensorFlow, Librosa, and advanced acoustic processing techniques.

The system performs comprehensive ablation studies on acoustic features including MFCCs, formants, and spectrograms to identify the most predictive indicators of vocal pathology, achieving superior accuracy compared to existing tools.

Optimized for on-device inference with processing times under 10 seconds to ensure patient privacy and enable real-time clinical applications without requiring cloud connectivity.

Key Features

Advanced acoustic feature extraction using MFCCs, formants, and spectrograms
Deep learning models trained on comprehensive vocal pathology datasets
Real-time on-device inference optimized for privacy and speed
Comprehensive ablation studies validating feature importance
7% accuracy improvement over existing open-source acoustic analysis tools
Open-source preprocessing scripts and evaluation pipeline for reproducibility

Technologies Used

TensorFlowLibrosaPythonMFCC AnalysisSpectrogram ProcessingDeep LearningAudio Processing

View Source Code

Project Details

Client

Personal Project

Timeline

3 months (2024)

Role

AI/ML Researcher & Developer

More Projects

AI Scheduling Assistant

AI/ML Project

OpenWorldLearning

EdTech AI