--- title: HNTAI - Medical Data Extraction API emoji: 📉 colorFrom: blue colorTo: green sdk: docker app_port: 7860 pinned: false --- # HNTAI - Medical Data Extraction API This is a Flask-based API for extracting and processing medical data from various document formats. ## Features - Document text extraction (PDF, DOCX, Images) - Audio transcription - Medical data extraction - PHI (Protected Health Information) scrubbing - Text summarization ## Deployment on Hugging Face Spaces - Uses Docker for deployment - All models and data are cached in /tmp - Optimized for memory usage - Auto-retries for model loading - Proper error handling ## Environment Variables All necessary environment variables are pre-configured for Hugging Face Spaces deployment. ## API Endpoints - POST /upload - Upload and process medical documents - POST /transcribe - Transcribe audio files - POST /extract_medical_data - Extract structured medical data - POST /api/generate_summary - Generate text summaries - POST /api/extract_medical_data_from_audio - Process audio recordings For more details, check the API documentation.