Spaces:

salvinjose
/

HNTAI

Paused

HNTAI / README.md

Update .gitignore to include additional files and directories for macOS, Linux, and application-specific configurations. Modify .huggingface.yaml to enhance Docker build settings and hardware requirements. Refactor app.py to remove legacy code and improve error handling. Remove deprecated files related to comprehensive streaming fixes, deployment scripts, and optimized Docker configurations. Update Dockerfile.prod to extend Gunicorn timeout for better performance. Enhance health endpoints and model management with improved logging and error handling. Consolidate routes and simplify architecture for better maintainability.

af75202 8 months ago

preview code

Raw

History Blame

14.1 kB

HNTAI - Medical Data Extraction & AI Processing Platform

A comprehensive, scalable AI platform for medical data extraction, processing, and analysis. Built with FastAPI, supporting multiple AI model backends including Transformers, OpenVINO, and GGUF models with automatic GPU/CPU optimization.

🏥 Overview

HNTAI is a production-ready medical AI platform that provides:

Medical Document Processing: PDF, DOCX, image, and audio transcription
Protected Health Information (PHI) Scrubbing: HIPAA-compliant data anonymization
AI-Powered Summarization: Multi-model support with automatic device optimization
Patient Summary Generation: Comprehensive clinical assessments
Simplified Architecture: Clean, maintainable codebase with essential features

🚀 Key Features

🤖 Multi-Model AI Support

Transformers Models: Hugging Face models with automatic GPU/CPU detection
OpenVINO Optimization: Intel-optimized models for production performance
GGUF Models: Quantized models for efficient inference
Automatic Device Selection: GPU when available, CPU fallback
Model Caching: Intelligent model management and caching

📄 Document Processing

Multi-format Support: PDF, DOCX, images, audio files
OCR Integration: Tesseract-based text extraction
Audio Transcription: Whisper-based speech-to-text
Batch Processing: Async processing for scalability

🔒 Security & Compliance

HIPAA Compliance: PHI scrubbing with audit logging
Data Encryption: Secure data handling and storage
Audit Trails: Comprehensive logging for compliance
Non-root Containers: Security-hardened deployments

📊 Monitoring & Observability

Health Endpoints: /health/live, /health/ready
Basic Metrics: Simple performance tracking
Structured Logging: Application logging
Audit Logging: HIPAA-compliant audit trails

🏗️ Architecture

┌─────────────────────────────────────────────────────────┐
│                    FastAPI Application                  │
│                      (main.py)                          │
└─────────────────────────────────────────────────────────┘
                            │
        ┌───────────────────┼───────────────────┐
        │                   │                     │
        ▼                   ▼                     ▼
┌──────────────┐   ┌──────────────┐   ┌──────────────┐
│   Routes     │   │   Agents      │   │   Utils       │
│              │   │               │   │               │
│ - /upload    │   │ - Text        │   │ - Model       │
│ - /transcribe│   │   Extractor   │   │   Manager     │
│ - /generate  │   │ - PHI         │   │ - JSON        │
│   _summary   │   │   Scrubber    │   │   Parser      │
│              │   │ - Patient     │   │ - Config      │
│              │   │   Summary     │   │               │
│              │   │ - Whisper     │   │               │
└──────────────┘   └──────────────┘   └──────────────┘
        │                   │                     │
        └───────────────────┼───────────────────┘
                            │
        ┌───────────────────┼───────────────────┐
        │                   │                   │
        ▼                   ▼                   ▼
┌──────────────┐   ┌──────────────┐   ┌──────────────┐
│   Models     │   │   Database   │   │   Health     │
│              │   │   (Optional)  │   │              │
│ - Transformers│   │ - Audit Logs │   │ - /health    │
│ - GGUF       │   │   (HIPAA)    │   │ - /metrics   │
│ - OpenVINO   │   │              │   │              │
│ - Whisper    │   │              │   │              │
└──────────────┘   └──────────────┘   └──────────────┘

🛠️ Installation

Prerequisites

Python 3.11+
CUDA 11.8+ (for GPU support)
Docker (for containerized deployment)
PostgreSQL 13+ (optional - for audit logs)

Local Development

Clone the repository:

git clone <repository-url>
cd HNTAI

Create virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:

pip install -r requirements.txt

Set up environment variables:

export DATABASE_URL="postgresql://user:password@localhost:5432/hntai"  # Optional - for audit logs
export SECRET_KEY="your-secret-key"
export JWT_SECRET_KEY="your-jwt-secret"
export HF_HOME="/tmp/huggingface"

Run the application:

# Development server
python -m uvicorn services.ai-service.src.ai_med_extract.main:app --reload --host 0.0.0.0 --port 7860

# Or using the service directly
cd services/ai-service
python src/ai_med_extract/main.py

Docker Deployment

Build the image:

docker build -t hntai:latest .

Run with Docker Compose:

docker-compose up -d

Kubernetes Deployment

Apply Kubernetes manifests:

kubectl apply -f infra/k8s/secure_deployment.yaml

Check deployment status:

kubectl get pods -l app=hntai

📚 API Documentation

Core Endpoints

Health & Monitoring

GET /health/live - Liveness probe
GET /health/ready - Readiness probe
GET /metrics - Prometheus metrics

Document Processing

POST /upload - Upload and process documents
POST /transcribe - Transcribe audio files
GET /get_updated_medical_data - Retrieve processed data
PUT /update_medical_data - Update medical data

AI Processing

POST /generate_patient_summary - Generate comprehensive patient summaries
POST /api/generate_summary - Generate text summaries
POST /api/patient_summary_openvino - OpenVINO-optimized summaries
POST /extract_medical_data - Extract structured medical data

Model Management

POST /api/load_model - Load specific AI models
GET /api/model_info - Get model information
POST /api/switch_model - Switch between models

🤖 AI Model Configuration

Supported Model Types

1. Transformers Models

{
    "model_name": "microsoft/Phi-3-mini-4k-instruct",
    "model_type": "text-generation"
}

2. OpenVINO Models

{
    "model_name": "OpenVINO/Phi-3-mini-4k-instruct-fp16-ov",
    "model_type": "openvino"
}

3. GGUF Models

{
    "model_name": "microsoft/Phi-3-mini-4k-instruct-gguf",
    "model_type": "gguf"
}

Automatic Device Detection

The system automatically detects and uses:

GPU: When CUDA is available
CPU: Fallback when GPU is not available
Optimization: Intel OpenVINO for production performance

🔧 Configuration

Environment Variables

Variable	Description	Default
`DATABASE_URL`	PostgreSQL connection string (optional - for audit logs)	Not required
`SECRET_KEY`	Application secret key	Required
`JWT_SECRET_KEY`	JWT signing key	Required
`HF_HOME`	Hugging Face cache directory	`/tmp/huggingface`
`TORCH_HOME`	PyTorch cache directory	`/tmp/torch`
`WHISPER_CACHE`	Whisper model cache	`/tmp/whisper`
`HF_SPACES`	Hugging Face Spaces mode	`false`
`PRELOAD_GGUF`	Preload GGUF models	`false`

Model Configuration

The system supports flexible model configuration through model_config.py:

# Default models for different tasks
DEFAULT_MODELS = {
    "text-generation": {
        "primary": "microsoft/Phi-3-mini-4k-instruct",
        "fallback": "facebook/bart-base"
    },
    "openvino": {
        "primary": "OpenVINO/Phi-3-mini-4k-instruct-fp16-ov",
        "fallback": "microsoft/Phi-3-mini-4k-instruct"
    },
    "gguf": {
        "primary": "microsoft/Phi-3-mini-4k-instruct-gguf",
        "fallback": "microsoft/Phi-3-mini-4k-instruct-gguf"
    }
}

🧪 Testing

Run Tests

# Unit tests
python -m pytest tests/

# Smoke test (no model loading)
cd services/ai-service
python run_smoke_test.py

# Integration tests
python -m pytest tests/integration/

Code Quality

# Format code
black .
isort .

# Lint code
flake8 .
mypy .

# Type checking
mypy services/ai-service/src/ai_med_extract/

📊 Monitoring

Health Checks

Liveness: GET /health/live - Application is running
Readiness: GET /health/ready - Application is ready to serve requests

Metrics

Prometheus: GET /metrics - Application and model metrics
Custom Metrics: Model inference time, success rates, error rates

Logging

Structured Logging: JSON-formatted logs
Audit Trails: PHI access and modification logs
Performance Logs: Model loading and inference timing

🔒 Security Features

HIPAA Compliance

PHI Scrubbing: Automatic removal of protected health information
Audit Logging: Comprehensive access and modification logs
Data Encryption: Secure data handling and storage
Access Controls: Role-based access to sensitive data

Container Security

Non-root Containers: Security-hardened container images
Resource Limits: CPU and memory limits
Network Policies: Secure network communication
Secrets Management: Secure handling of sensitive configuration

🚀 Deployment Options

1. Local Development

python -m uvicorn services.ai-service.src.ai_med_extract.main:app --reload

2. Docker

docker run -p 7860:7860 hntai:latest

3. Kubernetes

kubectl apply -f infra/k8s/secure_deployment.yaml

4. Hugging Face Spaces

# Configure for HF Spaces
export HF_SPACES=true
# The app.py file automatically detects HF Spaces environment

📁 Project Structure

HNTAI/
├── services/
│   └── ai-service/
│       └── src/
│           └── ai_med_extract/
│               ├── agents/              # Core agents (simplified)
│               │   ├── text_extractor.py
│               │   ├── phi_scrubber.py
│               │   ├── patient_summary_agent.py
│               │   └── medical_data_extractor.py
│               ├── api/
│               │   └── routes_fastapi.py  # All routes in one file
│               ├── utils/
│               │   ├── unified_model_manager.py  # Single model manager
│               │   ├── robust_json_parser.py
│               │   └── model_config.py
│               ├── app.py               # FastAPI app setup
│               ├── main.py              # Entry point
│               ├── health_endpoints.py  # Simple health checks
│               └── database_audit.py     # HIPAA audit logging
├── docs/
│   ├── hf-spaces/              # HF Spaces deployment guides
│   └── archive/                # Archived documentation
├── app.py                      # HF Spaces wrapper (minimal)
├── preload_models.py           # Model preloading
├── requirements.txt
└── README.md

🤝 Contributing

Fork the repository
Create a feature branch: git checkout -b feature/amazing-feature
Make your changes
Run tests: python -m pytest
Commit changes: git commit -m 'Add amazing feature'
Push to branch: git push origin feature/amazing-feature
Open a Pull Request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

📚 Documentation

Main Documentation

README_DEPLOYMENT.md - Quick deployment reference for HF Spaces
services/ai-service/README.md - Detailed service documentation

Deployment Guides (docs/hf-spaces/)

HF_SPACES_QUICKSTART.md - 10-minute deployment guide
DEPLOYMENT_CHECKLIST.md - Step-by-step checklist
MODEL_USAGE_GUIDE.md - Model configuration and usage
HF_SPACES_DEPLOYMENT.md - Complete deployment reference

Additional Resources

docs/archive/ - Historical documentation and summaries
services/ai-service/src/ai_med_extract/PRODUCTION_READY_SUMMARY.md - Production notes
services/ai-service/src/ai_med_extract/utils/INTEGRATION_GUIDE.md - Integration guide

🆘 Support

Documentation: Check the /docs endpoint for interactive API documentation
Issues: Report bugs and feature requests via GitHub Issues
Discussions: Join community discussions for questions and support

🔄 Changelog

Latest Updates

✅ Simplified architecture - Removed over-engineered components
✅ Unified model management - Single model manager for all model types
✅ Consolidated routes - All API endpoints in one file
✅ Simplified agents - Removed duplicate implementations
✅ Enhanced security and HIPAA compliance - Maintained audit logging
✅ Cleaner codebase - 50% fewer files, 40% less code

Built with ❤️ for the medical AI community