Enhance AI service by updating token limits for Phi-3.5 models, improving model loading logic, and refining prompt formatting in patient summary documentation.
Enhance patient data processing by adding robust serialization and flattening utilities. The conversion function now captures all fields from EHR payloads, ensuring no data is lost during preprocessing. Additional logging improvements and error handling for plain text processing have been implemented.
Revert " implement `JobManager` for thread-safe asynchronous job tracking, including creation, updates, and cleanup, with supporting demo and test files."
Revert " implement `JobManager` for thread-safe asynchronous job tracking, including creation, updates, and cleanup, with supporting demo and test files."
Revert "feat: Establish AI medical extraction service with performance optimizations, unified model management, and detailed Hugging Face Spaces deployment guides."
Revert "feat: Add T4-optimized unified model manager for seamless loading and generation across various AI model types, updating model configurations."
feat: Establish AI medical extraction service with performance optimizations, unified model management, and detailed Hugging Face Spaces deployment guides.
Update maximum token limits to 8192 in `fallback_pipeline.py` and `unified_model_manager.py` for improved handling of longer inputs and enhanced performance in summary generation.
Update maximum token limits in `patient_summary_agent.py` and `summarizer.py` to 8192 for enhanced summary generation capabilities, allowing for better handling of longer inputs and fuller outputs.
Increase `max_new_tokens` limit to 8192 in `unified_model_manager.py` for improved summary generation, allowing for fuller outputs and better handling of longer inputs.
Remove obsolete documentation files related to HF Spaces fixes, including summaries and troubleshooting guides, to streamline the repository and reduce clutter. This cleanup enhances maintainability and focuses on the most relevant resources for users.
Update model configuration to increase maximum token limits for improved summary generation. Adjusted `max_length`, `max_new_tokens`, and context window settings from 2048 to 8192 in `model_config.py` and `unified_model_manager.py` for enhanced performance and better handling of longer inputs.
Enhance model loading and result handling in `async_patient_summary` by adding fallback tracking. Update `build_result_dict` to include fallback status and reason, improving error reporting and user feedback. Refactor model loading logic to propagate fallback information throughout the summary generation process.
Revert "Enhance patient summary generation by adding model information tracking in `build_result_dict` and related functions. Update `async_patient_summary` and `process_patient_summary_background` to accept and propagate model info, improving error handling and logging for model loading. Refactor model configuration in `async_patient_summary_optimized` to dynamically handle requested models and fallback scenarios, ensuring better user feedback and performance."
Enhance patient summary generation by adding model information tracking in `build_result_dict` and related functions. Update `async_patient_summary` and `process_patient_summary_background` to accept and propagate model info, improving error handling and logging for model loading. Refactor model configuration in `async_patient_summary_optimized` to dynamically handle requested models and fallback scenarios, ensuring better user feedback and performance.
Implement T4 Medium optimizations in model handling and logging. Enhance PatientSummarizerAgent for dynamic model configuration, allowing flexible model loading based on user input. Update environment variable checks for improved performance on Hugging Face Spaces. Refactor model generation settings for T4 compatibility, including memory management and generation parameters. Improve error handling and logging throughout the application for better user support.
Enhance PatientSummarizerAgent and user_models_config with improved environment variable handling for Hugging Face Spaces. Introduce async support for clinical summary generation and refine model loading error handling. Update model type definitions for clarity and adjust model retrieval functions to ignore active status.
Enhance model loading in PatientSummarizerAgent with improved error handling and fallback mechanism. Introduce environment variable check for HF Spaces, update logging for better clarity, and refine fallback summary generation to include extracted patient information and error details.