Commit History

Revert "Enhance model configuration and unified model manager to improve performance. Update max_length and max_new_tokens for consistency, and explicitly disable cache to prevent DynamicCache errors. Add logger import in FastAPI routes for better logging capabilities."
6303241

sachinchandrankallar commited on

Enhance model configuration and unified model manager to improve performance. Update max_length and max_new_tokens for consistency, and explicitly disable cache to prevent DynamicCache errors. Add logger import in FastAPI routes for better logging capabilities.
28d1689

sachinchandrankallar commited on

Implement Hugging Face Spaces configuration and memory management utilities. Enhance model loading and cleanup processes, enabling optimized deployment on HF Spaces. Update memory optimization settings and model configurations for improved performance and resource management.
b190ecb

sachinchandrankallar commited on

Refactor patient summary generation to standardize custom prompt formatting. Update logic to ensure consistent structure across different modes, enhancing clarity and usability in generating comprehensive summaries. Adjust context handling to align with expected input formats for summarization models.
bcaa540

sachinchandrankallar commited on

Enhance patient summary generation by introducing support for custom prompts. Modify the processing logic to append visit data when a custom prompt is provided, improving flexibility and user experience in generating patient summaries. Update related sections to ensure consistent handling of prompts across different modes.
7be0e14

sachinchandrankallar commited on

Refactor patient summary generation to support a flexible structure, allowing for comprehensive summaries without enforcing fixed sections. Update related methods and prompts to enhance clarity and usability. Improve error handling and logging for summary generation processes, ensuring better performance and user experience.
2fb6319

sachinchandrankallar commited on

Remove obsolete documentation and test files related to GGUF operations, streaming fixes, and device parameter handling. This cleanup enhances project maintainability by eliminating unused code and files that are no longer relevant to the current implementation.
8012840

sachinchandrankallar commited on

Enhance patient summary generation with optimized parallel processing and intelligent chunking for large datasets. Introduce extended timeout configurations for complex cases, improving error handling and logging. Update API endpoints for large data processing and streaming, ensuring better performance and user experience. Refactor model loading to support OpenVINO and standard transformers with improved fallback strategies.
992b8bf

sachinchandrankallar commited on

Refactor model management by replacing the legacy model manager with a unified model manager across the application. Update imports and method calls to ensure compatibility with the new structure. Enhance error handling and logging for model loading processes, improving overall performance and maintainability.
416c047

sachinchandrankallar commited on

Refactor summarizer pipeline creation and enhance model loading for HF Spaces compatibility. Introduce a unified approach for model management, including new user models endpoint and improved error handling. Update model configurations and logging for better monitoring during model loading processes.
a5e6a2d

sachinchandrankallar commited on

Implement global exception handling and memory-aware logging across the application. Introduce logging enhancements in the AI service to capture memory snapshots during errors and key operations. Update middleware for request/response logging and improve model loading with detailed progress updates. Refactor patient summary generation to include concise logging for each step, ensuring better monitoring and error handling.
117f00b

sachinchandrankallar commited on

Remove 'Connection: keep-alive' header from event-stream response in patient summary generation. Update binary cache files for model configurations and loaders.
618340b

sachinchandrankallar commited on

Enhance GGUF model loading and generation process with improved progress updates and logging. Updated job status messages to include visual indicators for different stages of model loading and text generation. Streamlined the use of extended streaming for all requests to prevent timeout issues, ensuring a more responsive user experience.
8a71d89

sachinchandrankallar commited on

Refactor GGUF model handling for HF Spaces compatibility. Adjusted timeouts for GGUF operations, introduced an extended SSE generator for long-running tasks, and optimized model loading with environment checks. Enhanced logging for job status and progress updates.
310522b

sachinchandrankallar commited on

Enhance SSE generator with debug logging and improved responsiveness. Added debug statements for job status and completion, reduced sleep duration for more frequent updates, and updated CORS headers for API responses.
5493e02

sachinchandrankallar commited on

Update application logs and model loading mechanisms; enhance error handling for Transformers models. Adjusted GGUF model path for improved loading and added new API routes for performance metrics. Cleaned up binary cache files and improved logging for model initialization and processing steps.
2d40a5d

sachinchandrankallar commited on

Add Colab-ready patient summary generation script with robust JSON parsing and clinical summarization features. Includes flexible key matching, error handling, and detailed usage instructions for Google Colab. Implements a comprehensive patient summary generator with AI model integration and fallback mechanisms.
b1d4b09

sachinchandrankallar commited on

Remove __pycache__ from tracking and update .gitignore
3894956

sachinchandrankallar commited on

Enhanced patient summary generation with robust data processing and flexible key matching. Introduced new API endpoints for performance metrics and cache management. Improved logging for better traceability during data handling and model generation.
511a880

sachinchandrankallar commited on

Enhanced EHR data processing with robust key matching and error handling. Updated context window settings for model loading to 8192. Added new function for improved patient record processing.
92034ed

sachinchandrankallar commited on

token limit set to 8192 and changed the section to 500 tokens each
353c355

sachinchandrankallar commited on

`Added memory monitoring and cleanup features to model loading and generation pipelines. Updated model manager to track memory usage and perform periodic cleanup. Added API endpoints for monitoring memory status and performance metrics.`
736ac0d

sachinchandrankallar commited on

`Update dependencies and model loading for AI service`
3d8b270

sachinchandrankallar commited on

`Added detection for Hugging Face Spaces and adjusted security headers accordingly`
d244373

sachinchandrankallar commited on

`Refactor import logic and add logging in app.py`
1ffabfe

sachinchandrankallar commited on

βœ… Application imports successfully βœ… App instance creates without errors βœ… All agents initialize properly βœ… Routes register correctly βœ… No memory leaks or hanging operations
bfe20a9

sachinchandrankallar commited on

`Refactor import logic and add logging in app.py`
af3c3e0

sachinchandrankallar commited on