salvinjose/HNTAI

Commit History

Update Dockerfiles to use `asgi:app` as the entry point, resolving deployment issues caused by the removal of `app.py`. This change ensures compatibility with the new structure and improves initialization for production environments.

2b64d2e

sachinchandrankallar committed on Nov 6, 2025

Remove legacy `app.py` file and streamline startup process for Hugging Face Spaces. Refactor `start_hf_spaces.py` to simplify environment setup and application initialization. Enhance `ai_med_extract.app` with improved logging and error handling during app creation and agent initialization. Update route registration in `routes_fastapi.py` for better organization and clarity.

354c0a9

sachinchandrankallar committed on Nov 6, 2025

Revert "Refactor `routes_fastapi.py` to enhance performance and maintainability. Introduced `CacheManager`, `ErrorResponseBuilder`, and `PerformanceTracker` for optimized caching, consistent error handling, and improved performance metrics. Updated logging to use safe methods, eliminated redundant code, and maintained backward compatibility. Overall, these changes streamline the patient summary generation process and improve error visibility."

f65552c

sachinchandrankallar committed on Nov 6, 2025

Revert "Refactor `build_result_dict` function by moving it to `routes_helpers.py` to eliminate duplication and improve code organization. Updated timing calculations for better precision and added prompt information handling. This change enhances maintainability and streamlines the result building process."

3dc1795

sachinchandrankallar committed on Nov 6, 2025

Refactor `build_result_dict` function by moving it to `routes_helpers.py` to eliminate duplication and improve code organization. Updated timing calculations for better precision and added prompt information handling. This change enhances maintainability and streamlines the result building process.

a4c597e

sachinchandrankallar committed on Nov 6, 2025

Refactor `routes_fastapi.py` to enhance performance and maintainability. Introduced `CacheManager`, `ErrorResponseBuilder`, and `PerformanceTracker` for optimized caching, consistent error handling, and improved performance metrics. Updated logging to use safe methods, eliminated redundant code, and maintained backward compatibility. Overall, these changes streamline the patient summary generation process and improve error visibility.

91fbcb8

sachinchandrankallar committed on Nov 6, 2025

Refactor patient summary processing to improve job status updates. Removed redundant progress updates and ensured accurate visit count reporting after data parsing and computation steps. Enhanced error handling and streamlined the workflow for better maintainability.

f4b93b9

sachinchandrankallar committed on Nov 6, 2025

Enhance patient summary generation with improved progress updates and error handling. Updated SSEGenerator to ensure frequent data transmission, preventing HTTP/2 protocol errors. Refined job status monitoring and heartbeat intervals for better connection stability during long-running tasks. Enhanced user feedback with detailed progress messages throughout the generation process.

bf04da8

sachinchandrankallar committed on Nov 6, 2025

Implement timeout protection and progress updates for patient summary generation. Enhanced error handling for both text generation and summarization processes, ensuring robust job management and improved user feedback during long-running tasks. Updated request queue management to handle job IDs more flexibly, allowing for better tracking and processing of requests.

9c0cc6f

sachinchandrankallar committed on Nov 6, 2025

Enhance patient summary processing with queue management and improved error handling. Introduced a queue manager to handle request slots, ensuring efficient processing and timeout management. Updated background task logic to include performance metrics and detailed error responses, enhancing overall reliability and maintainability of the patient summary generation workflow.

47d30e5

sachinchandrankallar committed on Nov 6, 2025

Enhance SSEGenerator job monitoring and error handling. Introduced a mechanism to wait for job creation before erroring out, improved timeout handling to send warnings instead of stopping processing, and adjusted max wait times for operations. Updated heartbeat and progress reporting to ensure more reliable streaming responses.

d58f7ff

sachinchandrankallar committed on Nov 6, 2025

Refactor streaming response handling in patient summary generation to utilize a centralized SSE generator service. This change simplifies the code by removing custom streaming logic, enhances job status monitoring, and improves error handling. The job management process is also streamlined for better maintainability and performance.

11a1181

sachinchandrankallar committed on Nov 6, 2025

Refactor patient summary generation to enhance performance and reliability. Key improvements include a centralized job management service, standardized error handling, and optimized SSE generation. Introduced new constants for data size thresholds and chunking configurations, ensuring better maintainability and scalability. All changes maintain backward compatibility and improve overall code quality.

6d48abb

sachinchandrankallar committed on Nov 6, 2025

Refactor PyTorch compatibility handling by centralizing the RMSNorm patch into a dedicated utility function. This ensures consistent application across modules and improves maintainability. Update logging to reflect the new approach.

299444a

sachinchandrankallar committed on Nov 6, 2025

Implement RMSNorm patch for PyTorch in ai_med_extract modules to ensure compatibility with models like Phi-3, enhancing tensor normalization functionality and logging.

84a9aa6

sachinchandrankallar committed on Nov 6, 2025

Remove obsolete .pyc files and add RMSNorm compatibility patch for PyTorch in model_loader_spaces.py, enhancing error handling and fallback mechanisms for model loading.

7fcf280

sachinchandrankallar committed on Nov 6, 2025

Revert "Add RMSNorm implementation to torch if missing for compatibility with HF models"

9c0908e

sachinchandrankallar committed on Nov 6, 2025

Revert "Refactor async_patient_summary to unify model selection and enhance summary generation. Introduce robust fallback mechanisms for model types, including support for summarization, seq2seq, gguf, and causal-openvino. Improve logging and error handling for better diagnostics during summary generation."

03de9e2

sachinchandrankallar committed on Nov 6, 2025

Refactor async_patient_summary to unify model selection and enhance summary generation. Introduce robust fallback mechanisms for model types, including support for summarization, seq2seq, gguf, and causal-openvino. Improve logging and error handling for better diagnostics during summary generation.

d288f98

sachinchandrankallar committed on Nov 6, 2025

Add RMSNorm implementation to torch if missing for compatibility with HF models

a127f51

sachinchandrankallar committed on Nov 6, 2025

Revert "requirements update"

3463c83

sachinchandrankallar committed on Nov 6, 2025

Revert "test"

54d246e

sachinchandrankallar committed on Nov 6, 2025

test

c8449e4

sachinchandrankallar committed on Nov 6, 2025

requirements update

1e7907f

sachinchandrankallar committed on Nov 6, 2025

refactor

5b000dc

sachinchandrankallar committed on Nov 5, 2025

Refactor text generation in routes_fastapi.py to return raw summaries instead of formatted markdown. Remove unnecessary markdown processing functions and streamline summary handling, enhancing performance and clarity in the output structure.

6aa6b6a

sachinchandrankallar committed on Nov 3, 2025

Enhance caching behavior in text generation processes across multiple files. Update patient_summary_agent.py and routes_fastapi.py to ensure proper dynamic cache handling, preventing stale cache issues during single generations. Modify model_loader_spaces.py and unified_model_manager.py to explicitly manage cache settings based on model capabilities, improving overall generation reliability. Update binary files in pycache directories.

6202dd0

sachinchandrankallar committed on Nov 3, 2025

Revert "merge conflicts"

6c585d3

sachinchandrankallar committed on Nov 3, 2025

merge conflicts

3de57f3

sachinchandrankallar committed on Oct 31, 2025

Refactor memory management and logging in routes_fastapi.py to enhance monitoring and prevent leaks. Introduce helper functions for safe logging and streamline text generation processes. Update cleanup_memory function to provide detailed memory usage metrics and warnings for high usage scenarios, improving overall performance and reliability.

83603a0

sachinchandrankallar committed on Oct 31, 2025

Update requirements to pin transformers version and modify caching behavior for OpenVINO models. Adjust logic in routes_fastapi.py to disable cache for compatibility with newer transformers, ensuring stability in model generation processes.

16be7d2

sachinchandrankallar committed on Oct 31, 2025

Revert "change routes_fastapi.py to use unifiedmodel loader for everywhere"

29a03e3

sachinchandrankallar committed on Oct 30, 2025

change routes_fastapi.py to use unifiedmodel loader for everywhere

c6110a9

sachinchandrankallar committed on Oct 30, 2025

fix

b2ba308

sachinchandrankallar committed on Oct 30, 2025

map causal openvino to text generation

ebbec22

sachinchandrankallar committed on Oct 30, 2025

Refactor text generation logic to utilize unified model manager and improve model loading and configuration

bc359db

sachinchandrankallar committed on Oct 30, 2025

Update transformers version to 4.57.1 for improved compatibility and features

bd0be8c

sachinchandrankallar committed on Oct 30, 2025

Update transformers version to 4.57.1 for improved compatibility and features

18d3466

sachinchandrankallar committed on Oct 30, 2025

Revert "Pin transformers version to 4.37.0 for compatibility with existing dependencies"

c729e2e

sachinchandrankallar committed on Oct 30, 2025

Pin transformers version to 4.37.0 for compatibility with existing dependencies

0f74ef3

sachinchandrankallar committed on Oct 30, 2025

Enhance caching and chunking mechanisms in PatientSummarizerAgent for improved performance and reliability

38e2f33

sachinchandrankallar committed on Oct 30, 2025

Refactor caching configuration in OpenVinoPipeline to allow models to manage their own caching behavior, improving compatibility and flexibility.

be33ded

sachinchandrankallar committed on Oct 30, 2025

Revert "Enhance error logging in model generation and pipeline handling to improve debugging capabilities for DynamicCache and GGUF wrapper failures."

1666dba

sachinchandrankallar committed on Oct 30, 2025

Enhance error logging in model generation and pipeline handling to improve debugging capabilities for DynamicCache and GGUF wrapper failures.

7f933a5

sachinchandrankallar committed on Oct 30, 2025

Refactor text generation handling in OpenVinoPipeline to prioritize max_new_tokens over max_length, ensuring proper token management for causal models.

871e862

sachinchandrankallar committed on Oct 30, 2025

Update caching behavior in model configuration to use None for use_cache, allowing the model to manage caching dynamically.

12df82a

sachinchandrankallar committed on Oct 30, 2025

Refactor caching behavior in model configuration and pipeline to prevent DynamicCache errors. Set use_cache to None for model's default handling and update related settings in TransformersModel generation parameters.

2b5dd8c

sachinchandrankallar committed on Oct 30, 2025

revert

bb973cb

sachinchandrankallar committed on Oct 30, 2025

transformers downgraded

195c13e

sachinchandrankallar committed on Oct 30, 2025

Add cache configuration and max length handling in OpenVinoPipeline

b92b395

sachinchandrankallar committed on Oct 30, 2025
