Spaces:

salvinjose
/

HNTAI

Paused

App Files Files Community

HNTAI

Commit History

Revert "Enhance model configuration and unified model manager to improve performance. Update max_length and max_new_tokens for consistency, and explicitly disable cache to prevent DynamicCache errors. Add logger import in FastAPI routes for better logging capabilities."

6303241

sachinchandrankallar commited on Oct 30, 2025

Enhance model configuration and unified model manager to improve performance. Update max_length and max_new_tokens for consistency, and explicitly disable cache to prevent DynamicCache errors. Add logger import in FastAPI routes for better logging capabilities.

28d1689

sachinchandrankallar commited on Oct 30, 2025

Implement Hugging Face Spaces configuration and memory management utilities. Enhance model loading and cleanup processes, enabling optimized deployment on HF Spaces. Update memory optimization settings and model configurations for improved performance and resource management.

b190ecb

sachinchandrankallar commited on Oct 30, 2025

Refactor patient summary generation to standardize custom prompt formatting. Update logic to ensure consistent structure across different modes, enhancing clarity and usability in generating comprehensive summaries. Adjust context handling to align with expected input formats for summarization models.

bcaa540

sachinchandrankallar commited on Oct 29, 2025

Enhance patient summary generation by introducing support for custom prompts. Modify the processing logic to append visit data when a custom prompt is provided, improving flexibility and user experience in generating patient summaries. Update related sections to ensure consistent handling of prompts across different modes.

7be0e14

sachinchandrankallar commited on Oct 29, 2025

Refactor patient summary generation to support a flexible structure, allowing for comprehensive summaries without enforcing fixed sections. Update related methods and prompts to enhance clarity and usability. Improve error handling and logging for summary generation processes, ensuring better performance and user experience.

2fb6319

sachinchandrankallar commited on Oct 29, 2025

Remove obsolete documentation and test files related to GGUF operations, streaming fixes, and device parameter handling. This cleanup enhances project maintainability by eliminating unused code and files that are no longer relevant to the current implementation.

8012840

sachinchandrankallar commited on Oct 29, 2025

Enhance patient summary generation with optimized parallel processing and intelligent chunking for large datasets. Introduce extended timeout configurations for complex cases, improving error handling and logging. Update API endpoints for large data processing and streaming, ensuring better performance and user experience. Refactor model loading to support OpenVINO and standard transformers with improved fallback strategies.

992b8bf

sachinchandrankallar commited on Oct 28, 2025

Refactor model management by replacing the legacy model manager with a unified model manager across the application. Update imports and method calls to ensure compatibility with the new structure. Enhance error handling and logging for model loading processes, improving overall performance and maintainability.

416c047

sachinchandrankallar commited on Oct 23, 2025

Refactor summarizer pipeline creation and enhance model loading for HF Spaces compatibility. Introduce a unified approach for model management, including new user models endpoint and improved error handling. Update model configurations and logging for better monitoring during model loading processes.

a5e6a2d

sachinchandrankallar commited on Oct 22, 2025

Implement global exception handling and memory-aware logging across the application. Introduce logging enhancements in the AI service to capture memory snapshots during errors and key operations. Update middleware for request/response logging and improve model loading with detailed progress updates. Refactor patient summary generation to include concise logging for each step, ensuring better monitoring and error handling.

117f00b

sachinchandrankallar commited on Oct 22, 2025

Remove 'Connection: keep-alive' header from event-stream response in patient summary generation. Update binary cache files for model configurations and loaders.

618340b

sachinchandrankallar commited on Oct 17, 2025

Revert "requirements chande"

b7ad04a

sachinchandrankallar commited on Oct 17, 2025

requirements chande

544efc0

sachinchandrankallar commited on Oct 17, 2025

Enhance GGUF model loading and generation process with improved progress updates and logging. Updated job status messages to include visual indicators for different stages of model loading and text generation. Streamlined the use of extended streaming for all requests to prevent timeout issues, ensuring a more responsive user experience.

8a71d89

sachinchandrankallar commited on Oct 17, 2025

streaming

82cec3c

sachinchandrankallar commited on Oct 17, 2025

Refactor GGUF model handling for HF Spaces compatibility. Adjusted timeouts for GGUF operations, introduced an extended SSE generator for long-running tasks, and optimized model loading with environment checks. Enhanced logging for job status and progress updates.

310522b

sachinchandrankallar commited on Oct 17, 2025

Enhance SSE generator with debug logging and improved responsiveness. Added debug statements for job status and completion, reduced sleep duration for more frequent updates, and updated CORS headers for API responses.

5493e02

sachinchandrankallar commited on Oct 17, 2025

Update application logs and model loading mechanisms; enhance error handling for Transformers models. Adjusted GGUF model path for improved loading and added new API routes for performance metrics. Cleaned up binary cache files and improved logging for model initialization and processing steps.

2d40a5d

sachinchandrankallar commited on Oct 17, 2025

Add Colab-ready patient summary generation script with robust JSON parsing and clinical summarization features. Includes flexible key matching, error handling, and detailed usage instructions for Google Colab. Implements a comprehensive patient summary generator with AI model integration and fallback mechanisms.

b1d4b09

sachinchandrankallar commited on Oct 17, 2025

prompt and timeout increased

5d2111a

sachinchandrankallar commited on Oct 16, 2025

Remove pycache from tracking and update .gitignore

3894956

sachinchandrankallar commited on Oct 16, 2025

Enhanced patient summary generation with robust data processing and flexible key matching. Introduced new API endpoints for performance metrics and cache management. Improved logging for better traceability during data handling and model generation.

511a880

sachinchandrankallar commited on Oct 16, 2025

Enhanced EHR data processing with robust key matching and error handling. Updated context window settings for model loading to 8192. Added new function for improved patient record processing.

92034ed

sachinchandrankallar commited on Oct 15, 2025

token limit set to 8192 and changed the section to 500 tokens each

353c355

sachinchandrankallar commited on Oct 15, 2025

truncation removed

c23102b

sachinchandrankallar commited on Oct 14, 2025

`Added memory monitoring and cleanup features to model loading and generation pipelines. Updated model manager to track memory usage and perform periodic cleanup. Added API endpoints for monitoring memory status and performance metrics.`

736ac0d

sachinchandrankallar commited on Oct 14, 2025

updates

202f345

sachinchandrankallar commited on Oct 10, 2025

Revert "memory limits"

0b59a2f

sachinchandrankallar commited on Oct 9, 2025

memory limits

c8bc93f

sachinchandrankallar commited on Oct 9, 2025

summary device parameter issue

c20c8b5

sachinchandrankallar commited on Oct 9, 2025

error fix

5dc4520

sachinchandrankallar commited on Oct 9, 2025

`Update dependencies and model loading for AI service`

3d8b270

sachinchandrankallar commited on Oct 9, 2025

summary progeress

a52d85f

sachinchandrankallar commited on Oct 9, 2025

`Added detection for Hugging Face Spaces and adjusted security headers accordingly`

d244373

sachinchandrankallar commited on Oct 8, 2025

landing screen for huggingface

93dff1a

sachinchandrankallar commited on Oct 8, 2025

Revert "/ removed"

b083114

sachinchandrankallar commited on Oct 8, 2025

/ removed

bc5f622

sachinchandrankallar commited on Oct 8, 2025

404

af0cc08

sachinchandrankallar commited on Oct 8, 2025

deployment issue fixes

4909775

sachinchandrankallar commited on Oct 8, 2025

deploy

8dbd6f6

sachinchandrankallar commited on Oct 8, 2025

overshadowing ai medextract shim removed

b77fd51

sachinchandrankallar commited on Oct 8, 2025

upload directory

fe9a7ad

sachinchandrankallar commited on Oct 8, 2025

`Refactor import logic and add logging in app.py`

1ffabfe

sachinchandrankallar commited on Oct 8, 2025

✅ Application imports successfully ✅ App instance creates without errors ✅ All agents initialize properly ✅ Routes register correctly ✅ No memory leaks or hanging operations

bfe20a9

sachinchandrankallar commited on Oct 7, 2025

service mesh

2890ee7

sachinchandrankallar commited on Oct 7, 2025

app.py

f95c45e

sachinchandrankallar commited on Oct 7, 2025

import error fix

d57d5a9

sachinchandrankallar commited on Oct 7, 2025

APP.PY CHANGES

86d1b99

sachinchandrankallar commited on Oct 7, 2025

`Refactor import logic and add logging in app.py`

af3c3e0

sachinchandrankallar commited on Oct 7, 2025

Commit History

Revert "Enhance model configuration and unified model manager to improve performance. Update max_length and max_new_tokens for consistency, and explicitly disable cache to prevent DynamicCache errors. Add logger import in FastAPI routes for better logging capabilities." 6303241

Enhance model configuration and unified model manager to improve performance. Update max_length and max_new_tokens for consistency, and explicitly disable cache to prevent DynamicCache errors. Add logger import in FastAPI routes for better logging capabilities. 28d1689

Implement Hugging Face Spaces configuration and memory management utilities. Enhance model loading and cleanup processes, enabling optimized deployment on HF Spaces. Update memory optimization settings and model configurations for improved performance and resource management. b190ecb

Remove obsolete documentation and test files related to GGUF operations, streaming fixes, and device parameter handling. This cleanup enhances project maintainability by eliminating unused code and files that are no longer relevant to the current implementation. 8012840

Remove 'Connection: keep-alive' header from event-stream response in patient summary generation. Update binary cache files for model configurations and loaders. 618340b

Revert "requirements chande" b7ad04a

requirements chande 544efc0

streaming 82cec3c

Refactor GGUF model handling for HF Spaces compatibility. Adjusted timeouts for GGUF operations, introduced an extended SSE generator for long-running tasks, and optimized model loading with environment checks. Enhanced logging for job status and progress updates. 310522b

Enhance SSE generator with debug logging and improved responsiveness. Added debug statements for job status and completion, reduced sleep duration for more frequent updates, and updated CORS headers for API responses. 5493e02

prompt and timeout increased 5d2111a

Remove __pycache__ from tracking and update .gitignore 3894956

Enhanced patient summary generation with robust data processing and flexible key matching. Introduced new API endpoints for performance metrics and cache management. Improved logging for better traceability during data handling and model generation. 511a880

Enhanced EHR data processing with robust key matching and error handling. Updated context window settings for model loading to 8192. Added new function for improved patient record processing. 92034ed

token limit set to 8192 and changed the section to 500 tokens each 353c355

truncation removed c23102b

`Added memory monitoring and cleanup features to model loading and generation pipelines. Updated model manager to track memory usage and perform periodic cleanup. Added API endpoints for monitoring memory status and performance metrics.` 736ac0d

updates 202f345

Revert "memory limits" 0b59a2f

memory limits c8bc93f

summary device parameter issue c20c8b5

error fix 5dc4520

`Update dependencies and model loading for AI service` 3d8b270

summary progeress a52d85f

`Added detection for Hugging Face Spaces and adjusted security headers accordingly` d244373

landing screen for huggingface 93dff1a

Revert "/ removed" b083114

/ removed bc5f622

404 af0cc08

deployment issue fixes 4909775

deploy 8dbd6f6

overshadowing ai medextract shim removed b77fd51

upload directory fe9a7ad

`Refactor import logic and add logging in app.py` 1ffabfe

✅ Application imports successfully ✅ App instance creates without errors ✅ All agents initialize properly ✅ Routes register correctly ✅ No memory leaks or hanging operations bfe20a9

service mesh 2890ee7

app.py f95c45e

import error fix d57d5a9

APP.PY CHANGES 86d1b99

`Refactor import logic and add logging in app.py` af3c3e0

Revert "Enhance model configuration and unified model manager to improve performance. Update max_length and max_new_tokens for consistency, and explicitly disable cache to prevent DynamicCache errors. Add logger import in FastAPI routes for better logging capabilities."

6303241

Enhance model configuration and unified model manager to improve performance. Update max_length and max_new_tokens for consistency, and explicitly disable cache to prevent DynamicCache errors. Add logger import in FastAPI routes for better logging capabilities.

28d1689

Implement Hugging Face Spaces configuration and memory management utilities. Enhance model loading and cleanup processes, enabling optimized deployment on HF Spaces. Update memory optimization settings and model configurations for improved performance and resource management.

b190ecb

Remove obsolete documentation and test files related to GGUF operations, streaming fixes, and device parameter handling. This cleanup enhances project maintainability by eliminating unused code and files that are no longer relevant to the current implementation.

8012840

Remove 'Connection: keep-alive' header from event-stream response in patient summary generation. Update binary cache files for model configurations and loaders.

618340b

Revert "requirements chande"

b7ad04a

requirements chande

544efc0

streaming

82cec3c

Refactor GGUF model handling for HF Spaces compatibility. Adjusted timeouts for GGUF operations, introduced an extended SSE generator for long-running tasks, and optimized model loading with environment checks. Enhanced logging for job status and progress updates.

310522b

Enhance SSE generator with debug logging and improved responsiveness. Added debug statements for job status and completion, reduced sleep duration for more frequent updates, and updated CORS headers for API responses.

5493e02

prompt and timeout increased

5d2111a

Remove pycache from tracking and update .gitignore

3894956

Enhanced patient summary generation with robust data processing and flexible key matching. Introduced new API endpoints for performance metrics and cache management. Improved logging for better traceability during data handling and model generation.

511a880

Enhanced EHR data processing with robust key matching and error handling. Updated context window settings for model loading to 8192. Added new function for improved patient record processing.

92034ed

token limit set to 8192 and changed the section to 500 tokens each

353c355

truncation removed

c23102b

`Added memory monitoring and cleanup features to model loading and generation pipelines. Updated model manager to track memory usage and perform periodic cleanup. Added API endpoints for monitoring memory status and performance metrics.`

736ac0d

updates

202f345

Revert "memory limits"

0b59a2f

memory limits

c8bc93f

summary device parameter issue

c20c8b5

error fix

5dc4520

`Update dependencies and model loading for AI service`

3d8b270

summary progeress

a52d85f

`Added detection for Hugging Face Spaces and adjusted security headers accordingly`

d244373

landing screen for huggingface

93dff1a

Revert "/ removed"

b083114

/ removed

bc5f622

404

af0cc08

deployment issue fixes

4909775

deploy

8dbd6f6

overshadowing ai medextract shim removed

b77fd51

upload directory

fe9a7ad

`Refactor import logic and add logging in app.py`

1ffabfe

✅ Application imports successfully ✅ App instance creates without errors ✅ All agents initialize properly ✅ Routes register correctly ✅ No memory leaks or hanging operations

bfe20a9

service mesh

2890ee7

app.py

f95c45e

import error fix

d57d5a9

APP.PY CHANGES

86d1b99

`Refactor import logic and add logging in app.py`

af3c3e0