HNTAI / PROGRESS_UPDATE.md
sachinchandrankallar's picture
Revert "Merge branch 'FT-DEV-17/09/2025'"
aba0d25
|
Raw
History Blame
1.2 kB
# GGUF Timeout Fix - Progress Update
## βœ… Completed Steps:
1. **Increased GGUF timeout**: Changed from 120s to 300s for Hugging Face Spaces
2. **Configurable timeout**: Added GGUF_GENERATION_TIMEOUT environment variable support
3. **Better error handling**: Enhanced timeout and fallback mechanisms in routes.py
4. **Fallback pipeline**: Added robust fallback when GGUF model fails to load or times out
## πŸ”§ Changes Made:
### model_loader_gguf.py:
- Updated `_generate_with_timeout()` to use 300s default for Spaces, 120s for local
- Made timeout configurable via environment variable
- Updated `generate()` to use configurable timeout
### routes.py:
- Added fallback pipeline usage when GGUF times out
- Added better logging for timeout errors
- Added fallback for GGUF model loading failures
- Improved error messages and response handling
## πŸš€ Next Steps:
- Test the changes with the GGUF model
- Verify timeout is sufficient for Phi-3 model
- Test fallback mechanisms
- Add progress logging for generation
## βš™οΈ Configuration:
- Default timeout: 300s (Spaces) / 120s (local)
- Environment variable: `GGUF_GENERATION_TIMEOUT`
- Fallback: Template-based summary when GGUF fails