Spaces:
Sleeping
Sleeping
| title: ZeroEngine V0.1 | |
| emoji: ๐ | |
| colorFrom: gray | |
| colorTo: gray | |
| sdk: gradio | |
| sdk_version: 6.5.0 | |
| app_file: app.py | |
| pinned: false | |
| license: apache-2.0 | |
| python_version: 3.11 | |
| # ZeroEngine V0.1 (Kernel) | |
| High-performance inference engine for 2-vCPU / 16GB RAM constraints. | |
| ## Optimizations | |
| - **KV-Cache Stitching**: Asynchronous pre-evaluation of queue inputs. | |
| - **Hard Partitioning**: Dedicated core assignment per concurrent user. | |
| - **Memory Mapping**: weights mapped via `mmap` to preserve RAM for context. |