ZeroEngine / README.md
turtle170's picture
Update README.md
111b6d9 verified
|
Raw
History Blame
510 Bytes
---
title: ZeroEngine V0.1
emoji: ๐Ÿš€
colorFrom: gray
colorTo: gray
sdk: gradio
sdk_version: 6.5.0
app_file: app.py
pinned: false
license: apache-2.0
python_version: 3.11
---
# ZeroEngine V0.1 (Kernel)
High-performance inference engine for 2-vCPU / 16GB RAM constraints.
## Optimizations
- **KV-Cache Stitching**: Asynchronous pre-evaluation of queue inputs.
- **Hard Partitioning**: Dedicated core assignment per concurrent user.
- **Memory Mapping**: weights mapped via `mmap` to preserve RAM for context.