---
title: ZeroEngine V0.1
emoji: 🚀
colorFrom: gray
colorTo: gray
sdk: gradio
sdk_version: 6.5.0
app_file: app.py
pinned: false
license: apache-2.0
python_version: 3.11
---

# ZeroEngine V0.1 (Kernel)
A high-performance inference engine designed for hosts limited to 2 vCPUs and 16 GB of RAM.

## Optimizations
- **KV-Cache Stitching**: Asynchronous pre-evaluation of queue inputs.
- **Hard Partitioning**: Dedicated core assignment per concurrent user.
- **Memory Mapping**: Weights mapped via `mmap` to preserve RAM for context.
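
The last two items above can be illustrated with a minimal Python sketch. It is not ZeroEngine's actual implementation: the flat little-endian float32 file layout and the helper names (`write_demo_weights`, `open_weights`, `read_weight`, `plan_core_partitions`, `pin_current_process`) are assumptions made for illustration only. The idea is that a memory-mapped file is paged in on demand rather than copied into the process heap, and that each concurrent user can be given a disjoint set of cores.

```python
import mmap
import os
import struct


def write_demo_weights(path, values):
    """Write float32 values to a flat binary file (hypothetical weight format)."""
    with open(path, "wb") as f:
        f.write(struct.pack(f"<{len(values)}f", *values))


def open_weights(path):
    """Map the weight file read-only; pages are loaded lazily by the OS."""
    with open(path, "rb") as f:
        # The mapping stays valid after the file object is closed.
        return mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ)


def read_weight(mm, index):
    """Read one float32 at the given index without reading the whole file."""
    return struct.unpack_from("<f", mm, index * 4)[0]


def plan_core_partitions(num_cores, num_users):
    """Give each user a disjoint, equally sized set of core IDs."""
    if num_users < 1 or num_users > num_cores:
        raise ValueError("need between 1 and num_cores concurrent users")
    per_user = num_cores // num_users
    return {
        user: set(range(user * per_user, (user + 1) * per_user))
        for user in range(num_users)
    }


def pin_current_process(cores):
    """Pin the calling process to the given cores (Linux only)."""
    if hasattr(os, "sched_setaffinity"):
        os.sched_setaffinity(0, cores)
```

On a 2-vCPU host, `plan_core_partitions(2, 2)` assigns core 0 to the first user and core 1 to the second; a worker serving a user could then call `pin_current_process` with its assigned set. `os.sched_setaffinity` is only available on Linux, so the sketch skips pinning elsewhere.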