fffiloni commited on
Commit
6934269
Β·
verified Β·
1 Parent(s): 7701cd5

Upload 3 files

Browse files
Files changed (1) hide show
  1. docs/NEXT_STEPS.md +24 -60
docs/NEXT_STEPS.md CHANGED
@@ -1,69 +1,33 @@
1
- # Next steps
2
 
3
- ## Current milestone
4
 
5
- V2 validates private Space creation and live Gradio API validation.
 
 
 
6
 
7
- ## Phase 3 β€” Pi inside the Job
8
 
9
- Add Pi as the first agentic runtime:
10
 
11
- ```text
12
- HF Job
13
- β†’ install Pi
14
- β†’ configure HF Inference Providers
15
- β†’ give Pi a simple goal
16
- β†’ let Pi modify the generated hello Space
17
- β†’ upload files
18
- β†’ validate live API
19
- β†’ collect Pi traces from ~/.pi/agent/sessions
20
- β†’ copy raw/private traces into the Bucket
21
- ```
22
 
23
- The first Pi task should be deliberately small:
24
 
25
- ```text
26
- Improve the title/description/examples of the generated Gradio app.
27
- Do not change the function signature.
28
- Do not mark success until the live Gradio API test passes.
29
- ```
30
 
31
- ## Phase 4 β€” Apply the gist recipe
32
 
33
- Give Pi a goal close to the article:
34
-
35
- ```text
36
- Read the HF Spaces Agent Quickstart gist.
37
- The deployed Space is your AI lab.
38
- Use hf CLI / Hub API.
39
- Read logs before patching.
40
- Verify through gradio_client.
41
- Do not declare success until live API validation passes.
42
- ```
43
-
44
- ## Phase 5 β€” First model-card template
45
-
46
- Add one narrow model family, ideally a simple Transformers pipeline or a standard Diffusers text-to-image model.
47
-
48
- ## Phase 6 β€” ZeroGPU
49
-
50
- Add a ZeroGPU-specific Gradio template with `@spaces.GPU` only around inference.
51
-
52
- ## Phase 2.1 fix
53
-
54
- This version fixes private Space validation with `gradio_client>=2.0.0`, whose
55
- `Client` constructor uses `token=` instead of `hf_token=`. The worker now uses
56
- signature introspection so older/newer versions can still work. Bucket events
57
- are also printed to stdout so HF Job logs are readable while the worker runs.
58
-
59
-
60
- ## V3 notes
61
-
62
- Phase 3 is a controlled Pi smoke test, not the full gist workflow yet. The next pass should make Pi read the HF Spaces Agent Quickstart gist and operate on the already-created target Space with logs β†’ patch β†’ live API validation.
63
-
64
- Potential improvements after the smoke test passes:
65
-
66
- - replace apt-based Node install with a Python+Node base image to reduce startup time;
67
- - expose a model/provider picker with known-good defaults;
68
- - improve Pi trace links for the Hub trace viewer;
69
- - add stronger secret scanning before showing traces.
 
1
+ # Next Steps
2
 
3
+ ## Current status
4
 
5
+ - Phase 1 validates OAuth β†’ Job β†’ Bucket.
6
+ - Phase 2 validates private Space creation/upload β†’ live API validation.
7
+ - Phase 3 validates Pi inside an HF Job editing `app.py` before upload.
8
+ - Phase 4 validates a closer article-style workflow: Pi receives the HF Spaces Agent Quickstart gist, uses `hf` CLI, creates/uploads a private Space, and the wrapper independently validates the live API.
9
 
10
+ ## Next pass: Phase 5
11
 
12
+ Add a narrow model-card workflow:
13
 
14
+ 1. User provides a model ID.
15
+ 2. Worker fetches `model_info`, README, pipeline tag, library name, and sibling files.
16
+ 3. Worker selects one of a few safe templates.
17
+ 4. Pi applies the same gist-style loop to create a private Space.
18
+ 5. Wrapper validates the live API.
 
 
 
 
 
 
19
 
20
+ Recommended first model families:
21
 
22
+ - simple Transformers text generation / text classification
23
+ - ASR with `pipeline("automatic-speech-recognition")`
24
+ - Diffusers text-to-image only after the CPU/simple path is stable
 
 
25
 
26
+ Keep these out of scope for Phase 5:
27
 
28
+ - Docker Spaces
29
+ - multi-GPU
30
+ - fine-tuning
31
+ - ComfyUI
32
+ - video generation
33
+ - public publication automation