fix(lab/service): close 3 sweep-blocking lab service bugs (#254/#255/#256) #257
No reviewers
Labels
No labels
prio_critical
prio_low
type_bug
type_contact
type_issue
type_lead
type_question
type_story
type_task
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
lhumina_code/hero_skills!257
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "development_mik"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Summary
Closes 3 lab service bugs that surfaced during the hero_proc#102 service.toml + lab service sweep on its first repo (hero_code, hero_code#15 — squash-merged).
lab service <name> --startonly starts one binary_admincompanions. Doc-comment notes that dynamic discovery from service.toml is the proper long-term fix.lab servicedestructively deletes hero_proc rpc.sock on false-negative liveness probeping_hero_proc(&sock)probe immediately before thestd::fs::remove_file(&sock)call instart_hero_proc. Bail with a clear message if a daemon is still answering, instead of deleting the socket and orphaning supervised services.lab service hero_proc --startshells out toscreenwithout ensuring it's installedwhich screencheck at the top ofstart_hero_proc, BEFORE any state cleanup. Error message points atlab install base(which already installs screen in its apt list at base.rs:29).Test plan
cargo build --release -p labcleanlab service resetall→ bootstrap chain (hero_proc + hero_db + hero_aibroker + hero_code) →hero_proc service listshows all multi-binary services with both server + admin registered + runninglab service hero_code --startnow starts BOTHhero_code_server(6 smoke ✓) ANDhero_code_admin(2 smoke ✓) in a single invocation; previously only the server startedlab service hero_db --startnow starts BOTHhero_db_server(4 smoke ✓) ANDhero_db_admin(2 smoke ✓)hero_proc,hero_router,hero_runner_rhai, etc.) unaffected — SERVICE_MAP entries unchangedAcceptance gate carry — retroactive validation for hero_code#15
hero_code PR #15 documented criterion 5 as ✅ "with per-binary workaround" because lab couldn't start both binaries via service-name. After this PR lands, that workaround is no longer needed —
lab service hero_code --startis the canonical single-command path.Out of scope (filed separately or noted)
hero_aibroker_adminruntime failure: discovered during testing oflab service hero_aibroker --start— admin failed to start with a 10s validation timeout. Out of scope for this PR; s96 hero_lib sweep / subsequent hero_aibroker sweep will address.Signed-off-by: mik-tf
mik-tf referenced this pull request from lhumina_code/hero_demo2026-05-16 00:33:48 +00:00