Opsmas 2025 Day 11: pyjobby & hiproc

pyjobby

pyjobby is a python-centric, postgres-backed dynamic workqueue system. I updated it this year, but it is originally a system I wrote in 2021 as the backend to a photography sharing site I was working on (but never released, of course). Image processing pipelines can involve many steps (extract data, manage multi-resolution thumbnails, upload the image to cold storage, to CDN storage, to “middle-hot” storage, run object/person detection/extraction on GPU servers, update user billing for the new image storage, notify users of new image uploads if followers are registered) and it all needs to be cleanly tracked and managed, with overflow/backpressure/reporting/accounting systems in place.

hence, pyjobby.

big whoop, you think.

but wait - i made a neat job queue / work queue / distributed processing / reporting / accounting system this time.

pyjobby has/does:

  • work queue state machine flows: waiting → queued → claimed → running → finished (success) or crashed (error)
waiting ──┐
          │
          ├──> queued ──> claimed ──> running ──┬──> finished
          │                                     │
          └─────────────────────────────────────┴──> crashed
                                                          │
                                                          ├──> queued (retry)
                                                          └──> crashed (final)
  • postgres database with atomic row locking/checkout/updating as global queue primitive
  • any worker can join the job system by attaching to the database with a work capability descriptor
Machine 1:  pj --workers 4 --cap "web-1"
Machine 2:  pj --workers 4 --cap "web-2"
Machine 3:  pj --workers 8 --cap "ml-gpu" --queue ml
  • jobs are allocated based on capability matching (anything: has gpu? big ram? big cpu? storage? location?)

Route jobs to workers with specific resources:
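
for example, via the “direct database API writing” path mentioned below (a hedged sketch: the table and column names come from the schema later in this post, but the psycopg calls and exact SQL are my guesses, not pyjobby’s real helper API):

# hypothetical sketch: enqueue a GPU-only job by writing the row directly
import psycopg
from psycopg.types.json import Json

with psycopg.connect("dbname=pyjobby") as conn:
    conn.execute(
        """INSERT INTO jorb (state, job_class, kwargs, queue, capability)
           VALUES ('queued', %s, %s, %s, %s)""",
        ("job.ml.DetectFaces", Json({"image_id": 12345}), "ml", "ml-gpu"),
    )

only workers started with a matching capability (like the ml-gpu machine above) will ever claim that row.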

  • jobs can natively be assigned to a customer/user for tracking which jobs are for which “people” for activities
  • multiple job queues

  • duplicate prevention with idempotent keys

  • jobs can be submitted via web API or direct database API writing
  • tree/graph ability: can register jobs that only “wake up” when another job completes first

Group dependency for waiting on a set of concurrent jobs (run_group + waitfor_group):
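
a hedged sketch again (same caveats as above): fan out a group of jobs, then register a follow-up job that only wakes up once the whole group finishes:

# hypothetical sketch: three resize jobs share run_group=42; the notify job
# has waitfor_group=42 and sits in 'waiting' until the group completes
import psycopg
from psycopg.types.json import Json

with psycopg.connect("dbname=pyjobby") as conn:
    for size in ("small", "medium", "large"):
        conn.execute(
            """INSERT INTO jorb (state, job_class, kwargs, run_group)
               VALUES ('queued', %s, %s, %s)""",
            ("job.image.Resize", Json({"image_id": 7, "size": size}), 42),
        )
    conn.execute(
        """INSERT INTO jorb (state, job_class, kwargs, waitfor_group)
           VALUES ('waiting', %s, %s, %s)""",
        ("job.user.NotifyFollowers", Json({"image_id": 7}), 42),
    )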

  • priority levels (lower numbers run first; within the same prio level, jobs are processed FIFO by ascending id; see the claim-query sketch after this list)
  • recurring jobs/tasks, either built-in or by having a job re-schedule itself for a future delayed start time before it completes
  • crashed jobs auto-reschedule themselves via finally-block handling as the ultimate fallback
  • cli and web management interfaces; direct DB APIs and web APIs for managing job lifecycles and reporting
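
the atomic checkout from the feature list boils down to one query pattern; a minimal sketch (assuming psycopg and the jorb schema below; not pyjobby’s literal code):

# minimal sketch of an atomic claim: lowest prio first, FIFO by id within a
# prio level, skipping rows other workers have already locked
import psycopg

CLAIM_SQL = """
UPDATE jorb SET state = 'claimed'
WHERE id = (
    SELECT id FROM jorb
    WHERE state = 'queued'
      AND queue = %s
      AND run_after <= NOW()
      AND (capability IS NULL OR capability = %s)
    ORDER BY prio ASC, id ASC
    LIMIT 1
    FOR UPDATE SKIP LOCKED
)
RETURNING id, job_class, kwargs
"""

def claim_one(conn: psycopg.Connection, queue: str, capability: str):
    row = conn.execute(CLAIM_SQL, (queue, capability)).fetchone()
    conn.commit()
    return row  # None if nothing is runnable right now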

basic db schema:

  • id: Primary key (auto-increment)
  • state: Enum (waiting, queued, claimed, running, heartbeat, crashed, finished)
  • job_class: Full Python path to job class (e.g., “job.email.SendEmail”)
  • kwargs: JSONB of arguments passed to task(**kwargs)
  • queue: String identifier for job queue (default: “default”)
  • prio: Priority (lower number = higher priority, default: 0)
  • run_after: Minimum start time (TIMESTAMP, default: NOW())
  • capability: Required worker capability to run this job
  • waitfor_job: Job ID this job depends on
  • waitfor_group: Group ID this job depends on
  • run_group: Group ID this job belongs to
  • deadline_key: Unique key for singleton future jobs
  • result: JSONB result from successful job execution
  • backtrace: Error message and stack trace if job crashed
  • uid: User ID (for multi-tenant tracking; identifying resource hogs)
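
for concreteness, that roughly maps to a table like this (a reconstruction from the field list above, not the shipped DDL; types and defaults are guesses):

import psycopg

DDL = """
CREATE TABLE IF NOT EXISTS jorb (
    id            BIGSERIAL PRIMARY KEY,
    state         TEXT NOT NULL DEFAULT 'queued',  -- the real column is the enum above
    job_class     TEXT NOT NULL,
    kwargs        JSONB NOT NULL DEFAULT '{}',
    queue         TEXT NOT NULL DEFAULT 'default',
    prio          INTEGER NOT NULL DEFAULT 0,
    run_after     TIMESTAMPTZ NOT NULL DEFAULT NOW(),
    capability    TEXT,
    waitfor_job   BIGINT,
    waitfor_group BIGINT,
    run_group     BIGINT,
    deadline_key  TEXT UNIQUE,
    result        JSONB,
    backtrace     TEXT,
    uid           BIGINT
)
"""

with psycopg.connect("dbname=pyjobby") as conn:
    conn.execute(DDL)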

some rando examples
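
a couple of stand-in sketches in the same direct-to-DB style as earlier (same caveats: column names from the schema, everything else assumed):

# hypothetical: duplicate prevention via deadline_key (assumes a unique
# index on it), plus a self-scheduling "run again tomorrow" job
import psycopg
from psycopg.types.json import Json

with psycopg.connect("dbname=pyjobby") as conn:
    # a second insert with the same deadline_key is ignored instead of
    # queueing the digest email twice
    conn.execute(
        """INSERT INTO jorb (state, job_class, kwargs, deadline_key)
           VALUES ('queued', %s, %s, %s)
           ON CONFLICT (deadline_key) DO NOTHING""",
        ("job.email.SendEmail", Json({"uid": 9, "kind": "digest"}), "digest-9-2025-12-11"),
    )
    # recurring work: schedule the next run a day out; the job itself would
    # insert another row like this before it finishes
    conn.execute(
        """INSERT INTO jorb (state, job_class, kwargs, run_after)
           VALUES ('queued', %s, %s, NOW() + INTERVAL '1 day')""",
        ("job.report.Nightly", Json({})),
    )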

architecture

┌─────────────────────────────────────────────────────────────┐
│                     pj Command (CLI)                        │
│                      workit() Entry                         │
└──────────────────────┬──────────────────────────────────────┘
                       │
                       │ Spawns N workers via multiprocessing
                       ▼
        ┌──────────────────────────────────────────┐
        │                                          │
        ▼                                          ▼
┌───────────────┐                          ┌───────────────┐
│   Worker 1    │                          │   Worker N    │
│  JobSystem    │         ...              │  JobSystem    │
│   Instance    │                          │   Instance    │
└───────┬───────┘                          └───────┬───────┘
        │                                          │
        │ Polls DB every 5-6 seconds               │
        │ Optional: Listens for web requests       │
        │                                          │
        ▼                                          ▼
┌──────────────────────────────────────────────────────────┐
│                   PostgreSQL Database                    │
│                      'jorb' Table                        │
│                                                          │
│  States: waiting → queued → claimed → running →          │
│          finished (success) or crashed (error)           │
└──────────────────────────────────────────────────────────┘

the waiting state is a special condition for jobs not even eligible to run yet because they are waiting on conditions like parent or group jobs to complete. Only queued jobs can be fetched by workers.

  • job polling loop (what workers do)
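
roughly this shape, as a simplified sketch (it reuses claim_one() from the claim-query snippet earlier; the real loop also handles heartbeats, web requests, and reporting):

# simplified worker loop: claim, import the job class, run task(**kwargs),
# record the result or the crash (not pyjobby's literal code)
import importlib
import time
import psycopg
from psycopg.types.json import Json

def work_forever(dsn: str, queue: str, capability: str) -> None:
    conn = psycopg.connect(dsn)
    while True:
        row = claim_one(conn, queue, capability)
        if row is None:
            time.sleep(5)  # "polls DB every 5-6 seconds"
            continue
        job_id, job_class, kwargs = row
        try:
            conn.execute("UPDATE jorb SET state = 'running' WHERE id = %s", (job_id,))
            module_name, cls_name = job_class.rsplit(".", 1)
            cls = getattr(importlib.import_module(module_name), cls_name)
            result = cls().task(**kwargs)  # kwargs comes back as a dict from JSONB
            conn.execute(
                "UPDATE jorb SET state = 'finished', result = %s WHERE id = %s",
                (Json(result), job_id),
            )
        except Exception as exc:
            # the ultimate fallback: mark the crash so the job can be
            # rescheduled/retried instead of silently vanishing
            conn.execute(
                "UPDATE jorb SET state = 'crashed', backtrace = %s WHERE id = %s",
                (str(exc), job_id),
            )
        finally:
            conn.commit()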

there’s still room to improve the system with endless newer features, but these days it’s stable and feature-rich enough to shake a stick at.

stats


hiproc

I feel like half my brain is always just living in terminal history search/recall of previous commands. When I lose my terminal command history it sometimes takes me hours or days to reconstruct workflows if I didn’t have things documented or automated properly.

When something bad happens, like your terminal app crashing or your system rebooting, you lose all your per-terminal history and often get reset back to “the last terminal to close overwrote the global history for every terminal.” You lose dozens or hundreds of recent commands which had useful work happening inside of them (connecting to other systems, detailed report runs with command line arguments, ad-hoc ssh tunnels and port mappings all over the place), and then those commands are all just “gone forever” and have to be reconstructed from other logical outcomes (at least if you use “per-terminal” histories and not the disaster of the “shared global realtime history between all sessions” thing modern weird people use).

so when my system did its yearly “Freeze and OOM and crash” routine this summer, I had an idea: what if i had a way to, like, save commands? woah, nobody has thought of this before.

my requirements were:

  • easy to add commands
  • easy to recall commands
  • maybe multi-system compatible
  • maybe even save different commands per-context / directory / workspace but with the same recall name

so i dreamed up hiproc or hp for cli usage.

but then the next problem is: what architecture do you use? It must be near-instant to use; I don’t want “the curse of the interpreted VM” like all the python utilities with 1-3 second startup times just to run a command wrapper. I don’t want any command to “read a database of commands and pick the right one;” everything must be as active/responsive/low-latency/“online” as possible, which means a “live DB” in memory without any per-command-request loading/parsing/saving/updating in the request flow.

So I invented (yes, again, nobody has ever thought of this before): python in the back, compiled in the front.

hiproc uses a split architecture: the data model runs as a python API server, and the actual hp cli command you run is a tiny rust binary connecting to the python API for near-instant recall and exec of saved commands. (why rust? mainly because it has a clean package manager, so i could use web and json libraries and interfaces easily without having to reinvent everything in C for this project at least.)
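
to make the split concrete, the backend half is conceptually just a tiny always-running lookup service; a hypothetical sketch (FastAPI here for brevity; this is not hiproc’s actual route layout or storage):

# hypothetical shape of the "python in the back" half: the hp binary asks
# for a command by namespace/name and gets back the string to exec
from fastapi import FastAPI, HTTPException

app = FastAPI()

# the "live DB" idea: commands stay resident in memory, so there is no
# per-request load/parse step in the hot path
COMMANDS = {
    ("rust", "build"): "cargo build --release",
}

@app.get("/v1/command/{namespace}/{name}")
def lookup(namespace: str, name: str) -> dict:
    cmd = COMMANDS.get((namespace, name))
    if cmd is None:
        raise HTTPException(status_code=404, detail="no such command")
    return {"namespace": namespace, "name": name, "command": cmd}

the rust hp side then only has to make one HTTP call and exec the returned string, which is why startup stays near-instant.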

samples

Usage: hp <COMMAND>

Commands:
  find                  Interactively find and execute a command
  save                  Save a new command with smart defaults
  search                Search for commands
  namespaces            List all namespaces
  list                  List user's commands with IDs
  info                  Show detailed info about a command by ID
  here                  Show commands relevant to current directory and context
  suggest               Get intelligent command suggestions based on context
  similar               Show commands similar to a given command ID
  analytics             Show execution analytics and insights
  rename                Rename a command by ID
  delete                Delete one or more commands by ID
  edit                  Edit a command by ID
  generate-completions  Generate shell completion scripts
  exec                  Execute a command by ID with optional arguments (also: hp <id>)
  run                   Execute a command by name with smart contextual matching
  quick-save            Quick-save the last executed shell command
  do                    Execute and save a command with smart defaults
  help                  Print this message or the help of the given subcommand(s)

Options:
  -h, --help     Print help
  -V, --version  Print version

QUICK WORKFLOWS:
  hp save "command"        Save command with auto-detected name/namespace
  hp save "command" name   Save command with custom name, auto-detect namespace
  hp do "command"          Execute and save command in one step (alias: hp x)
  hp quick-save name       Save last shell command with custom name

DIRECT EXECUTION:
  hp <id>                  Execute stored command by ID
  hp <namespace> <name>    Execute stored command by namespace and name

Examples:
  hp save "cargo build"             # Saves as 'cargo' in current project namespace
  hp save "ls -la" list             # Saves as 'list' with auto-detected namespace
  hp do git status                  # Executes and saves 'git status' as 'git/status'
  hp 123                            # Run stored command ID 123
  hp rust build                     # Run 'build' command from 'rust' namespace

one neat benefit: if you configure the hiproc python backend to listen on a local network interface (not public, for glob’s sake), then you can point your hp config file at the multi-host service so multiple internal clients connect to the same “command backend,” then you can use hp run <cmd> across all your machines and update/manage them all centrally (everything converges back to “feeling like you have a globally auto-mounted NFS home directory” in the end i guess).

hiproc does support multiple users and hosts and namespaces all mixed together, but there’s currently no authentication or control other than “trust me, bro” so either add more features or only use it in a very narrow local scope.

Adding a new command is as simple as:
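
(using the hp save "command" name form from the help text above; the tunnel command itself is just a stand-in)

  hp save "ssh -N -L 5432:db.internal:5432 bastion" dbtunnel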

then recall is obviously:
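
(reusing the documented invocation forms: by name with contextual matching, or directly by ID)

  hp run dbtunnel
  hp 123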

But when you run hp save it also notices your hostname, your directory, and your user; so if you save “more specific” commands, it uses a hierarchy of “save depth” to run more specific commands first (though you can always use hp find or hp search to recall commands and run them by numeric database row id directly). or another way: you could save a deploy command in every project directory with different behaviors and just hp run deploy from any project dir for the correct behavior to be recalled (though, obviously, things like deploy should be better managed than ‘in a magic single-purpose un-revision-controlled command recall system’ amirite)
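
(a sketch of that per-directory pattern using the documented save form; the actual deploy commands are placeholders)

  cd ~/src/blog      && hp save "make publish" deploy
  cd ~/src/photosite && hp save "docker compose up -d --build" deploy
  # later, from either project dir:
  hp run deploy    # resolves to whichever save best matches this directory/host/user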

there’s also a tiny web interface for searching your commands across systems too.

there’s no “security” currently so, uh, only run it on localhost and never port forward it anywhere outside of your internal network. it can’t run commands autonomously, but if “bad actors” got into the system they could modify your expected commands into something else for when you go to run them again (TODO idea: have the hp binary check if a previously saved command changed and trigger another TOFU name<->cmd binding update for approval).

stats