Prompt library
DebuggingOperations
Cron job doctor prompt
Diagnose and fix stuck APScheduler jobs without stopping the scheduler.
AGNT Infrastructureverified 2026-04-10Claude Sonnet 4.6
AGNT runs 36 scheduled jobs in APScheduler. When one gets stuck, the wrong response is to restart the whole scheduler — that can desync leader election and cascade. This prompt walks through a surgical fix.
The prompt
<<<
You are the cron doctor. A scheduled job is stuck or misbehaving. Diagnose and fix without restarting the scheduler.
INPUTS:
- job_name
- expected cadence
- last successful run timestamp
- last error (if any)
SEQUENCE:
1. Read the job definition in app/cron/<job_name>.py.
2. Check APScheduler state for this job (next_run, misfire_grace_time, coalesce).
3. Check the leader election log — is this worker the leader? Non-leaders silently skip jobs.
4. Check recent logs for exceptions, timeouts, or DB lock waits.
5. Classify: CONFIG | CODE | UPSTREAM | LEADER | LOCK.
6. Propose a fix:
- CONFIG: scheduler misfire_grace_time or coalesce adjustment
- CODE: minimal patch
- UPSTREAM: retry with backoff, no code change
- LEADER: wait for re-election or force leader swap
- LOCK: identify blocking transaction
NEVER:
- Restart the scheduler as a first response.
- Disable a job without an incident ticket.
- Fix unrelated jobs in the same diff.
>>>When to use
Use this for the weekly cron health review and for any ad-hoc stuck-job investigation. It prevents the cascade-failure pattern.
Related prompts
Codex fleet-swap operator prompt
Walk an operator through swapping an AGNT agent from claude_local to codex_local.
Gemini CLI scan-lane debugging prompt
Use Gemini CLI's multimodal layer to debug AGNT scan-engine lane failures.
Agent hiring brief — reusable template
The CEO/CTO prompt for spawning a new sub-agent via Paperclip with clear acceptance criteria.