All roles

Python Developer for AI Prototype (LLM + State Comparison, Short Project)

Remote · USA Full-time New today

Python Developer for AI Prototype (LLM + State Comparison, Short Project) ________________________________________ Description I’m looking for a developer to help build a lightweight AI prototype using OpenAI or Anthropic APIs. This is NOT a full product build. This is a focused prototype to test a specific idea. ________________________________________ Project Goal Build a simple Python-based system that: 1.Runs the same LLM task multiple times. 2. Captures outputs and any intermediate state (memory/logs). 3. Compares differences between runs. 4. Classifies differences into simple categories: o Stable o Boundary o Violation ________________________________________ What This Means Think:

  • Run the same prompt 5–10 times.
  • Log results.
  • Detect where outputs or stored data differ.
  • Label those differences.

That is it. ________________________________________ Technical Requirements Must have:

  • Python
  • Experience with OpenAI API or Anthropic API
  • Ability to build simple, clean scripts (no over-engineering)

Nice to have:

  • LangChain or similar frameworks.
  • Streamlit (for simple UI/dashboard).
  • Experience with logging or comparing outputs.

________________________________________ Important Constraints This should be:

  • Lightweight.
  • fast to build.
  • easy to understand.

Please DO NOT:

  • Design complex architectures.
  • build full systems.
  • over-engineer.

________________________________________ Deliverables

  • Python script or small app.
  • Ability to run repeated LLM tasks.
  • Stored logs of runs (JSON or similar).
  • Basic comparison logic between runs.
  • Simple classification output.

________________________________________ Timeline

  • 3–7 days initial build
  • Max 1–2 weeks total

________________________________________ Engagement Style

  • Fixed-price or hourly (open to discussion)
  • Will start with a small paid test task before full project

________________________________________ Screening Question (Required) Please answer this: If you needed to run the same LLM task multiple times and compare outputs/state between runs, how would you build it quickly? ________________________________________ Who This Is For Ideal candidate:

  • Builds fast prototypes.
  • Comfortable with LLM APIs.
  • Prefers simple solutions over complex systems.

________________________________________ Bonus If this goes well, there may be follow-on work. Apply tot his job Apply To this Job

Related roles