Projects

Live

A/B Testing Memory Game

A live experiment studying how game difficulty affects player behavior. Play the memory game, contribute real data, and explore the published statistical analysis.

Astro / React / TailwindPostHogSupabasePython / statsmodels

How I Prompt 2025

Analyze your Claude AI conversation history. Discover your prompting style, persona classification, and get a shareable wrapped-style report.

PythonClaude AIData Visualization

Local LLM Bench

Evaluate local LLM accuracy on structured data extraction. Tests models' ability to extract JSON from unstructured text with ground-truth comparison, F1 scoring, and fuzzy matching. Supports MLX and Ollama backends.

LocalAIMLXOllamaPython