lamng3/tiny-grpo: minimal hackable GRPO implementation lamng3/usaco-guide: learning content for usaco