Sleeping Agents RL S1000rr Finenv: Interact with water-treatment simulation tasks via JSON actions