[RLlib] fix: Support custom eval functions that return zero eval_results, env_steps, or agent_steps ({}, 0, 0) #61563
petern48 wants to merge 3 commits into ray-project:master
Conversation
Signed-off-by: Peter Nguyen <petern0408@gmail.com>
Contributor
Code Review
This pull request addresses an issue where custom evaluation functions could not return falsy values like an empty dictionary {} or 0. The changes correctly adjust the validation logic to check for types instead of truthiness, allowing these valid return values. A new test case has been added to verify this fix. The implementation looks correct and effectively solves the described problem.
Note: Security Review did not run due to the size of the PR.
Contributor (Author)
cc @simonsays1980 as original author
Description
Custom evaluation functions are expected to return the following values: eval_results, env_steps, agent_steps. Sometimes, users may want to return {}, 0, 0 to skip the iteration (see the issue for an example). However, the code would raise a ValueError because it checks for falsy values. This PR updates the validation to check the exact types instead.
Additionally, I found that the code would throw a KeyError when the custom evaluation function didn't write to the metrics store. To handle this, I added default={} to the peek call. The same peek call a few lines down already does this.
Related issues
Fixed #61513
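The validation change described above can be sketched as follows. This is a hedged illustration of the falsy-vs-type-check distinction, not RLlib's actual code; the function name validate_eval_output is hypothetical.

```python
# Hypothetical sketch: validate a custom eval function's return values by
# type rather than truthiness. A truthiness check (`if not eval_results`)
# would wrongly reject the valid "skip this iteration" output ({}, 0, 0).
# The function name is illustrative; it is not RLlib's actual code.

def validate_eval_output(eval_results, env_steps, agent_steps):
    if not isinstance(eval_results, dict):
        raise ValueError(
            f"eval_results must be a dict, got {type(eval_results).__name__}"
        )
    if not isinstance(env_steps, int) or not isinstance(agent_steps, int):
        raise ValueError("env_steps and agent_steps must be ints")
    return eval_results, env_steps, agent_steps


# ({}, 0, 0) is now accepted instead of raising a ValueError.
assert validate_eval_output({}, 0, 0) == ({}, 0, 0)
```

With a truthiness check, the empty dict and zero step counts would all evaluate as falsy and trigger the error even though they are valid return values.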
Additional information
The following validation logic was originally added in this PR: #45652. It looks like the intention of that PR was to validate the inputs, not to prevent 0 or {} values.
ray/rllib/algorithms/algorithm.py
Lines 1633 to 1637 in 556e206
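The second fix (adding default={} to the peek call) follows a common dict-style lookup pattern. A minimal sketch of why the default avoids the KeyError, using a toy MetricsStore class that is purely illustrative and not RLlib's actual MetricsLogger API:

```python
# Toy metrics store illustrating the KeyError scenario. A custom eval
# function that never logs anything leaves the store empty, so peeking a
# key without a default raises KeyError. This class is hypothetical; it
# is not RLlib's MetricsLogger.

class MetricsStore:
    _SENTINEL = object()

    def __init__(self):
        self._metrics = {}

    def log(self, key, value):
        self._metrics[key] = value

    def peek(self, key, default=_SENTINEL):
        if key in self._metrics:
            return self._metrics[key]
        if default is MetricsStore._SENTINEL:
            # Without a default, a missing key raises -- exactly what
            # happened when the eval function wrote no metrics.
            raise KeyError(key)
        return default


store = MetricsStore()
# The custom eval function returned ({}, 0, 0) and logged nothing, so
# peeking with default={} yields an empty result instead of raising.
results = store.peek("evaluation", default={})
assert results == {}
```

Passing default={} makes the no-metrics case degrade gracefully to an empty result, matching the behavior of the other peek call mentioned in the description.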