evaluate_tournament

negmas.tournaments.evaluate_tournament(tournament_path: str | Path | None, scores: DataFrame | None = None, stats: DataFrame | None = None, world_stats: DataFrame | None = None, type_stats: DataFrame | None = None, agent_stats: DataFrame | None = None, metric: str | Callable[[DataFrame], float] = 'mean', verbose: bool = False, recursive: bool = True, extra_scores_to_use: str | None = None, compile: bool = True) → TournamentResults

Evaluates the results of a tournament.

Parameters:

tournament_path – Path to save the results to. If scores is not given, it is also used as the source of scores. Pass None to avoid saving the results to disk.
scores – Optionally, the scores of all agents in all world runs. If not given, they will be read from the file scores.csv in tournament_path.
stats – Optionally, the stats of all world runs. If not given, they will be read from the file stats.csv in tournament_path.
world_stats – Optionally, the aggregate stats collected in WorldSetRunStats for each world set.
type_stats – Optionally, the aggregate stats collected in AgentStats for each agent type.
agent_stats – Optionally, the aggregate stats collected in AgentStats for each agent instance.
metric – The metric used for evaluation. Possibilities are: mean, median, std, var, sum, truncated_mean or a callable that receives a pandas data-frame and returns a float.
verbose – If true, the winners will be printed
recursive – If true, ALL scores.csv files in all subdirectories of the given tournament_path will be combined
extra_scores_to_use – The type of extra-scores to use. If None normal scores will be used. Only effective if scores is None.
compile – Takes effect only if tournament_path is not None. If true, the results will be recompiled from individual world results. This is accurate but slow. If false, it will be assumed that all results are already compiled.
independent_test (#) – True if you want an independent t-test
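
Example (illustrative sketch): the metric parameter accepts a callable that receives a pandas data-frame and returns a float. The snippet below shows what such a callable might look like. The function name trimmed_mean_score, its column and fraction arguments, and the assumption that the data-frame has a numeric "score" column are all hypothetical and not taken from the negmas documentation; the sample data is made up, and the function is exercised on a plain DataFrame rather than through evaluate_tournament.

```python
import pandas as pd


def trimmed_mean_score(
    df: pd.DataFrame, column: str = "score", fraction: float = 0.1
) -> float:
    """Mean of `column` after dropping the lowest and highest `fraction` of rows.

    Assumption: `df` has a numeric column named `column`. The actual layout of
    the frame that evaluate_tournament passes to a metric is not confirmed here.
    """
    values = df[column].sort_values()
    k = int(len(values) * fraction)  # number of rows to drop from each end
    trimmed = values.iloc[k : len(values) - k] if k > 0 else values
    return float(trimmed.mean())


# Made-up scores for two hypothetical agent types.
scores = pd.DataFrame(
    {
        "agent_type": ["A"] * 5 + ["B"] * 5,
        "score": [0.0, 1.0, 2.0, 3.0, 100.0, 1.0, 1.0, 2.0, 2.0, 4.0],
    }
)

# Apply the callable to each agent type's rows, one data-frame per type.
per_type = {
    name: trimmed_mean_score(group, fraction=0.2)
    for name, group in scores.groupby("agent_type")
}
print(per_type)

# Hypothetical use with the function documented above (not executed here):
# results = evaluate_tournament(tournament_path, metric=trimmed_mean_score)
```

A callable of this shape is mainly useful when a few extreme world runs would otherwise dominate the default 'mean' metric.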

Returns: