evaluate_tournament
- negmas.tournaments.evaluate_tournament(tournament_path: str | Path | None, scores: DataFrame | None = None, stats: DataFrame | None = None, world_stats: DataFrame | None = None, type_stats: DataFrame | None = None, agent_stats: DataFrame | None = None, metric: str | Callable[[DataFrame], float] = 'mean', verbose: bool = False, recursive: bool = True, extra_scores_to_use: str | None = None, compile: bool = True) TournamentResults[source]
Evaluates the results of a tournament
- Parameters:
tournament_path – Path to save the results to. If scores is not given, it is also used as the source of scores. Pass None to avoid saving the results to disk.
scores – Optionally the scores of all agents in all world runs. If not given they will be read from the file scores.csv in
tournament_pathstats – Optionally the stats of all world runs. If not given they will be read from the file stats.csv in
tournament_pathworld_stats – Optionally the aggregate stats collected in
WorldSetRunStatsfor each world settype_stats – Optionally the aggregate stats collected in
AgentStatsfor each agent typeagent_stats – Optionally the aggregate stats collected in
AgentStatsfor each agent instancemetric – The metric used for evaluation. Possibilities are: mean, median, std, var, sum, truncated_mean or a callable that receives a pandas data-frame and returns a float.
verbose – If true, the winners will be printed
recursive – If true, ALL scores.csv files in all subdirectories of the given tournament_path will be combined
extra_scores_to_use – The type of extra-scores to use. If None normal scores will be used. Only effective if scores is None.
compile – Takes effect only if
tournament_pathis not None. If true, the results will be recompiled from individual world results. This is accurate but slow. If false, it will be assumed that all results are already compiled.independent_test (#) – True if you want an independent t-test
Returns: