You can not select more than 25 topics Topics must start with a chinese character,a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
Shuyan Zhou 8210cd1be0
Update README.md
5 months ago
.github/workflows fix type errors 6 months ago
agent fix typo 7 months ago
browser_env Merge remote-tracking branch 'origin/main' into new_eval 6 months ago
config_files remove duplicate "string_match" in "eval_types" for task 301 302 6 months ago
environment_docker Update README.md 5 months ago
evaluation_harness fix type errors 6 months ago
llms fix type errors 6 months ago
media add v2 execution trajectories 6 months ago
resources add v2 execution trajectories 6 months ago
scripts minor 6 months ago
tests Merge remote-tracking branch 'origin/main' into new_eval 6 months ago
.gitignore minor 6 months ago
.pre-commit-config.yaml add instruction for self-hosting webarena 9 months ago
CITATION.cff Create CITATION.cff 9 months ago
LICENSE release commit 9 months ago
README.md Update README.md 6 months ago
check_errors.sh release commit 9 months ago
minimal_example.py Update tests configs to fit the current settings 8 months ago
parallel_run.sh add parallel running script 7 months ago
prepare.sh release commit 9 months ago
requirements.txt fix type errors 6 months ago
run.py minor 6 months ago
setup.cfg update README 9 months ago
setup.py release commit 9 months ago