What will happen if we train a Q function for digital agents?
HAO BAI
JackBAI
AI & ML interests
Representation learning, language models.
models 21
JackBAI/tti-new
JackBAI/webvoyager_allresults
JackBAI/short-to-long
JackBAI/aitw-general-digiq-agent
JackBAI/aitw-webshop-digiq-agent
JackBAI/llava-v1.5-7b-sfted-pad-inputtext
JackBAI/CRATE-GPT-12L-Pile-600000steps
JackBAI/webshop-off2on-filteredbc
JackBAI/general-off2on-filteredbc
JackBAI/general-off2on-digirl
Updated • 2
datasets 9
JackBAI/jack-latest-vllm-stack
Updated • 8
JackBAI/tinytlp
Viewer • Updated • 30k • 78
JackBAI/eval_data
Viewer • Updated • 9.64k • 14
JackBAI/autoui-zeroshot-trajectories
Preview • Updated • 16
JackBAI/pile_uncopyrighted_bin
Updated • 8
JackBAI/bert_pretrain_datasets
Viewer • Updated • 80.5M • 72 • 1
JackBAI/redbajama-sampled
Viewer • Updated • 24.3M • 66
JackBAI/merged_roberta_dataset
Updated • 7
JackBAI/chatgpt-woi-finetune
Preview • Updated • 25 • 3