
Shengyi Costa Huang

vwxyzjn
http://costa.sh

Research interests

None yet

Organizations

cleanrl, trl internal testing, Model Database H4, TRL, ICML2023, Model Database Smol Cluster, lm-human-preference-details

Papers 2

arxiv:2111.08819
arxiv:2006.14171

Spaces 2

😻

Vwxyzjn Testyes4

Stopped
📊

Pyserini Wikipedia Kilt Doc

Models 27

vwxyzjn/train_policy_accelerate__None__seed1__1695136188

Text Generation • Updated 2 days ago

vwxyzjn/train_policy_accelerate-None-seed1

Text Generation • Updated 2 days ago

vwxyzjn/testyes4

Text Generation • Updated 2 days ago

vwxyzjn/testyes2

Text Generation • Updated 2 days ago

vwxyzjn/starcoderbase-triviaqa

Text Generation • Updated 23 days ago • 384

vwxyzjn/starcoderbase-triviaqa1

Text Generation • Updated 29 days ago • 1

vwxyzjn/starcoderbase_1_0_triviaqa

Text Generation • Updated Aug 17

vwxyzjn/Breakout-v5-cleanba_impala_envpool_machado_atari_wrapper-seed1

Reinforcement Learning • Updated Mar 25

vwxyzjn/Breakout-v5-cleanba_ppo_envpool_impala_atari_wrapper-seed1

Reinforcement Learning • Updated Mar 2

vwxyzjn/BigfishHard-v0-cleanba_ppo_envpool_procgen-seed1

Reinforcement Learning • Updated Feb 27

Datasets 3

vwxyzjn/summarize_from_feedback_tldr_3_filtered

Viewer • Updated 2 days ago

vwxyzjn/lm-human-preferences

Preview • Updated 21 days ago • 1.91k

vwxyzjn/lm-human-preferences-debug

Viewer • Updated Jul 14