woniu03/chatAI
Updated
Error code: JobManagerCrashedError
Need help to make the dataset viewer work? Open a discussion for direct support.
This repo collects chinese corpus for Supervised Finetuning (SFT) and Reinforcement Learning From Human Feedback (RLHF).
Chinese
An example looks as follows:
{
"prompt": "问题:有没有给未成年贷款的有的联系",
"answers":
[
{
"answer": "若通过招行办理,我行规定,贷款人年龄需年满18岁,且年龄加贷款年限不得超过70岁。如果您持有我行信用卡附属卡,可尝试办理预借现金。",
"score": 1
}
],
"prefix": "回答:"
}
An example looks as follows:
{
"prompt": "初学纹发现1/2\"的管螺纹并不是1\"的一半。不知道其中的原因,请各位指点。",
"answers":
[
{
"answer": "管螺纹的名义尺寸是“管子”的孔(内)径,而管子的壁厚不是两倍。所以,1/2\"的管螺纹并不是1\"的一半,",
"score": 1
}
],
"prefix": "回答:"
}
The data fields are the same among all splits.
prompt
: prompt, string
answers
: list of answersanswer
: answer, string
score
: score of answer, int
prefix
: prefix to the answer, string
prompt
: prompt, string
answers
: list of answersanswer
: answer, string
score
: score of answer, int
prefix
: prefix to the answer, string
name | train |
---|---|
train_data_external_v1.jsonl | 5477982 |
dev_data_external_v1.jsonl | 10000 |
Link to github: data_prepare