Refactor grpo dataset #3192

Jintao-Huang · 2025-02-20T02:19:39Z

No description provided.

Yimi81

Could this part be changed?

    if 'solution' in dataset.features:
        dataset = dataset.map(lambda x: {'__#solution': x['solution']}, **map_kwargs)

Change it to the following:

    if 'solution' in dataset.features or 'answer' in dataset.features:
        dataset = dataset.map(lambda x: {'__#solution': x['solution'] if 'solution' in dataset.features else x['answer']}, **map_kwargs)

Otherwise, datasets such as gsm8k, whose only columns are question and answer, still raise an error.

Jintao-Huang · 2025-02-20T03:33:02Z

> Could this part be changed?
>
>     if 'solution' in dataset.features:
>         dataset = dataset.map(lambda x: {'__#solution': x['solution']}, **map_kwargs)
>
> Change it to the following:
>
>     if 'solution' in dataset.features or 'answer' in dataset.features:
>         dataset = dataset.map(lambda x: {'__#solution': x['solution'] if 'solution' in dataset.features else x['answer']}, **map_kwargs)
>
> Otherwise, datasets such as gsm8k, whose only columns are question and answer, still raise an error.

This only needs a column mapping applied from the outside: pass --columns '{"answer": "solution"}' and it will work.
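To illustrate why renaming the column up front makes the extra `'answer'` branch unnecessary, here is a minimal sketch on plain Python dicts. It is not the ms-swift implementation: `rename_columns` and `add_solution_field` are hypothetical helpers that only imitate the effect of the `--columns` mapping and of the `'solution'` check quoted above.

```python
# Hypothetical stand-ins for illustration only; not ms-swift's actual code.

def rename_columns(rows, columns):
    """Return copies of the rows with keys renamed according to `columns`."""
    return [{columns.get(key, key): value for key, value in row.items()}
            for row in rows]


def add_solution_field(rows):
    """Imitate the quoted check: copy 'solution' into '__#solution' if present."""
    if rows and 'solution' in rows[0]:
        return [{**row, '__#solution': row['solution']} for row in rows]
    return rows


# A gsm8k-like dataset that only has 'question' and 'answer' columns.
gsm8k_like = [{'question': 'What is 2 + 3?', 'answer': '5'}]

# Same mapping as the JSON passed to --columns in the reply above.
mapped = rename_columns(gsm8k_like, {'answer': 'solution'})
processed = add_solution_field(mapped)
```

With the mapping applied first, the single `'solution'` check is enough; without it, the gsm8k-like rows pass through unchanged and no `'__#solution'` field is created.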

Yimi81 · 2025-02-20T03:35:13Z

ok~

…soth_fast_grpo

* commit '8921d9b98310d93f9f111af8859358ee32dce687': (46 commits)
  Support multiple vllms (modelscope#3202)
  update dataset & fix bugs (modelscope#3203)
  support vllm dp (modelscope#3201)
  fix setup.py (modelscope#3198)
  add links (modelscope#3193)
  Refactor grpo dataset (modelscope#3192)
  support r1 dataset (modelscope#3191)
  compat vllm==0.7.2 (modelscope#3083)
  support Knowledge Distillation sampling (modelscope#3185)
  Support GOT_OCR2_hf (modelscope#3182)
  Fix prm in sampler (modelscope#3184)
  fix sampler reaches max_length (modelscope#3180)
  refactor cosine orm (modelscope#3179)
  fix internvl-4b (modelscope#3178)
  Fix lmdeploy branch (modelscope#3145)
  Fix/agent grpo (modelscope#3172)
  fix streaming (modelscope#3176)
  fix max_length error (modelscope#3173)
  Support Agent GRPO (modelscope#3170)
  Fix ovis2 (modelscope#3169)
  ...

# Conflicts:
#   swift/llm/train/tuner.py

Jintao-Huang added 6 commits February 20, 2025 00:10

update dataset

9173f22

lint pass

cd36488

refactor grpo dataset

1dddccf

Merge branch 'main' into refactor_grpo_dataset

b029d4f

fix sample

53eac75

update

153e15f

tastelikefeet approved these changes Feb 20, 2025


Yimi81 reviewed Feb 20, 2025


update

fa9ef3d

Jintao-Huang merged commit b71827a into modelscope:main Feb 20, 2025
1 of 2 checks passed
