
Add re-sft scripts #14

Open
wants to merge 12 commits into main
Conversation

so298
Collaborator

@so298 so298 commented Feb 16, 2024

Configured with:

  • max_seq_length 4096
  • bf16
  • lr_scheduler_type cosine
  • warmup_ratio 0.1
  • learning_rate 2e-5 or 1e-4
  • global_batch_size = gradient_accumulation_steps * n_gpu * per_device_train_batch_size = 64
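
For reference, a minimal sketch of how these settings might map onto Hugging Face `TrainingArguments`, assuming the scripts use the transformers Trainer (the `output_dir` and the per-device/accumulation split below are illustrative, not taken from this PR; `max_seq_length` is applied at tokenization time and is not a `TrainingArguments` field):

```python
from transformers import TrainingArguments

# Illustrative values only -- the actual settings live in the added scripts.
# global_batch_size = gradient_accumulation_steps * n_gpu * per_device_train_batch_size = 64
training_args = TrainingArguments(
    output_dir="/model/7B_HF_RE/lora-all-jaster",  # one of the runs listed below
    bf16=True,
    learning_rate=2e-5,               # 1e-4 for the *-lr_1e-4 runs
    lr_scheduler_type="cosine",
    warmup_ratio=0.1,
    per_device_train_batch_size=8,    # single GPU: 8 * 8 * 1 = 64 (example split)
    gradient_accumulation_steps=8,
)
```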

Outputs are written under /model/7B_HF_RE/*. The following runs have been added:

  • lora-all-jaster
  • lora-all-jaster-lr_1e-4
  • lora-all-gpt4-self-inst
  • lora-all-gpt4-self-inst-lr_1e-4

@so298 so298 changed the title from "Add exp1 sft scripts" to "Add re-sft scripts" on Feb 16, 2024
@so298
Collaborator Author

so298 commented Feb 16, 2024

Added exp2 and exp3.

@so298
Collaborator Author

so298 commented Feb 17, 2024

For the gpt4-self-inst datasets in exp6 and exp8, single-GPU training ran out of memory, so training was switched to 1 node with 8 GPUs.

Adjusted to per_device_batch_size = 1 and gradient_accumulation_steps = 8 so that
$\text{global batch size} = N_\text{GPUs} \times \text{per device batch size} \times \text{gradient accumulation steps} = 8 \times 1 \times 8 = 64$.
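
As a quick sanity check, a minimal sketch of the arithmetic behind that adjustment (the helper name is illustrative, not from this repo):

```python
# Derive gradient_accumulation_steps for a fixed global batch size.
def grad_accum_steps(global_batch_size: int, n_gpus: int, per_device_batch_size: int) -> int:
    assert global_batch_size % (n_gpus * per_device_batch_size) == 0
    return global_batch_size // (n_gpus * per_device_batch_size)

print(grad_accum_steps(64, n_gpus=8, per_device_batch_size=1))  # -> 8
```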

@so298 so298 marked this pull request as ready for review May 22, 2024 13:53
@so298 so298 requested a review from a team as a code owner May 22, 2024 13:53
@so298
Collaborator Author

so298 commented May 22, 2024

@llm-jp/modelwg
I left this sitting as a draft for a while.
It's rather late, but could someone review and merge this?

@hiroshi-matsuda-rit
Member

I plan to start reviewing this tomorrow or later, but since I am not up to date on the state of many of these experiments, I may ask a number of clarifying questions. Thank you for your understanding.
