Skip to content

KeyError: 'tokenizer' #40

@Ther-nullptr

Description

@Ther-nullptr

When I run PiSSA/scripts/metamath_llama2_7b/run_qlora.sh and to save a checkpoint, I get the following error:

{'loss': 0.3431, 'grad_norm': 0.09538137912750244, 'learning_rate': 1.9506710479124212e-05, 'epoch': 0.13}                                                                                             
 13%|████████████████████▏                                                                                                                                         | 100/781 [33:26<3:49:05, 20.18s/it]Traceback (most recent call last):
  File "/home/yujin-wa20/projects/PiSSA/train.py", line 303, in <module>
    train()
  File "/home/yujin-wa20/projects/PiSSA/train.py", line 292, in train
    trainer.train(resume_from_checkpoint = resume_from_checkpoint_dir)
  File "/home/yujin-wa20/miniconda3/envs/halo/lib/python3.10/site-packages/transformers/trainer.py", line 2245, in train
    return inner_training_loop(
  File "/home/yujin-wa20/miniconda3/envs/halo/lib/python3.10/site-packages/transformers/trainer.py", line 2620, in _inner_training_loop
    self._maybe_log_save_evaluate(
  File "/home/yujin-wa20/miniconda3/envs/halo/lib/python3.10/site-packages/transformers/trainer.py", line 3101, in _maybe_log_save_evaluate
    self.control = self.callback_handler.on_save(self.args, self.state, self.control)
  File "/home/yujin-wa20/miniconda3/envs/halo/lib/python3.10/site-packages/transformers/trainer_callback.py", line 545, in on_save
    return self.call_event("on_save", args, state, control)
  File "/home/yujin-wa20/miniconda3/envs/halo/lib/python3.10/site-packages/transformers/trainer_callback.py", line 556, in call_event
    result = getattr(callback, event)(
  File "/home/yujin-wa20/projects/PiSSA/train.py", line 69, in on_save
    self.save_model(args, state, kwargs)
  File "/home/yujin-wa20/projects/PiSSA/train.py", line 66, in save_model
    kwargs["tokenizer"].save_pretrained(peft_model_path)
KeyError: 'tokenizer'
[rank0]: Traceback (most recent call last):
[rank0]:   File "/home/yujin-wa20/projects/PiSSA/train.py", line 303, in <module>
[rank0]:     train()
[rank0]:   File "/home/yujin-wa20/projects/PiSSA/train.py", line 292, in train
[rank0]:     trainer.train(resume_from_checkpoint = resume_from_checkpoint_dir)
[rank0]:   File "/home/yujin-wa20/miniconda3/envs/halo/lib/python3.10/site-packages/transformers/trainer.py", line 2245, in train
[rank0]:     return inner_training_loop(
[rank0]:   File "/home/yujin-wa20/miniconda3/envs/halo/lib/python3.10/site-packages/transformers/trainer.py", line 2620, in _inner_training_loop
[rank0]:     self._maybe_log_save_evaluate(
[rank0]:   File "/home/yujin-wa20/miniconda3/envs/halo/lib/python3.10/site-packages/transformers/trainer.py", line 3101, in _maybe_log_save_evaluate
[rank0]:     self.control = self.callback_handler.on_save(self.args, self.state, self.control)
[rank0]:   File "/home/yujin-wa20/miniconda3/envs/halo/lib/python3.10/site-packages/transformers/trainer_callback.py", line 545, in on_save
[rank0]:     return self.call_event("on_save", args, state, control)
[rank0]:   File "/home/yujin-wa20/miniconda3/envs/halo/lib/python3.10/site-packages/transformers/trainer_callback.py", line 556, in call_event
[rank0]:     result = getattr(callback, event)(
[rank0]:   File "/home/yujin-wa20/projects/PiSSA/train.py", line 69, in on_save
[rank0]:     self.save_model(args, state, kwargs)
[rank0]:   File "/home/yujin-wa20/projects/PiSSA/train.py", line 66, in save_model
[rank0]:     kwargs["tokenizer"].save_pretrained(peft_model_path)
[rank0]: KeyError: 'tokenizer'

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions