-
Notifications
You must be signed in to change notification settings - Fork 47
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* save eora to hf format Signed-off-by: Qubitium <[email protected]> * test needs to store self.x in cls to stay consistent Signed-off-by: Qubitium <[email protected]> * temp disable torch kernel auto compile that is causing dynamo errors Signed-off-by: Qubitium <[email protected]> * fix shape error Signed-off-by: ZX-ModelCloud <[email protected]> * cleanup debug logs Signed-off-by: Qubitium <[email protected]> * re-enable auto torch compile code Signed-off-by: Qubitium <[email protected]> * add lora config validation Signed-off-by: Qubitium <[email protected]> * refractor loading cache into AdapterCache cls Signed-off-by: Qubitium <[email protected]> * add lora rank override code from LoraConfig Signed-off-by: Qubitium <[email protected]> * remove `peft` dependency Signed-off-by: Qubitium <[email protected]> * comment on original HF repo path for test files Signed-off-by: Qubitium <[email protected]> * clean up HF download logic Signed-off-by: Qubitium <[email protected]> * save to PEFT compatible format Signed-off-by: Qubitium <[email protected]> * add test_quant_and_eora_transformers.py Signed-off-by: ZX-ModelCloud <[email protected]> * fix missing task_type in adapter_config.json Signed-off-by: ZX-ModelCloud <[email protected]> * fix regex rule prefix not stripped Signed-off-by: Qubitium <[email protected]> * push peft compat changes Signed-off-by: Qubitium <[email protected]> * prevent preft doing alpha / r scaling. set alpha eq r so math is just 1, no scaling Signed-off-by: Qubitium <[email protected]> * fix lora load with transformers Signed-off-by: ZX-ModelCloud <[email protected]> * fix device Signed-off-by: ZX-ModelCloud <[email protected]> * format Signed-off-by: Qubitium <[email protected]> * assert lora weight Signed-off-by: ZX-ModelCloud <[email protected]> * fix empty base_model_name_or_path Signed-off-by: ZX-ModelCloud <[email protected]> * assert dynamic rank Signed-off-by: ZX-ModelCloud <[email protected]> * remove dynamic adapter config when save quantize_config Signed-off-by: ZX-ModelCloud <[email protected]> * fix dynamic is none * [CI] install bitblas for test_inference_speed --------- Signed-off-by: Qubitium <[email protected]> Signed-off-by: ZX-ModelCloud <[email protected]> Co-authored-by: ZX-ModelCloud <[email protected]> Co-authored-by: CSY-ModelCloud <[email protected]>
- Loading branch information
1 parent
a6a8e82
commit f7b86a5
Showing
17 changed files
with
936 additions
and
150 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.