---
library_name: peft
base_model: meta-llama/Llama-2-7b-hf
tags:
- lora
- llama2
- mmlu
- data-selection
---
# offline-embedding
A LoRA adapter trained offline on data selected via **embedding**-based retrieval (2.5% of the full dataset).
> **Note**: This checkpoint is from a **single random seed** (seed=3) and a specific training step (step 1040). Results may vary across seeds.
## Details
| Key | Value |
|-----|-------|
| Base model | `meta-llama/Llama-2-7b-hf` |
| Task | MMLU |
| Data selection | Embedding Retrieval |
| Data ratio | 2.5% |
| Online | False |
| LoRA rank | 128 |
| LoRA alpha | 512 |
| Target modules | q_proj, k_proj, v_proj, o_proj |
| Seed | 3 |
| Checkpoint step | 1040 |
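
The LoRA settings in the table above map onto a `peft` `LoraConfig` roughly as follows. This is a sketch for reference only; `task_type` and the default dropout are assumptions not stated in this card.

```python
from peft import LoraConfig

# Reconstructed from the table above; task_type is an assumption,
# lora_dropout is left at the peft default since it is not documented here.
config = LoraConfig(
    r=128,
    lora_alpha=512,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
```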
## Usage
```python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the frozen base model, then attach this LoRA adapter on top of it.
base_model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
model = PeftModel.from_pretrained(base_model, "DATA-ADAPT/offline-embedding")
tokenizer = AutoTokenizer.from_pretrained("DATA-ADAPT/offline-embedding")
```
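
A minimal generation sketch using the loaded adapter. The prompt below is a hypothetical MMLU-style example, not taken from the training data, and generation settings are illustrative.

```python
import torch

# Hypothetical MMLU-style multiple-choice prompt.
prompt = (
    "The following is a multiple-choice question.\n"
    "Question: What is the capital of France?\n"
    "A. Berlin\nB. Madrid\nC. Paris\nD. Rome\n"
    "Answer:"
)

inputs = tokenizer(prompt, return_tensors="pt")
with torch.no_grad():
    outputs = model.generate(**inputs, max_new_tokens=8)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```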