Model save

Browse files

Files changed (5) hide show

README.md +72 -0
all_results.json +16 -0
eval_results.json +11 -0
model.safetensors +1 -1
train_results.json +8 -0

README.md ADDED Viewed

	@@ -0,0 +1,72 @@

+---
+library_name: transformers
+license: apache-2.0
+base_model: answerdotai/ModernBERT-base
+tags:
+- generated_from_trainer
+metrics:
+- accuracy
+- f1
+- precision
+- recall
+model-index:
+- name: arxiv-new-datasets-modernbert-v3
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# arxiv-new-datasets-modernbert-v3
+This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.1966
+- Accuracy: 0.9523
+- F1: 0.9523
+- Precision: 0.9523
+- Recall: 0.9523
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 16
+- eval_batch_size: 16
+- seed: 42
+- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_ratio: 0.1
+- num_epochs: 5
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     | Precision | Recall |
+|:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
+| 0.1198        | 1.0   | 425  | 0.2111          | 0.9311   | 0.9319 | 0.9360    | 0.9311 |
+| 0.1327        | 2.0   | 850  | 0.1966          | 0.9523   | 0.9523 | 0.9523    | 0.9523 |
+| 0.1925        | 3.0   | 1275 | 0.2396          | 0.9457   | 0.9458 | 0.9460    | 0.9457 |
+| 0.0004        | 4.0   | 1700 | 0.3799          | 0.9364   | 0.9357 | 0.9371    | 0.9364 |
+| 0.0           | 5.0   | 2125 | 0.3855          | 0.9417   | 0.9414 | 0.9417    | 0.9417 |
+### Framework versions
+- Transformers 4.48.0.dev0
+- Pytorch 2.10.0+cu128
+- Datasets 4.5.0
+- Tokenizers 0.21.4

all_results.json ADDED Viewed

	@@ -0,0 +1,16 @@

+{
+    "epoch": 5.0,
+    "eval_accuracy": 0.952317880794702,
+    "eval_f1": 0.9522791647926332,
+    "eval_loss": 0.1966489851474762,
+    "eval_precision": 0.9522548670060105,
+    "eval_recall": 0.952317880794702,
+    "eval_runtime": 1.7864,
+    "eval_samples_per_second": 422.641,
+    "eval_steps_per_second": 26.87,
+    "total_flos": 1.156532129378304e+16,
+    "train_loss": 0.13934201941052998,
+    "train_runtime": 244.329,
+    "train_samples_per_second": 138.911,
+    "train_steps_per_second": 8.697
+}

eval_results.json ADDED Viewed

	@@ -0,0 +1,11 @@

+{
+    "epoch": 5.0,
+    "eval_accuracy": 0.952317880794702,
+    "eval_f1": 0.9522791647926332,
+    "eval_loss": 0.1966489851474762,
+    "eval_precision": 0.9522548670060105,
+    "eval_recall": 0.952317880794702,
+    "eval_runtime": 1.7864,
+    "eval_samples_per_second": 422.641,
+    "eval_steps_per_second": 26.87
+}

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a9836b60782ecea9a9b9d7b9f2aaebf5b3f642fb5500f9b9676c25b9164e1355
 size 598439784

 version https://git-lfs.github.com/spec/v1
+oid sha256:34468ce93b23c4c5d4e9a8ffe9accf7e34f624e2814cd27d669a406f7d03774d
 size 598439784

train_results.json ADDED Viewed

	@@ -0,0 +1,8 @@

+{
+    "epoch": 5.0,
+    "total_flos": 1.156532129378304e+16,
+    "train_loss": 0.13934201941052998,
+    "train_runtime": 244.329,
+    "train_samples_per_second": 138.911,
+    "train_steps_per_second": 8.697
+}