Models pretrained for the paper titled "PLDR-LLMs Reason At Self-Organized Criticality".
Burc Gokden
fromthesky
AI & ML interests
Large Language Models, Transformers, Natural Language Processing
Recent Activity
published a model 3 days ago: fromthesky/PLDR-LLM-v51-SOC-110M-5
updated a model 3 days ago: fromthesky/PLDR-LLM-v51-SOC-110M-4
updated a model 3 days ago: fromthesky/PLDR-LLM-v51-SOC-110M-3
Pretrained PLDR-LLMs with KVG cache (Pytorch/Transformers)
PLDR-LLMs pretrained for the paper titled "PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference".
- PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference
  Paper • 2502.13502 • Published • 3
- fromthesky/PLDR-LLM-v51-104M
  Text Generation • 0.1B • Updated
- fromthesky/PLDR-LLM-v51-110M-1
  Text Generation • 0.1B • Updated • 3
- fromthesky/PLDR-LLM-v51-110M-2
  Text Generation • 0.1B • Updated • 1
Pretrained PLDR-LLMs (Tensorflow)
PLDR-LLMs pretrained for the paper titled "PLDR-LLM: Large Language Model from Power Law Decoder Representations".
PLDR-LLMs Reason At Self-Organized Criticality
Models pretrained for the paper titled "PLDR-LLMs Reason At Self-Organized Criticality".
Finetuned PLDR-LLMs
Finetuned PLDR-LLMs with Hugging Face Transformers library support.