era-temporary

AI & ML interests

None defined yet.

Recent Activity

FlippyDora submitted a paper 19 days ago

Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models

FlippyDora authored a paper 3 months ago

PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary

FlippyDora submitted a paper 3 months ago

PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary

View all activity

submitted a paper to Daily Papers 19 days ago

Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models

Paper • 2603.13985 • Published 21 days ago • 10

authored a paper 3 months ago

PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary

Paper • 2601.10201 • Published Jan 15 • 9

submitted a paper to Daily Papers 3 months ago

PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary

Paper • 2601.10201 • Published Jan 15 • 9

updated a model 4 months ago

era-temporary/openvla-7b-era_dataset-b16-lr-0.0005-lora-r32-dropout-0.0

8B • Updated Nov 21, 2025 • 1

published a model 4 months ago

era-temporary/openvla-7b-era_dataset-b16-lr-0.0005-lora-r32-dropout-0.0

8B • Updated Nov 21, 2025 • 1

authored a paper 6 months ago

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

Paper • 2510.12693 • Published Oct 14, 2025 • 28

updated a model 6 months ago

era-temporary/eb_alfred_sft_best

4B • Updated Sep 24, 2025

published a model 6 months ago

era-temporary/eb_alfred_sft_best

4B • Updated Sep 24, 2025

updated a model 6 months ago

era-temporary/eb_man_sft_best

4B • Updated Sep 24, 2025 • 2

published a model 6 months ago

era-temporary/eb_man_sft_best

4B • Updated Sep 24, 2025 • 2

updated a model 7 months ago

era-temporary/eb_alfred_sft_stage1_grounding_action_full_planning_randomized

4B • Updated Sep 18, 2025

published a model 7 months ago

era-temporary/eb_alfred_sft_stage1_grounding_action_full_planning_randomized

4B • Updated Sep 18, 2025

updated a model 7 months ago

era-temporary/eb-alfred-external-know-env-anchored-lr1e-5-full-e1-bs-16

4B • Updated Sep 16, 2025 • 1

published a model 7 months ago

era-temporary/eb-alfred-external-know-env-anchored-lr1e-5-full-e1-bs-16

4B • Updated Sep 16, 2025 • 1

updated a model 7 months ago

era-temporary/eb_alfred_sft_openo1_1w

4B • Updated Sep 16, 2025

published a model 7 months ago

era-temporary/eb_alfred_sft_openo1_1w

4B • Updated Sep 16, 2025

updated a model 7 months ago

era-temporary/eb_alfred-ablation_action_sequence_only

4B • Updated Sep 15, 2025 • 1

published a model 7 months ago

era-temporary/eb_alfred-ablation_action_sequence_only

4B • Updated Sep 15, 2025 • 1

updated a model 7 months ago

era-temporary/eb_man-ablation_no_relative

4B • Updated Sep 15, 2025 • 7

published a model 7 months ago

era-temporary/eb_man-ablation_no_relative

4B • Updated Sep 15, 2025 • 7