Does BERTA support matryoshka dimensions?

by dantetemplar - opened 3 days ago

Hello, can't find information on that - FRIDA, BERTA, others models do not declare matryoshka representation, but it looks I have no loss when truncate dim up to 384.

sergeyzh

Owner 3 days ago

This is indeed an interesting observation. Although BERTA were not trained using Matryoshka Representation Learning (MRL), using a vector truncated by 2–3 times shows almost no drop in accuracy for most tasks. In my own testing, I tried various truncation methods (such as [:384], [384:], [0::2], and [1::2]) and did not observe any significant degradation either.

sergeyzh changed discussion status to closed 3 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment