Metharme Tokens
#6
by Mar2ck - opened
It's probably working as intended (?)
But I agree that model is much dumber on metharme than with mistral
I didn't want to touch the mistral vocab, so these Metharme tags are composed of multiple tokens.
>< = 4177 is not part of it. > and < should be separate (in case you need the token ids for something?).
I thought you might have repurposed some of the reserved tokens. The >< thing doesn't happen in real use so it's fine.
Mar2ck changed discussion status to closed
