Update README code snippet

Generally, I recommend using model.encode_query() and model.encode_document() for users if they want to perform retrieval, as these are just encode but with the query/document prompts automatically applied. The 2nd change means that if someone does use model.encode() without any prompt or prompt_name, then it defaults to the document option (i.e. "<|im_start|>system\ndocument<|im_end|>\n<|im_start|>user\n"). This should give much better performance than not using any prompt at all.

You're totally free to update the README snippet/texts to your liking. I prefer adding an "expected similarity" though, so end users who run the models locally with various ways can have confidence that their version gives the expected results.

Tom Aarsen

Extend the README snippet to use encode_query/encode_document, default to document9508e3c0

tomaarsen changed pull request status to open about 5 hours ago

dilawarm

ZeroEntropy org about 5 hours ago

LGTM! Thank you, Tom! Will let @npip99 merge.

npip99 changed pull request status to merged about 1 hour ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment