Buckets:

HCAI-Lab/preconditioner-100k / sample_contract.json
download
raw
487 Bytes
{
"SAMPLE_TYPE": "uniform_random",
"SAMPLE_PURPOSE": "preconditioner_building",
"SAMPLE_TARGET_DOCS": 100000,
"SAMPLE_MIN_TOKENS": 512,
"SAMPLE_REALIZED_DOC_COUNT": 100000,
"SAMPLE_REALIZED_TOKEN_TOTAL": 251517816,
"SAMPLE_TOKEN_STATS": {
"min": 512,
"max": 468399,
"median": 1243,
"mean": 2515.2
},
"SAMPLE_UNIQUE_SHARDS": 38277,
"SAMPLE_TOTAL_DOCS_READ": 1098646162,
"SAMPLE_TOTAL_DOCS_BELOW_MIN_TOKENS": 289730295,
"SAMPLE_SAMPLING_SEED": 42
}

Xet Storage Details

Size:
487 Bytes
·
Xet hash:
7b20c8ef9331da268423dab9cafb8a27efd33067fe677efef64f2e7e3f82f816

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.