HCAI-Lab/dolma3-6t-sample-5000-docs/worker_0092
11.1 GB
56,043 files
Updated about 1 month ago
Name
Size
Uploaded
Xet hash
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023401.jsonl.zst
62.9 kB
xet
about 1 month ago
07879d43
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023405.jsonl.zst
91.8 kB
xet
about 1 month ago
42760ded
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023410.jsonl.zst
34.2 kB
xet
about 1 month ago
dcea3148
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023412.jsonl.zst
28 kB
xet
about 1 month ago
1cc95a17
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023416.jsonl.zst
44 kB
xet
about 1 month ago
444e1a88
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023417.jsonl.zst
59.2 kB
xet
about 1 month ago
a2014e6e
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023420.jsonl.zst
301 kB
xet
about 1 month ago
13f55d93
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023426.jsonl.zst
21.3 kB
xet
about 1 month ago
638fd3a8
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023428.jsonl.zst
98 kB
xet
about 1 month ago
2a5b0e9c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023430.jsonl.zst
67.6 kB
xet
about 1 month ago
f7d6fcf5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023432.jsonl.zst
66 kB
xet
about 1 month ago
86a1df83
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023436.jsonl.zst
60.7 kB
xet
about 1 month ago
177b2294
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023442.jsonl.zst
73.7 kB
xet
about 1 month ago
0614c0ce
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023444.jsonl.zst
81.2 kB
xet
about 1 month ago
058dd141
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023448.jsonl.zst
87.5 kB
xet
about 1 month ago
86499a42
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023449.jsonl.zst
38.7 kB
xet
about 1 month ago
d680b615
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023452.jsonl.zst
31.8 kB
xet
about 1 month ago
a76074f7
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023459.jsonl.zst
55.3 kB
xet
about 1 month ago
f5271f63
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023460.jsonl.zst
138 kB
xet
about 1 month ago
6de84e71
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023462.jsonl.zst
74.9 kB
xet
about 1 month ago
c4de13cc
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023463.jsonl.zst
140 kB
xet
about 1 month ago
60ac35a8
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023467.jsonl.zst
31.4 kB
xet
about 1 month ago
4bd6e5a6
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023475.jsonl.zst
70.2 kB
xet
about 1 month ago
f94578c5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023476.jsonl.zst
33 kB
xet
about 1 month ago
c759d327
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023479.jsonl.zst
64.4 kB
xet
about 1 month ago
ca1f67fd
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023480.jsonl.zst
6.56 kB
xet
about 1 month ago
a038ae6b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023484.jsonl.zst
26.7 kB
xet
about 1 month ago
56a0b6c3
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023489.jsonl.zst
37.3 kB
xet
about 1 month ago
3cbd82c9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023491.jsonl.zst
39.1 kB
xet
about 1 month ago
812ed12a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023494.jsonl.zst
77.3 kB
xet
about 1 month ago
b95b4cbe
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023496.jsonl.zst
34 kB
xet
about 1 month ago
d91aa893
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023506.jsonl.zst
60.5 kB
xet
about 1 month ago
e66329ad
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023507.jsonl.zst
23.1 kB
xet
about 1 month ago
cf8c8e85
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023511.jsonl.zst
103 kB
xet
about 1 month ago
368afaf9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023513.jsonl.zst
70.4 kB
xet
about 1 month ago
ffab6bb4
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023523.jsonl.zst
25.5 kB
xet
about 1 month ago
e2484050
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023524.jsonl.zst
84.4 kB
xet
about 1 month ago
9eef6b7e
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023526.jsonl.zst
37.5 kB
xet
about 1 month ago
daa00d35
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023528.jsonl.zst
21.6 kB
xet
about 1 month ago
dd851c97
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023530.jsonl.zst
85.5 kB
xet
about 1 month ago
e88fd184
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023538.jsonl.zst
214 kB
xet
about 1 month ago
5dae2107
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023540.jsonl.zst
11.5 kB
xet
about 1 month ago
72c97f2e
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023542.jsonl.zst
38 kB
xet
about 1 month ago
0d396edf
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023543.jsonl.zst
150 kB
xet
about 1 month ago
8b6aba79
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023545.jsonl.zst
52.6 kB
xet
about 1 month ago
8026e19a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023552.jsonl.zst
37.5 kB
xet
about 1 month ago
d3b8e978
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023555.jsonl.zst
38.1 kB
xet
about 1 month ago
9fa19b6d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023558.jsonl.zst
26.4 kB
xet
about 1 month ago
0ccaff04
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023559.jsonl.zst
24.6 kB
xet
about 1 month ago
f5e996d9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023560.jsonl.zst
53.5 kB
xet
about 1 month ago
c80099ee
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023570.jsonl.zst
117 kB
xet
about 1 month ago
72d73ba9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023574.jsonl.zst
20 kB
xet
about 1 month ago
37c2d4be
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023577.jsonl.zst
52.7 kB
xet
about 1 month ago
4d57cc29
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023584.jsonl.zst
52.5 kB
xet
about 1 month ago
2219e6cf
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023588.jsonl.zst
79.4 kB
xet
about 1 month ago
feb4645d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023589.jsonl.zst
45.9 kB
xet
about 1 month ago
c340772d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023591.jsonl.zst
34.7 kB
xet
about 1 month ago
8095937e
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023593.jsonl.zst
6.61 kB
xet
about 1 month ago
7ca82b46
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023602.jsonl.zst
6.13 kB
xet
about 1 month ago
db201985
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023607.jsonl.zst
32.1 kB
xet
about 1 month ago
b7ee91bc
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023608.jsonl.zst
17.1 kB
xet
about 1 month ago
683d4555
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023618.jsonl.zst
101 kB
xet
about 1 month ago
4e7e9aa2
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023619.jsonl.zst
77.1 kB
xet
about 1 month ago
79a1799b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023620.jsonl.zst
36.5 kB
xet
about 1 month ago
9ab6b96f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023622.jsonl.zst
56.3 kB
xet
about 1 month ago
8e712513
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023624.jsonl.zst
35.7 kB
xet
about 1 month ago
8f257a73
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023633.jsonl.zst
127 kB
xet
about 1 month ago
700b9aa0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023634.jsonl.zst
268 kB
xet
about 1 month ago
ca9947cf
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023635.jsonl.zst
29.3 kB
xet
about 1 month ago
673368fa
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023637.jsonl.zst
8.55 kB
xet
about 1 month ago
4b087027
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023640.jsonl.zst
38.9 kB
xet
about 1 month ago
e862c1ab
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023649.jsonl.zst
3.25 kB
xet
about 1 month ago
5719c826
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023650.jsonl.zst
159 kB
xet
about 1 month ago
73e418a9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023651.jsonl.zst
94.5 kB
xet
about 1 month ago
3f6380d0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023652.jsonl.zst
109 kB
xet
about 1 month ago
d81680b3
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023654.jsonl.zst
30.7 kB
xet
about 1 month ago
19501eb4
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023665.jsonl.zst
83.6 kB
xet
about 1 month ago
029edced
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023666.jsonl.zst
21.2 kB
xet
about 1 month ago
624f7391
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023670.jsonl.zst
37.9 kB
xet
about 1 month ago
73a61081
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023672.jsonl.zst
13.4 kB
xet
about 1 month ago
b6788dec
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023681.jsonl.zst
12.2 kB
xet
about 1 month ago
abffe8fc
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023682.jsonl.zst
38.5 kB
xet
about 1 month ago
45248c0c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023684.jsonl.zst
51 kB
xet
about 1 month ago
81181715
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023689.jsonl.zst
57.1 kB
xet
about 1 month ago
48e9d037
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023697.jsonl.zst
44.7 kB
xet
about 1 month ago
65862638
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023698.jsonl.zst
25.9 kB
xet
about 1 month ago
27f52aca
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023699.jsonl.zst
85.1 kB
xet
about 1 month ago
dd8ef442
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023700.jsonl.zst
18.8 kB
xet
about 1 month ago
ea633ff5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023704.jsonl.zst
69.7 kB
xet
about 1 month ago
4572112d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023712.jsonl.zst
54.2 kB
xet
about 1 month ago
41625170
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023713.jsonl.zst
60.6 kB
xet
about 1 month ago
1949ddb7
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023715.jsonl.zst
78.1 kB
xet
about 1 month ago
27aa11b9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023716.jsonl.zst
49.4 kB
xet
about 1 month ago
5714ffdd
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023719.jsonl.zst
12.3 kB
xet
about 1 month ago
7fb10278
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023729.jsonl.zst
31.2 kB
xet
about 1 month ago
64dc4322
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023731.jsonl.zst
112 kB
xet
about 1 month ago
2ae791cd
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023732.jsonl.zst
41.1 kB
xet
about 1 month ago
855642cc
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023735.jsonl.zst
151 kB
xet
about 1 month ago
ad8fedb7
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023744.jsonl.zst
27.4 kB
xet
about 1 month ago
e705bc28
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00023746.jsonl.zst
134 kB
xet
about 1 month ago
9360bfe4
Last updated
Mar 24
Pre-warmed CDN
US
EU