Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Buckets:
HCAI-Lab
/
dolma3-6t-sample-5000-docs
Follow
Human-Centered AI Lab
9
Files
xet
HCAI-Lab/dolma3-6t-sample-5000-docs
/
worker_0086
11.1 GB
56,043 files
Updated about 1 month ago
Ctrl+K
Name
Size
Uploaded
Xet hash
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004249.jsonl.zst
141 kB
xet
about 1 month ago
2d4c65e0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004275.jsonl.zst
13 kB
xet
about 1 month ago
d28f2e71
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004292.jsonl.zst
9.8 kB
xet
about 1 month ago
3d18c964
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004316.jsonl.zst
17.1 kB
xet
about 1 month ago
d05e6f04
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004317.jsonl.zst
21.7 kB
xet
about 1 month ago
904c18b1
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004328.jsonl.zst
49 kB
xet
about 1 month ago
5fdbad88
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004334.jsonl.zst
45.2 kB
xet
about 1 month ago
c4cea8f8
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004368.jsonl.zst
16.4 kB
xet
about 1 month ago
fd5d4eb1
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004398.jsonl.zst
19.2 kB
xet
about 1 month ago
19b51707
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004400.jsonl.zst
75.5 kB
xet
about 1 month ago
8047a84d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004418.jsonl.zst
45.8 kB
xet
about 1 month ago
f5d998f3
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004453.jsonl.zst
7.25 kB
xet
about 1 month ago
d423e5dc
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004454.jsonl.zst
14.5 kB
xet
about 1 month ago
43f8894a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004482.jsonl.zst
33 kB
xet
about 1 month ago
14116d83
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004505.jsonl.zst
77.7 kB
xet
about 1 month ago
aa54b5ad
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004512.jsonl.zst
31.8 kB
xet
about 1 month ago
196b8bc1
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004553.jsonl.zst
48.6 kB
xet
about 1 month ago
760db8ea
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004568.jsonl.zst
37.6 kB
xet
about 1 month ago
ad8c9d55
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004571.jsonl.zst
55.8 kB
xet
about 1 month ago
e157328d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004580.jsonl.zst
27.2 kB
xet
about 1 month ago
76faf685
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004657.jsonl.zst
21.5 kB
xet
about 1 month ago
ae60578c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004669.jsonl.zst
23.9 kB
xet
about 1 month ago
fe70b756
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004689.jsonl.zst
89.2 kB
xet
about 1 month ago
726661f7
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004692.jsonl.zst
31.7 kB
xet
about 1 month ago
06f13cbd
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004699.jsonl.zst
80.3 kB
xet
about 1 month ago
a235bfc4
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004732.jsonl.zst
32.4 kB
xet
about 1 month ago
61e3bbd7
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004755.jsonl.zst
14.8 kB
xet
about 1 month ago
1484182f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004759.jsonl.zst
15.8 kB
xet
about 1 month ago
dc7da6ad
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004798.jsonl.zst
14.2 kB
xet
about 1 month ago
aa42aa51
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004810.jsonl.zst
23.4 kB
xet
about 1 month ago
96f1ccca
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004815.jsonl.zst
23.9 kB
xet
about 1 month ago
f7a4f013
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004833.jsonl.zst
59.6 kB
xet
about 1 month ago
b7e55fb2
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004897.jsonl.zst
7.4 kB
xet
about 1 month ago
524747fb
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004906.jsonl.zst
40.8 kB
xet
about 1 month ago
a2989fe5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004911.jsonl.zst
12.9 kB
xet
about 1 month ago
19a377d9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004916.jsonl.zst
69.8 kB
xet
about 1 month ago
52c4e8d8
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004925.jsonl.zst
36.5 kB
xet
about 1 month ago
e9c621b4
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004976.jsonl.zst
13.5 kB
xet
about 1 month ago
98dc6571
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004982.jsonl.zst
14.8 kB
xet
about 1 month ago
b98b5e62
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00004993.jsonl.zst
33.1 kB
xet
about 1 month ago
dc2caa41
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005013.jsonl.zst
9.78 kB
xet
about 1 month ago
56c91ab1
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005040.jsonl.zst
15.8 kB
xet
about 1 month ago
43263f08
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005059.jsonl.zst
5.4 kB
xet
about 1 month ago
4a067f98
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005092.jsonl.zst
37 kB
xet
about 1 month ago
ccd8e01a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005122.jsonl.zst
56 kB
xet
about 1 month ago
971c3ff0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005140.jsonl.zst
37.3 kB
xet
about 1 month ago
3ec086bf
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005141.jsonl.zst
15.6 kB
xet
about 1 month ago
83a73f07
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005151.jsonl.zst
19.3 kB
xet
about 1 month ago
454c02d2
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005161.jsonl.zst
42 kB
xet
about 1 month ago
db540f18
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005179.jsonl.zst
11 kB
xet
about 1 month ago
7c42b7ed
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005191.jsonl.zst
102 kB
xet
about 1 month ago
57b28b96
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005216.jsonl.zst
5.85 kB
xet
about 1 month ago
7861c0d7
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005221.jsonl.zst
62.2 kB
xet
about 1 month ago
d31de8a5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005244.jsonl.zst
42.8 kB
xet
about 1 month ago
69a34d5d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005258.jsonl.zst
18.6 kB
xet
about 1 month ago
439c495b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005269.jsonl.zst
16.6 kB
xet
about 1 month ago
6b0a50fd
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005291.jsonl.zst
19.4 kB
xet
about 1 month ago
5edb506e
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005293.jsonl.zst
5.72 kB
xet
about 1 month ago
c89fdeb0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005312.jsonl.zst
9 kB
xet
about 1 month ago
57807b82
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005325.jsonl.zst
71.3 kB
xet
about 1 month ago
2ea17bb6
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005340.jsonl.zst
5.31 kB
xet
about 1 month ago
28b75cc2
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005355.jsonl.zst
27 kB
xet
about 1 month ago
9e7f00c9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005382.jsonl.zst
25.6 kB
xet
about 1 month ago
bff97ba8
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005386.jsonl.zst
24.8 kB
xet
about 1 month ago
3140bd79
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005387.jsonl.zst
29.9 kB
xet
about 1 month ago
c9639525
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005407.jsonl.zst
44 kB
xet
about 1 month ago
be976eba
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005449.jsonl.zst
32.8 kB
xet
about 1 month ago
325ada26
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005466.jsonl.zst
16.4 kB
xet
about 1 month ago
56403b65
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005475.jsonl.zst
13.9 kB
xet
about 1 month ago
651e21d1
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005479.jsonl.zst
94.8 kB
xet
about 1 month ago
4f74472f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005494.jsonl.zst
55.2 kB
xet
about 1 month ago
35a113ad
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005504.jsonl.zst
34.7 kB
xet
about 1 month ago
cb628d7c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005510.jsonl.zst
11 kB
xet
about 1 month ago
91a10973
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005549.jsonl.zst
78.5 kB
xet
about 1 month ago
422d6d2b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005568.jsonl.zst
25.9 kB
xet
about 1 month ago
c585cdcb
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005576.jsonl.zst
47.9 kB
xet
about 1 month ago
0c787fd7
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005581.jsonl.zst
15.6 kB
xet
about 1 month ago
10bf91fa
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005598.jsonl.zst
11.1 kB
xet
about 1 month ago
cbcc537f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005624.jsonl.zst
13.4 kB
xet
about 1 month ago
4005edd2
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005635.jsonl.zst
41.4 kB
xet
about 1 month ago
1cbee487
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005638.jsonl.zst
15.4 kB
xet
about 1 month ago
2ca4502a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005644.jsonl.zst
21.9 kB
xet
about 1 month ago
8f67ef81
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005669.jsonl.zst
30.2 kB
xet
about 1 month ago
84093f76
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005696.jsonl.zst
78 kB
xet
about 1 month ago
6333b7f1
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005700.jsonl.zst
60.5 kB
xet
about 1 month ago
669e54cf
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005704.jsonl.zst
126 kB
xet
about 1 month ago
6b52c885
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005707.jsonl.zst
24.3 kB
xet
about 1 month ago
20cb58cb
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005725.jsonl.zst
15.5 kB
xet
about 1 month ago
cd83b450
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005743.jsonl.zst
7.16 kB
xet
about 1 month ago
99a3a300
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005761.jsonl.zst
27.5 kB
xet
about 1 month ago
d2dc0f37
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005772.jsonl.zst
60.3 kB
xet
about 1 month ago
8a760fdf
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005796.jsonl.zst
79.9 kB
xet
about 1 month ago
9da9f032
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005831.jsonl.zst
39.5 kB
xet
about 1 month ago
35ef5b03
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005833.jsonl.zst
70.4 kB
xet
about 1 month ago
db2d8cd4
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005840.jsonl.zst
28.4 kB
xet
about 1 month ago
2bcaeb6d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005853.jsonl.zst
65.5 kB
xet
about 1 month ago
d73c7c78
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005862.jsonl.zst
229 kB
xet
about 1 month ago
923c47ae
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005870.jsonl.zst
19.7 kB
xet
about 1 month ago
3b971e48
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005890.jsonl.zst
65.1 kB
xet
about 1 month ago
8a90b2a9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00005901.jsonl.zst
61.8 kB
xet
about 1 month ago
e5eabd4a
Load more
Use this bucket
Total size
11.1 GB
Files
56,043
Last updated
Mar 24
Pre-warmed CDN
US
EU
US
EU
Contributors