Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Buckets:
HCAI-Lab
/
dolma3-6t-sample-5000-docs
Follow
Human-Centered AI Lab
9
Files
xet
HCAI-Lab/dolma3-6t-sample-5000-docs
/
worker_0057
11.1 GB
56,043 files
Updated about 1 month ago
Ctrl+K
Name
Size
Uploaded
Xet hash
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000369.jsonl.zst
222 kB
xet
about 1 month ago
2197bcac
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000370.jsonl.zst
199 kB
xet
about 1 month ago
a48489fb
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000371.jsonl.zst
280 kB
xet
about 1 month ago
7290e771
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000372.jsonl.zst
281 kB
xet
about 1 month ago
14374d5e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000373.jsonl.zst
322 kB
xet
about 1 month ago
aa0760d5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000374.jsonl.zst
146 kB
xet
about 1 month ago
db78b973
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000375.jsonl.zst
221 kB
xet
about 1 month ago
4cf3d6d5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000376.jsonl.zst
151 kB
xet
about 1 month ago
78ab89ad
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000377.jsonl.zst
302 kB
xet
about 1 month ago
06050ccd
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000378.jsonl.zst
154 kB
xet
about 1 month ago
58a26c3d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000379.jsonl.zst
275 kB
xet
about 1 month ago
348979fd
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000380.jsonl.zst
337 kB
xet
about 1 month ago
7cf19394
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000381.jsonl.zst
221 kB
xet
about 1 month ago
1a30e798
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000382.jsonl.zst
530 kB
xet
about 1 month ago
adb8813e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000383.jsonl.zst
149 kB
xet
about 1 month ago
aa289db2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000384.jsonl.zst
153 kB
xet
about 1 month ago
42e56a3b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000385.jsonl.zst
212 kB
xet
about 1 month ago
f8ef503d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000386.jsonl.zst
71.9 kB
xet
about 1 month ago
1bf75438
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000387.jsonl.zst
236 kB
xet
about 1 month ago
b0df717d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000388.jsonl.zst
224 kB
xet
about 1 month ago
592ef131
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000389.jsonl.zst
133 kB
xet
about 1 month ago
5102f484
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000390.jsonl.zst
133 kB
xet
about 1 month ago
b34d0a4c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000391.jsonl.zst
204 kB
xet
about 1 month ago
26decf8f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000392.jsonl.zst
301 kB
xet
about 1 month ago
f21f8f0a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000393.jsonl.zst
225 kB
xet
about 1 month ago
d83ad679
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000394.jsonl.zst
223 kB
xet
about 1 month ago
6a01e2ce
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000395.jsonl.zst
346 kB
xet
about 1 month ago
6d40ddb1
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000396.jsonl.zst
179 kB
xet
about 1 month ago
55eda9ca
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000397.jsonl.zst
216 kB
xet
about 1 month ago
0571634b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000398.jsonl.zst
219 kB
xet
about 1 month ago
e5250570
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000399.jsonl.zst
116 kB
xet
about 1 month ago
3b063576
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000400.jsonl.zst
336 kB
xet
about 1 month ago
e64b2aaf
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000401.jsonl.zst
241 kB
xet
about 1 month ago
9afea27c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000402.jsonl.zst
181 kB
xet
about 1 month ago
d823f6ce
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000403.jsonl.zst
256 kB
xet
about 1 month ago
adbb53a2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000404.jsonl.zst
127 kB
xet
about 1 month ago
926ab904
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000405.jsonl.zst
215 kB
xet
about 1 month ago
0f2176ca
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000406.jsonl.zst
174 kB
xet
about 1 month ago
a9dc0460
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000407.jsonl.zst
429 kB
xet
about 1 month ago
edb38d83
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000408.jsonl.zst
355 kB
xet
about 1 month ago
e070b493
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000409.jsonl.zst
130 kB
xet
about 1 month ago
bdd33e33
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000410.jsonl.zst
223 kB
xet
about 1 month ago
1258a4d0
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000411.jsonl.zst
271 kB
xet
about 1 month ago
a90d9328
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000412.jsonl.zst
148 kB
xet
about 1 month ago
f67c903b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000413.jsonl.zst
406 kB
xet
about 1 month ago
fbeb9629
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000414.jsonl.zst
199 kB
xet
about 1 month ago
a9a9d1da
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000415.jsonl.zst
167 kB
xet
about 1 month ago
8d9e145e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000416.jsonl.zst
186 kB
xet
about 1 month ago
93f7a194
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000417.jsonl.zst
193 kB
xet
about 1 month ago
0ee167f4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000418.jsonl.zst
264 kB
xet
about 1 month ago
50d8cf9d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000419.jsonl.zst
225 kB
xet
about 1 month ago
d05e79ce
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000420.jsonl.zst
198 kB
xet
about 1 month ago
d140be08
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000421.jsonl.zst
268 kB
xet
about 1 month ago
6b18ac97
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000422.jsonl.zst
251 kB
xet
about 1 month ago
d6cda7ec
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000423.jsonl.zst
190 kB
xet
about 1 month ago
d0649c4c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000424.jsonl.zst
242 kB
xet
about 1 month ago
b6255fa8
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000425.jsonl.zst
321 kB
xet
about 1 month ago
dcfd258f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000426.jsonl.zst
269 kB
xet
about 1 month ago
0d1046e6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000427.jsonl.zst
159 kB
xet
about 1 month ago
b80acae0
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000428.jsonl.zst
332 kB
xet
about 1 month ago
08cb99a7
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000429.jsonl.zst
325 kB
xet
about 1 month ago
9cea191e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000430.jsonl.zst
211 kB
xet
about 1 month ago
a6ec6925
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000431.jsonl.zst
241 kB
xet
about 1 month ago
bdb0a1e6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000432.jsonl.zst
297 kB
xet
about 1 month ago
34cd28d7
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000433.jsonl.zst
177 kB
xet
about 1 month ago
c3d38914
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000434.jsonl.zst
220 kB
xet
about 1 month ago
964afad6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000435.jsonl.zst
165 kB
xet
about 1 month ago
1c1ce61c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000436.jsonl.zst
284 kB
xet
about 1 month ago
ca7a7e3a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000437.jsonl.zst
210 kB
xet
about 1 month ago
d419c257
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000438.jsonl.zst
228 kB
xet
about 1 month ago
d0970981
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000439.jsonl.zst
167 kB
xet
about 1 month ago
5a2ca018
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000440.jsonl.zst
170 kB
xet
about 1 month ago
2f834568
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000441.jsonl.zst
196 kB
xet
about 1 month ago
0addeedf
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000442.jsonl.zst
240 kB
xet
about 1 month ago
55b3f122
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000443.jsonl.zst
125 kB
xet
about 1 month ago
4c799106
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000444.jsonl.zst
144 kB
xet
about 1 month ago
b5b20aa6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000445.jsonl.zst
205 kB
xet
about 1 month ago
2f3c7a7b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000446.jsonl.zst
358 kB
xet
about 1 month ago
d3180050
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000447.jsonl.zst
183 kB
xet
about 1 month ago
8f0265d4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000448.jsonl.zst
140 kB
xet
about 1 month ago
b7b46840
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000449.jsonl.zst
332 kB
xet
about 1 month ago
5714b256
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000450.jsonl.zst
212 kB
xet
about 1 month ago
c8561e76
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000451.jsonl.zst
369 kB
xet
about 1 month ago
b3a3c2de
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000452.jsonl.zst
90.1 kB
xet
about 1 month ago
723613f9
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000453.jsonl.zst
159 kB
xet
about 1 month ago
b1ba0555
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000454.jsonl.zst
50.9 kB
xet
about 1 month ago
e8856b07
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000455.jsonl.zst
73 kB
xet
about 1 month ago
4a438f54
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000456.jsonl.zst
165 kB
xet
about 1 month ago
22a33126
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000457.jsonl.zst
185 kB
xet
about 1 month ago
f2f760c4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000458.jsonl.zst
128 kB
xet
about 1 month ago
8b034ea5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000459.jsonl.zst
205 kB
xet
about 1 month ago
0b3c1dfa
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000460.jsonl.zst
260 kB
xet
about 1 month ago
4b7059b7
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000461.jsonl.zst
153 kB
xet
about 1 month ago
b705d794
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000462.jsonl.zst
147 kB
xet
about 1 month ago
ec36592b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000463.jsonl.zst
352 kB
xet
about 1 month ago
5f7e29f1
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000464.jsonl.zst
141 kB
xet
about 1 month ago
da5152fa
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000465.jsonl.zst
234 kB
xet
about 1 month ago
42343f7b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000466.jsonl.zst
152 kB
xet
about 1 month ago
5351ee9e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000467.jsonl.zst
195 kB
xet
about 1 month ago
10b5432d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0014__shard_00000468.jsonl.zst
188 kB
xet
about 1 month ago
e9362b73
Load more
Use this bucket
Total size
11.1 GB
Files
56,043
Last updated
Mar 24
Pre-warmed CDN
US
EU
US
EU
Contributors