Human-Centered AI Lab
HCAI-Lab/dolma3-6t-sample-5000-docs/worker_0064
11.1 GB
56,043 files
Updated about 1 month ago
Name
Size
Uploaded
Xet hash
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000958.jsonl.zst
204 kB
xet
about 1 month ago
ac518d27
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000959.jsonl.zst
146 kB
xet
about 1 month ago
9cfee8f4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000960.jsonl.zst
111 kB
xet
about 1 month ago
67d0dee6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000961.jsonl.zst
114 kB
xet
about 1 month ago
fa40ccd5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000962.jsonl.zst
128 kB
xet
about 1 month ago
697b866f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000963.jsonl.zst
148 kB
xet
about 1 month ago
91c254dd
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000964.jsonl.zst
214 kB
xet
about 1 month ago
95b8b71e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000965.jsonl.zst
161 kB
xet
about 1 month ago
44afd984
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000966.jsonl.zst
136 kB
xet
about 1 month ago
3823f1dd
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000967.jsonl.zst
163 kB
xet
about 1 month ago
90685850
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000968.jsonl.zst
74.9 kB
xet
about 1 month ago
c68494fa
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000969.jsonl.zst
12.5 kB
xet
about 1 month ago
1b1947c4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000000.jsonl.zst
129 kB
xet
about 1 month ago
1a85471e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000001.jsonl.zst
86.5 kB
xet
about 1 month ago
b2ca4761
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000002.jsonl.zst
109 kB
xet
about 1 month ago
7edcd20b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000003.jsonl.zst
191 kB
xet
about 1 month ago
739b2ca9
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000004.jsonl.zst
149 kB
xet
about 1 month ago
37126e72
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000005.jsonl.zst
107 kB
xet
about 1 month ago
4a79d7d2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000006.jsonl.zst
205 kB
xet
about 1 month ago
59f323cd
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000007.jsonl.zst
102 kB
xet
about 1 month ago
0b3c342b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000008.jsonl.zst
131 kB
xet
about 1 month ago
ec154ab9
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000009.jsonl.zst
374 kB
xet
about 1 month ago
591de101
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000010.jsonl.zst
309 kB
xet
about 1 month ago
37ced582
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000011.jsonl.zst
112 kB
xet
about 1 month ago
89c6671a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000012.jsonl.zst
94.1 kB
xet
about 1 month ago
8ccf395d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000013.jsonl.zst
134 kB
xet
about 1 month ago
071e638a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000014.jsonl.zst
222 kB
xet
about 1 month ago
e1b3a874
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000015.jsonl.zst
128 kB
xet
about 1 month ago
84952f0a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000016.jsonl.zst
126 kB
xet
about 1 month ago
b2c97706
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000017.jsonl.zst
170 kB
xet
about 1 month ago
79207a93
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000018.jsonl.zst
124 kB
xet
about 1 month ago
fa85306a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000019.jsonl.zst
130 kB
xet
about 1 month ago
41fa4fd0
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000020.jsonl.zst
197 kB
xet
about 1 month ago
67263f98
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000021.jsonl.zst
108 kB
xet
about 1 month ago
9b6051b0
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000022.jsonl.zst
167 kB
xet
about 1 month ago
65b7a9c5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000023.jsonl.zst
124 kB
xet
about 1 month ago
8e7d6b36
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000024.jsonl.zst
201 kB
xet
about 1 month ago
5d2198c2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000025.jsonl.zst
110 kB
xet
about 1 month ago
bd6ae219
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000026.jsonl.zst
220 kB
xet
about 1 month ago
2550cba5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000027.jsonl.zst
96.9 kB
xet
about 1 month ago
6fcb4d18
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000028.jsonl.zst
59.7 kB
xet
about 1 month ago
5599696a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000031.jsonl.zst
42.1 kB
xet
about 1 month ago
6f70ae78
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000032.jsonl.zst
108 kB
xet
about 1 month ago
fb1210a7
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000033.jsonl.zst
77.1 kB
xet
about 1 month ago
8fab4c3d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000034.jsonl.zst
156 kB
xet
about 1 month ago
f714455b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000035.jsonl.zst
92.6 kB
xet
about 1 month ago
588aa47f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000036.jsonl.zst
141 kB
xet
about 1 month ago
0bf5c486
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000037.jsonl.zst
200 kB
xet
about 1 month ago
85b2faa2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000038.jsonl.zst
75.5 kB
xet
about 1 month ago
68d54ab7
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000039.jsonl.zst
115 kB
xet
about 1 month ago
29171742
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000040.jsonl.zst
202 kB
xet
about 1 month ago
6e5fc211
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000041.jsonl.zst
131 kB
xet
about 1 month ago
3401f237
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000042.jsonl.zst
103 kB
xet
about 1 month ago
46590e87
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000043.jsonl.zst
186 kB
xet
about 1 month ago
839c57a2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000044.jsonl.zst
169 kB
xet
about 1 month ago
7e4f6124
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000045.jsonl.zst
116 kB
xet
about 1 month ago
11c5a7a4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000046.jsonl.zst
115 kB
xet
about 1 month ago
8349a0bb
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000047.jsonl.zst
223 kB
xet
about 1 month ago
c16a9666
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000048.jsonl.zst
94.9 kB
xet
about 1 month ago
771b6dea
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000049.jsonl.zst
138 kB
xet
about 1 month ago
54692e69
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000050.jsonl.zst
121 kB
xet
about 1 month ago
9f1e88c2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000051.jsonl.zst
135 kB
xet
about 1 month ago
9c84a775
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000052.jsonl.zst
188 kB
xet
about 1 month ago
ce807f35
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000053.jsonl.zst
111 kB
xet
about 1 month ago
bca9633e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000054.jsonl.zst
134 kB
xet
about 1 month ago
8514e75a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000055.jsonl.zst
108 kB
xet
about 1 month ago
8690dcf2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000056.jsonl.zst
195 kB
xet
about 1 month ago
b1c769ae
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000057.jsonl.zst
173 kB
xet
about 1 month ago
9ba09550
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000058.jsonl.zst
140 kB
xet
about 1 month ago
1e6d21e6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000059.jsonl.zst
112 kB
xet
about 1 month ago
a3cedfed
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000060.jsonl.zst
206 kB
xet
about 1 month ago
82f63fa9
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000061.jsonl.zst
168 kB
xet
about 1 month ago
f985eb6c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000062.jsonl.zst
121 kB
xet
about 1 month ago
4d4ca686
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000063.jsonl.zst
143 kB
xet
about 1 month ago
fc2b2150
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000064.jsonl.zst
172 kB
xet
about 1 month ago
8b86af47
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000065.jsonl.zst
179 kB
xet
about 1 month ago
d8920ba6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000066.jsonl.zst
164 kB
xet
about 1 month ago
8545389b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000067.jsonl.zst
152 kB
xet
about 1 month ago
bf5a3f8a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000068.jsonl.zst
210 kB
xet
about 1 month ago
5663b396
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000069.jsonl.zst
205 kB
xet
about 1 month ago
8d887136
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000070.jsonl.zst
106 kB
xet
about 1 month ago
13ecc706
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000071.jsonl.zst
83.1 kB
xet
about 1 month ago
1a5eb116
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000072.jsonl.zst
208 kB
xet
about 1 month ago
dd166c1c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000073.jsonl.zst
167 kB
xet
about 1 month ago
35b0bf9d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000074.jsonl.zst
210 kB
xet
about 1 month ago
86d0ce6b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000075.jsonl.zst
104 kB
xet
about 1 month ago
0cc9af7a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000076.jsonl.zst
171 kB
xet
about 1 month ago
057d84c6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000077.jsonl.zst
158 kB
xet
about 1 month ago
fa34c17d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000078.jsonl.zst
131 kB
xet
about 1 month ago
bc3bc905
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000079.jsonl.zst
360 kB
xet
about 1 month ago
6b5f3d69
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000080.jsonl.zst
138 kB
xet
about 1 month ago
a7a894ed
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000081.jsonl.zst
158 kB
xet
about 1 month ago
cedc406e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000082.jsonl.zst
166 kB
xet
about 1 month ago
401def92
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000083.jsonl.zst
127 kB
xet
about 1 month ago
dd96f08b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000084.jsonl.zst
149 kB
xet
about 1 month ago
7d77f624
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000085.jsonl.zst
111 kB
xet
about 1 month ago
d71f8344
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000086.jsonl.zst
146 kB
xet
about 1 month ago
618141f2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000087.jsonl.zst
172 kB
xet
about 1 month ago
a1e9d8f6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000088.jsonl.zst
116 kB
xet
about 1 month ago
19e4aaee
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000089.jsonl.zst
188 kB
xet
about 1 month ago
aed0842e
Total size: 11.1 GB
Files: 56,043
Last updated: Mar 24
Pre-warmed CDN: US, EU