Bucket: HCAI-Lab/dolma3-6t-sample-5000-docs (Human-Centered AI Lab)
Path: worker_0065
11.1 GB
56,043 files
Updated about 1 month ago
Name | Size | Uploaded | Xet hash
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000429.jsonl.zst | 133 kB | about 1 month ago | c3076f0a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000430.jsonl.zst | 111 kB | about 1 month ago | 9660d5c5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000431.jsonl.zst | 118 kB | about 1 month ago | 6a8748d7
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000432.jsonl.zst | 117 kB | about 1 month ago | a3683730
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000433.jsonl.zst | 97.9 kB | about 1 month ago | 6a1a86d0
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000434.jsonl.zst | 129 kB | about 1 month ago | 740ff4f3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000435.jsonl.zst | 119 kB | about 1 month ago | 3c823d0b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000436.jsonl.zst | 118 kB | about 1 month ago | c24441e4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000437.jsonl.zst | 111 kB | about 1 month ago | 2d9f0726
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000438.jsonl.zst | 160 kB | about 1 month ago | 9f75b47b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000439.jsonl.zst | 131 kB | about 1 month ago | 90e7c8b5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000440.jsonl.zst | 94.4 kB | about 1 month ago | 4fe85964
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000441.jsonl.zst | 184 kB | about 1 month ago | af439696
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000442.jsonl.zst | 169 kB | about 1 month ago | d8746dc4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000443.jsonl.zst | 128 kB | about 1 month ago | 416456cd
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000444.jsonl.zst | 73.7 kB | about 1 month ago | cc33a8ef
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000445.jsonl.zst | 185 kB | about 1 month ago | 04e50a22
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000446.jsonl.zst | 117 kB | about 1 month ago | 6e6e8f89
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000447.jsonl.zst | 185 kB | about 1 month ago | 1ba717bf
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000448.jsonl.zst | 275 kB | about 1 month ago | be23cd63
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000449.jsonl.zst | 222 kB | about 1 month ago | df8405d2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000450.jsonl.zst | 91.6 kB | about 1 month ago | e14df372
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000451.jsonl.zst | 63.5 kB | about 1 month ago | 1b6de383
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000452.jsonl.zst | 264 kB | about 1 month ago | 925f9d8b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000453.jsonl.zst | 168 kB | about 1 month ago | edecc417
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000454.jsonl.zst | 184 kB | about 1 month ago | 48f18881
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000455.jsonl.zst | 229 kB | about 1 month ago | 1cb9e47e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000456.jsonl.zst | 147 kB | about 1 month ago | 188955ab
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000457.jsonl.zst | 104 kB | about 1 month ago | 61bd37ff
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000458.jsonl.zst | 163 kB | about 1 month ago | 8c0ab7a9
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000459.jsonl.zst | 196 kB | about 1 month ago | d19fa288
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000460.jsonl.zst | 185 kB | about 1 month ago | 0ed10f8e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000461.jsonl.zst | 151 kB | about 1 month ago | ff78596a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000462.jsonl.zst | 93.6 kB | about 1 month ago | 3e026104
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000463.jsonl.zst | 125 kB | about 1 month ago | 74ea76c5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000464.jsonl.zst | 194 kB | about 1 month ago | e549ab1b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000465.jsonl.zst | 166 kB | about 1 month ago | 0fa5a243
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000466.jsonl.zst | 208 kB | about 1 month ago | ab5086e2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000467.jsonl.zst | 172 kB | about 1 month ago | 0b2d0116
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000468.jsonl.zst | 177 kB | about 1 month ago | 9bc31694
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000469.jsonl.zst | 151 kB | about 1 month ago | fcc8d5b3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000470.jsonl.zst | 231 kB | about 1 month ago | 91356375
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000471.jsonl.zst | 95.2 kB | about 1 month ago | 4b623958
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000472.jsonl.zst | 104 kB | about 1 month ago | b43c8725
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000473.jsonl.zst | 114 kB | about 1 month ago | 30a9d3ef
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000474.jsonl.zst | 144 kB | about 1 month ago | 15a687f7
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000475.jsonl.zst | 231 kB | about 1 month ago | 374b0370
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000476.jsonl.zst | 220 kB | about 1 month ago | 5507b382
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000477.jsonl.zst | 142 kB | about 1 month ago | ed48d34e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000478.jsonl.zst | 128 kB | about 1 month ago | 86edb91f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000479.jsonl.zst | 105 kB | about 1 month ago | f6a30d09
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000480.jsonl.zst | 196 kB | about 1 month ago | 98f0e358
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000481.jsonl.zst | 200 kB | about 1 month ago | 2300ca8c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000482.jsonl.zst | 158 kB | about 1 month ago | b7e8fa1c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000483.jsonl.zst | 114 kB | about 1 month ago | 07e27495
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000484.jsonl.zst | 148 kB | about 1 month ago | ce5cb05b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000485.jsonl.zst | 131 kB | about 1 month ago | e08540f7
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000486.jsonl.zst | 122 kB | about 1 month ago | 83cdaf88
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000487.jsonl.zst | 205 kB | about 1 month ago | 237a10ef
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000488.jsonl.zst | 193 kB | about 1 month ago | f99ee627
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000489.jsonl.zst | 99.1 kB | about 1 month ago | e46754f8
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000490.jsonl.zst | 126 kB | about 1 month ago | edc2a62c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000491.jsonl.zst | 213 kB | about 1 month ago | b423b720
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000492.jsonl.zst | 152 kB | about 1 month ago | 9a84d952
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000493.jsonl.zst | 179 kB | about 1 month ago | ac078fc0
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000494.jsonl.zst | 233 kB | about 1 month ago | 796e584d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000495.jsonl.zst | 133 kB | about 1 month ago | 9d3ebf9f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000496.jsonl.zst | 82.1 kB | about 1 month ago | 7e4d4dcf
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000497.jsonl.zst | 185 kB | about 1 month ago | 4c28e07e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000498.jsonl.zst | 178 kB | about 1 month ago | fd5f233a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000499.jsonl.zst | 124 kB | about 1 month ago | a8f2b653
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000500.jsonl.zst | 176 kB | about 1 month ago | ae963b36
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000501.jsonl.zst | 114 kB | about 1 month ago | 41061933
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000502.jsonl.zst | 111 kB | about 1 month ago | 210f2144
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000503.jsonl.zst | 143 kB | about 1 month ago | a0a6c807
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000504.jsonl.zst | 106 kB | about 1 month ago | 62953577
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000505.jsonl.zst | 99.3 kB | about 1 month ago | 0ef8d913
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000506.jsonl.zst | 139 kB | about 1 month ago | 17b65245
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000507.jsonl.zst | 131 kB | about 1 month ago | 2141dbea
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000508.jsonl.zst | 89.3 kB | about 1 month ago | e7034d78
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000509.jsonl.zst | 170 kB | about 1 month ago | 6fdf57be
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000510.jsonl.zst | 111 kB | about 1 month ago | 6c6b8c32
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000511.jsonl.zst | 325 kB | about 1 month ago | 1ca48552
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000512.jsonl.zst | 138 kB | about 1 month ago | 4d93cfb6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000513.jsonl.zst | 184 kB | about 1 month ago | 3f44a385
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000514.jsonl.zst | 162 kB | about 1 month ago | dcf9477e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000515.jsonl.zst | 127 kB | about 1 month ago | c1184a5a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000516.jsonl.zst | 160 kB | about 1 month ago | 4a1cca3c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000517.jsonl.zst | 155 kB | about 1 month ago | 1e78d24b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000518.jsonl.zst | 143 kB | about 1 month ago | 03e07856
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000519.jsonl.zst | 119 kB | about 1 month ago | fb145ef0
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000520.jsonl.zst | 116 kB | about 1 month ago | fc256985
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000521.jsonl.zst | 109 kB | about 1 month ago | 1ce3e67b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000522.jsonl.zst | 137 kB | about 1 month ago | de436d6c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000523.jsonl.zst | 112 kB | about 1 month ago | c899192b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000524.jsonl.zst | 162 kB | about 1 month ago | a47ce6bb
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000525.jsonl.zst | 79.8 kB | about 1 month ago | e9b5ab30
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000526.jsonl.zst | 148 kB | about 1 month ago | 12f30bfe
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000527.jsonl.zst | 138 kB | about 1 month ago | a9fb56f0
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000528.jsonl.zst | 127 kB | about 1 month ago | e82980c9
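
The file names in this listing share a regular layout: seven fields joined by a double underscore, followed by the `.jsonl.zst` extension. As a minimal sketch, the names can be split into those fields to group or sort shards programmatically. The function name `parse_shard_name`, the `ShardName` type, and every field label below are hypothetical, chosen only for illustration; the listing itself does not document what each field means.

```python
from typing import NamedTuple

SUFFIX = ".jsonl.zst"


class ShardName(NamedTuple):
    # Field labels are illustrative guesses, not documented by the bucket.
    prefix: str        # e.g. "soc127"
    pool: str          # e.g. "phase1_pool_shared"
    source: str        # e.g. "common_crawl"
    part: int          # numeric suffix of "part_005"
    category: str      # e.g. "science_math_and_technology"
    category_index: int  # e.g. 18, from the trailing "-0018"
    shard: int         # numeric suffix of "shard_00000429"


def parse_shard_name(name: str) -> ShardName:
    """Split a listing file name into its double-underscore-separated fields."""
    if not name.endswith(SUFFIX):
        raise ValueError(f"unexpected extension: {name!r}")
    fields = name[: -len(SUFFIX)].split("__")
    if len(fields) != 7:
        raise ValueError(f"expected 7 fields, got {len(fields)}: {name!r}")
    prefix, pool, source, part, _data, topic, shard = fields
    # The sixth field looks like "<source>-<category>-<index>".
    head, index = topic.rsplit("-", 1)
    _topic_source, category = head.split("-", 1)
    return ShardName(
        prefix=prefix,
        pool=pool,
        source=source,
        part=int(part.split("_", 1)[1]),
        category=category,
        category_index=int(index),
        shard=int(shard.split("_", 1)[1]),
    )
```

Parsing into a named tuple keeps the numeric fields as integers, so shards sort in numeric rather than lexical order.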
Total size: 11.1 GB
Files: 56,043
Last updated: Mar 24
Pre-warmed CDN: US, EU