Post
2597
I am very excited to see the release of
nyuuzyou/gitee-code. This is exactly what I have been looking for. Thank you to
@nyuuzyou
for his hard work on this.
Join the community of Machine Learners and AI enthusiasts.
Sign UpGlad you are finding it useful! You should also check out these datasets:
https://huggingface.co/datasets/nyuuzyou/gitcode-code
https://huggingface.co/datasets/nyuuzyou/jihulab-code
They use the same data processing pipeline and format, but they are sourced from different Chinese services.
good work.