Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
Ujjwal-Tyagi 
posted an update about 1 month ago
Post
2597
I am very excited to see the release of nyuuzyou/gitee-code. This is exactly what I have been looking for. Thank you to @nyuuzyou for his hard work on this.

Glad you are finding it useful! You should also check out these datasets:

https://huggingface.co/datasets/nyuuzyou/gitcode-code
https://huggingface.co/datasets/nyuuzyou/jihulab-code

They use the same data processing pipeline and format, but they are sourced from different Chinese services.

Awesome! Congrats on the release, @nyuuzyou

good work.