by ZhuLinsen · Agent Tool · ★ 180
A powerful tool for creating high-quality training datasets for Large Language Models (LLMs)(一个快速生成高质量LLM微调训练数据集的工具)
| Stars | 180 |
| Forks | 27 |
| Language | Python |
| Category | Agent Tool |
| License | Apache-2.0 |
| Quality Score | 41.75/100 |
| Last Updated | 2025-08-31 |
| Created | 2025-04-25 |
| Platforms | python |
| Est. Tokens | ~198k |
These tools work well together with FastDatasets for enhanced workflows:
Explore other popular agent tool tools:
FastDatasets is A powerful tool for creating high-quality training datasets for Large Language Models (LLMs)(一个快速生成高质量LLM微调训练数据集的工具). It is categorized as a Agent Tool with 180 GitHub stars.
FastDatasets is primarily written in Python. It covers topics such as asyncio, dataset-generation, datasets.
You can find installation instructions and usage details in the FastDatasets GitHub repository at github.com/ZhuLinsen/FastDatasets. The project has 180 stars and 27 forks, indicating an active community.
FastDatasets is released under the Apache-2.0 license, making it free to use and modify according to the license terms.