# DataMate All-in-One Data Work Platform
[](https://github.com/ModelEngine-Group/DataMate/actions/workflows/docker-image-backend.yml)
[](https://github.com/ModelEngine-Group/DataMate/actions/workflows/docker-image-frontend.yml)




**DataMate is an enterprise-level data processing platform for model fine-tuning and RAG retrieval, supporting core
functions such as data collection, data management, operator marketplace, data cleaning, data synthesis, data
annotation, data evaluation, and knowledge generation.**
[简体中文](./README-zh.md) | [English](./README.md)
If you like this project, please give it a Star⭐️!
## 🌟 Core Features
- **Core Modules**: Data Collection, Data Management, Operator Marketplace, Data Cleaning, Data Synthesis, Data
Annotation, Data Evaluation, Knowledge Generation.
- **Visual Orchestration**: Drag-and-drop data processing workflow design.
- **Operator Ecosystem**: Rich built-in operators and support for custom operators.
## 🚀 Quick Start
### Prerequisites
- Git (for pulling source code)
- Make (for building and installing)
- Docker (for building images and deploying services)
- Docker-Compose (for service deployment - Docker method)
- Kubernetes (for service deployment - k8s method)
- Helm (for service deployment - k8s method)
This project supports deployment via two methods: docker-compose and helm. After executing the command, please enter the corresponding number for the deployment method. The command echo is as follows:
```shell
Choose a deployment method:
1. Docker/Docker-Compose
2. Kubernetes/Helm
Enter choice:
```
When running make uninstall, the installer will prompt once whether to delete volumes; that single choice is applied to all components. The uninstall order is: milvus -> label-studio -> datamate, which ensures the datamate network is removed cleanly after services that use it have stopped.
### Clone the Code
```bash
git clone git@github.com:ModelEngine-Group/DataMate.git
cd DataMate
```
### Deploy the basic services
```bash
make install
```
To list all available Make targets, flags and help text, run:
```bash
make help
```
### Build and deploy Mineru Enhanced PDF Processing
```bash
make build-mineru
make install-mineru
```
### Deploy the DeerFlow service
```bash
make install-deer-flow
```
### Local Development and Deployment
After modifying the local code, please execute the following commands to build the image and deploy using the local image.
```bash
make build
make install dev=true
```
## 🤝 Contribution Guidelines
Thank you for your interest in this project! We warmly welcome contributions from the community. Whether it's submitting
bug reports, suggesting new features, or directly participating in code development, all forms of help make the project
better.
• 📮 [GitHub Issues](../../issues): Submit bugs or feature suggestions.
• 🔧 [GitHub Pull Requests](../../pulls): Contribute code improvements.
## 📄 License
DataMate is open source under the [MIT](LICENSE) license. You are free to use, modify, and distribute the code of this
project in compliance with the license terms.