Kedro is an open-source Python system for creating reproducible, maintainable, and modular data science code with pipeline orchestration. Developed by McKinsey QuantumBlack and donated to the Linux Foundation, it brings software engineering best practices like modularity and testing to data science projects. Kedro provides a standardised project structure, a data catalogue, and pipeline visualisation.
- Modular pipeline nodes
- Data catalogue abstraction
- Project templating
- Pipeline visualisation
- Plugin ecosystem
Pros
- Excellent code organisation and modularity
- Strong software engineering principles
- Good documentation
Cons
- Learning curve for data scientists unfamiliar with software engineering
- Less real-time monitoring than alternatives
No reviews yet. Be the first to leave a review!
Log in to leave a review.