Cloud Chef Labs aims to provide a simple and efficient data platform that solves most of the problems
you encounter when building and operating a data lake.
DataRoaster is an open source tool that provisions data platforms on Kubernetes, making it easy to build a data lake and an AI-based analytics platform.
DataRoaster can serve as a cost-effective alternative to the serverless services offered by cloud providers.
To use DataRoaster, visit the GitHub repository: https://github.com/cloudcheflabs/dataroaster
DataRoaster consists of the following components:
CLI: command-line interface to the API Server.
API Server: handles requests from clients such as the CLI.
Authorizer: runs as an OAuth2 server.
Secret Manager: manages secrets such as kubeconfig files using Vault.
Resource Controller: manages remote Kubernetes resources with kubectl, Helm, and Kubernetes clients such as the fabric8 Kubernetes client.
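
As a rough illustration of how these components might fit together, the Python sketch below models a single request passing from the CLI through the API Server to the other components. It is not DataRoaster's actual code or API: every class, method, token, and cluster name here is hypothetical, and the order of steps (authenticate, fetch kubeconfig, deploy) is an assumption inferred from the component descriptions above.

```python
from dataclasses import dataclass, field


@dataclass
class Authorizer:
    """Hypothetical stand-in for the OAuth2 server: maps access tokens to users."""
    tokens: dict = field(default_factory=dict)

    def verify(self, token: str) -> str:
        if token not in self.tokens:
            raise PermissionError("invalid access token")
        return self.tokens[token]


@dataclass
class SecretManager:
    """Hypothetical stand-in for the Vault-backed store: one kubeconfig per cluster."""
    kubeconfigs: dict = field(default_factory=dict)

    def read_kubeconfig(self, cluster: str) -> str:
        return self.kubeconfigs[cluster]


class ResourceController:
    """Hypothetical stand-in for the component that would call kubectl or Helm."""

    def deploy(self, kubeconfig: str, service: str) -> str:
        # A real controller would act on the cluster; here we only describe the action.
        return f"deploy {service} using {kubeconfig}"


@dataclass
class ApiServer:
    authorizer: Authorizer
    secrets: SecretManager
    controller: ResourceController

    def create_service(self, token: str, cluster: str, service: str) -> str:
        user = self.authorizer.verify(token)                # 1. authenticate the caller
        kubeconfig = self.secrets.read_kubeconfig(cluster)  # 2. fetch cluster credentials
        action = self.controller.deploy(kubeconfig, service)  # 3. act on the cluster
        return f"{user}: {action}"


def cli(api: ApiServer, token: str, cluster: str, service: str) -> str:
    """Hypothetical stand-in for the CLI: forwards one command to the API server."""
    return api.create_service(token, cluster, service)


# Illustrative values only; none of these names come from DataRoaster.
api = ApiServer(
    Authorizer(tokens={"token-123": "alice"}),
    SecretManager(kubeconfigs={"demo-cluster": "kubeconfig-for-demo-cluster"}),
    ResourceController(),
)
result = cli(api, "token-123", "demo-cluster", "trino")
print(result)
```

In this sketch the API Server is the only component the client talks to, so credentials such as the kubeconfig never leave the server side.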
The following demo shows how to use DataRoaster to create a data platform on Kubernetes
consisting of components such as Hive Metastore, Spark Thrift Server, Trino, Redash, and JupyterHub.