Webinar Recording

Utilize the compute resources of your HPC cluster for AI/ML workloads while maintaining collaboration and compliance across teams

DKube.io and VMware present seamless integration of your vSphere based HPC/Slurm cluster with DKube MLOps platform based on the best of Kubeflow/MLFlow innovations

Watch the recorded Webinar

Many organizations have HPC clusters with large compute and GPU resource pools.  Tapping those resources for AI/ML workloads can be cumbersome, requiring hand-built plug-ins, open source libraries and tools by researchers, students, or individual employees duplicating cost and effort while reducing collaboration.  

Moreover AI/ML models often need to maintain traceability, lineage, and governance required by the regulatory or safety bodies in an industry or country.  That is available in commercial MLOps platforms which on the other hand were not built to take advantage of HPC compute and GPU resources.  

With DKube you can offload your data pre-processing or AI training jobs to a Slurm cluster based on vSphere -as individual jobs/runs or as part of pipelines.  Full traceability, lineage and logging of the work being performed is maintained in SQL database.  Multiple HPC clusters can be attached while the control plane of the DKube MLOps platform runs on a Kubernetes cluster such as VMWare Tanzu providing you with all the core innovations of Kubeflow and MLFlow.

This on-demand webinar provides a preview of these capabilities and how you can replicate them.

Watch the recording

This webinar took place on
May 25th, 2023 at 8AM US Pacific

In this event, we discuss holiday preparedness in cybersecurity, emerging threats in the domain, and jam freestyle on the latest IT innovations.

FABIANO TEIXERIA

Solutions Architect, VMWare

AJAY TYAGI

Senior Director,
DKube.io

Receive a link to the recording
Thank you, you will have received an email with the webinar recording.

Please feel free to reach out to our team if you have any questions.
Oops! Something went wrong while submitting the form.
past events

Bridge the AI/ML Divide Between Data Science and IT Infrastructure

The commercialization of AI/ML projects faces a significant challenge due to the lack of coordination between IT and data science teams. To address this issue, DKube Machine Learning Operations (MLOps) Platform on VMware Tanzu offers a solution that bridges the gap between teams.

Better collaboration

All teams benefit from best-in-class model operations and infrastructure management

Lower cost of operations

Save time and resources while significantly reducing costs

Do more with DKube

Find out why industry leaders in AI use DKube to run their Machine Learning operations.

Schedule a Demo