Cancel

Building machine learning models at scale for data parallel problems on Pivotal's MPP databases

Posted Mar 20, 2016 2016-03-20T00:00:00-07:00 by Srivatsan Ramanujam

Updated May 17, 2020 2020-05-17T16:02:06-07:00

Using a clever trick in leveraging static dictionaries in PL/Python, we can easily scale ML models from popular libraries like scikit-learn or XGBoost for data parallel problems. You can read the full blog that I published in the Pivotal Engineering Journal following the link below.

Building machine learning models at scale for data parallel problems on Pivotal’s MPP databases

Blogs, Deepdive

This post is licensed under CC BY 4.0 by the author.

Recent Update

Tweets by being_bayesian

Comments powered by Disqus.

Building machine learning models at scale for data parallel problems on Pivotal's MPP databases

Recent Update

Trending Tags

Contents

Trending Tags

Building machine learning models at scale for data parallel problems on Pivotal's MPP databases

Recent Update

Trending Tags

Contents

Further Reading

All Things Python @ Pivotal

Scalable in-database machine learning with PL/Python (Postgres Open Silicon Valley 2017)

Einstein for Sales - Under the Hood (Dreamforce 2019 Breakout Session)

Trending Tags