Overview for recommendations

The Latest in Apache Hive, Spark, Druid and Impala

See how Hortonworks and Cloudera are using the latest in Apache Hive, Spark, Druid and Impala for data warehousing, analytics and recommendations.


Building a Recommendation Engine Using Diverse Features

This talk covers how Sailthru leverages diverse features about users and items to build a recommendation system that will be used by several media and ecommerce companies. In particular, learn (a) how they use Spark SQL to extract user-level and item-level features, (b) how they run Spark code in production, along with best practices for building effective Spark applications, (c) how they use gradient boosting machines (GBMs) to combine diverse features and various algorithms into final recommendations for each user, and (d) how they use Spark MLlib to make recommendations for millions of users by scoring over ten thousand items per user.


Boosting consumer engagement at PayPal

Sujit Mathew and Yew Yap Goh from PayPal discuss:

  1. How they use data to boost consumer engagement with PayPal products
  2. How they apply collaborative filtering and scale models on a Hadoop cluster
  3. How they design content models and hybrid models
  4. How they use property graphs and graph modeling
  5. How they visualize their data for stakeholders
  6. How they take their models to production