SHOW

Filter (clear filters)

Domains

Companies

Technologies

Functions


Overview for Apache Spark

How Traveloka's Runs Cloud-Scale Apache Spark in Production Since 2017

Traveloka's Data Engineering and Data Science team shares how the staff submit their cloud-scale Spark jobs today. The discussion highlights pros/cons, integration of Apache Spark with CI/CD components, Schedulers, Airflow, Key Management Systems (KMS), templates. The journey starts at historic event of a self-managed Spark cluster on-premise, and talk through adoption of AWS EMR, Qubole, Databricks, and Dataproc. How multiple back-end data sets has helped transform Traveloka from meta-search engine to fully integrated On-Line Travel Booking agency, and one of top Indonesian Unicorn startups!

Links



The Latest in Apache Hive, Spark, Druid and Impala

See how Hortonworks and Cloudera is using the latest in Apache Hive, Spark, Druid and Impala in data warehousing, analytics and recommendations.

Links