AI-Powered Retail Experience with Databricks

如果无法正常显示,请先停止浏览器的去广告插件。
分享至:
相关话题: #zalando
1. WIFI SSID:Spark+AISummit | Password: UnifiedDataAnalytics
2. AI-Powered Retail Experience with Databricks Akhil Dhingra, Zalando Saurav Verma, Zalando #UnifiedDataAnalytics #SparkAISummit
3. Zalando SE ● Founded in 2008 in Berlin. ● Europe's leading online fashion platform ● Connects customers, brands and partners. #UnifiedDataAnalytics #SparkAISummit 3
4. Zalando SE 4
5. Big-Data Stack @ Zalando 5
6. About Us Akhil Dhingra Product Manager, Data Solutions @Zalando Exp: 7+ Years, Ex-Groupon, Ex-Wingify | MBA Saurav Verma Senior Engineer, Data Lake @Zalando Exp: 9+ Years , Ex-Visa | Masters NUS 6
7. Data Platform Data Sources 7
8. Data Platform ● Data Lake on top of S3 Data Sources 8
9. Data Platform ● Multi-tenant / single compute: more ingestion pipelines Data Sources 9
10. Many Use Cases Team A Data Sources 10
11. Many Use Cases Team B Team A Data Sources 11
12. Many Use Cases Team B Team A Team C Data Sources 12
13. Too Many Use Cases Team B Team M Team A Team C Team N Data Sources 13
14. Too Many … Compute Stream Team B Training Auto-Scale Team A Compute Team C Python / Scala Team M Batch Team N Data Sources 14
15. Too Many … Compute Stream Team B Training Auto-Scale Team A Compute Team C Python / Scala Batch Team M ● ● ● ● ● ● ● ● Cost control problem at Scale More Time To Production No Best Practices Duplication of work / Data Dependencies Inconsistent Environment No Community Knowledge Accidental Complexity Team N 15
16. Spark as a Service Stream ● Team B Training Auto-Scale Team M Team A Team C Python / Scala Batch ● ● ● ● ● Foundational piece of Zalando’s Big Data Infrastructure GitOps Management, Decentralized Clusters Security / Compliance / CI-CD XX clusters/Jobs ~20 teams in production Thriving #Databricks community in Zalando Team N 16
17. Spark as a Service Migration Projects ETLs | Data Preparation in Spark-S3 17
18. Spark as a Service Others: Structured Streams | Traceability 18
19. Spectrum of use cases 19
20. GDPR and Antitrust Compliance with GDPR and antitrust laws 20
21. GDPR and Antitrust Probe (pilot) - Use marker event to create heat map of the data path. - List of all datasets within the heat map. 21
22. GDPR and Antitrust Pseudonymize/Remove - Identifier based, on-demand, in-place record updater with field precision - Great for semi-structured formats like JSON - Use S3 Inventory + Streaming 22
23. Search & Ranking Personalized article ranking for relevance and user engagement. 23
24. Search & Ranking Using Spark in ML training pipeline ! 24
25. Search & Ranking ML Model Article Scoring and personalization ! 25
26. Others • Sizing: Reducing return rates due to size and fit issues. • Experimentation @Scale • Merchant Analytics • Marketing Services 26
27. First Impressions • GitOps | Self Service 27
28. First Impressions • Multi-Tiered support system • Delta Adoption | But few readers outside Databricks ecosystem • Communicating pricing downstream • Exploding Usage is Good • Fits all Size? 28
29. Thank you. AI- Powered Retail Experience with Databricks Akhil Dhingra Saurav Verma www.zalando.com www.jobs.zalando.com/tech #UnifiedDataAnalytics #SparkAISummit 29
30. DON’T FORGET TO RATE AND REVIEW THE SESSIONS SEARCH SPARK + AI SUMMIT

Home - Wiki
Copyright © 2011-2025 iteam. Current version is 2.142.1. UTC+08:00, 2025-04-11 03:01
浙ICP备14020137号-1 $Map of visitor$