Tag: data
-
windmill (self-hosted job orchestrator)
https://www.windmill.dev
-
data oriented design (book)
https://www.dataorienteddesign.com/dodbook
-
Amazon’s Exabyte-Scale Migration from Apache Spark to Ray on Amazon EC2
https://aws.amazon.com/blogs/opensource/amazons-exabyte-scale-migration-from-apache-spark-to-ray-on-amazon-ec2
-
delta cat
A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to your big data workloads.
https://github.com/ray-project/deltacat
-
netflix — maestro
https://github.com/Netflix/maestro
https://netflixtechblog.com/orchestrating-data-ml-workflows-at-scale-with-netflix-maestro-aaa2b41b800c
https://netflixtechblog.com/maestro-netflixs-workflow-orchestrator-ee13a06f9c78
https://news.ycombinator.com/item?id=41037745
-
[hn] get me out of data hell
https://news.ycombinator.com/item?id=42010249
-
[sdg] data leader’s guide to hiring and team building – part 1
https://open.substack.com/pub/seattledataguy/p/the-data-leaders-guide-to-hiring