Category: Data Engineering
-

How Open Universities Australia Reduced ETL Costs Using AWS Cloud Services
Discover how Open Universities Australia revolutionized their data infrastructure by transitioning from costly third-party ETL tools to AWS services, achieving significant cost savings and improved efficiency in just 5 months.
-

How Aqua Security Leverages AWS Step Functions for Scalable Data Export Solutions
Discover how Aqua Security revolutionized their data export process using AWS Step Functions and Aurora PostgreSQL. Learn about their journey in implementing a scalable, secure, and efficient solution for managing enterprise-level security data.
-

Implementing Hybrid Analytics with Amazon EMR on AWS Outposts
Discover how Amazon EMR on AWS Outposts enables powerful hybrid analytics, combining cloud scalability with on-premises control. Learn to process sensitive data locally while accessing cloud resources for comprehensive big data solutions.
-

How EUROGATE Revolutionizes Container Terminal Operations with Amazon DataZone Integration
Discover how EUROGATE transformed its container terminal operations by implementing Amazon DataZone, enabling efficient data sharing, enhanced analytics, and streamlined machine learning capabilities across their European operations.
-

Building Event-Driven Amazon Redshift Lakehouse Architecture for Cloud Excellence at MuleSoft
Explore how MuleSoft implemented a sophisticated lakehouse architecture using AWS services to achieve cloud excellence. Learn about their three-phase approach combining preparation, enrichment, and action to create a comprehensive cloud operations framework.
-

Integrating AWS Glue with Amazon OpenSearch Service for Streamlined Data Ingestion
Discover how to effectively integrate AWS Glue with Amazon OpenSearch Service for streamlined data ingestion. Learn about three powerful integration methods, best practices, and infrastructure setup for building robust data pipelines.
-

Optimizing Quant Research with Apache Iceberg: Performance and Productivity Gains
Explore how Apache Iceberg enhances quantitative research platforms through improved query performance, cost reduction, and increased productivity. Learn about its advantages over traditional Parquet files and its impact on data management efficiency.
-

Scaling Data Preprocessing: Leveraging Ray and GKE for Large-Scale ML Datasets
Discover how to overcome data preprocessing challenges in machine learning by implementing a distributed computing solution using Ray and Google Kubernetes Engine (GKE). Learn to efficiently handle large-scale datasets and accelerate your ML workflow.
-

Revolutionizing the 3D Printing Supply Chain: HP’s Innovative Approach with Delta Sharing
Discover how HP is revolutionizing the 3D printing supply chain using Delta Sharing. By leveraging real-time telemetry data, predictive maintenance, and enhanced data security, HP empowers customers to optimize operations, reduce downtime, and improve overall efficiency. Explore the power of data-driven solutions in transforming business outcomes.
