Tag: Glue
-
Accelerate Data to AI Innovation with Amazon SageMaker Unified Studio
AWS announces the general availability of Amazon SageMaker Unified Studio, which brings analytics and AI capabilities together in a single development environment. Teams can discover data, collaborate on projects, and build applications with built-in governance, reducing time-to-value for data-driven initiatives.
-
How Open Universities Australia Reduced ETL Costs Using AWS Cloud Services
Learn how Open Universities Australia moved its data infrastructure from costly third-party ETL tools to AWS services, achieving cost savings and improved efficiency in 5 months.
-
Integrating AWS Glue with Amazon OpenSearch Service for Streamlined Data Ingestion
Learn three methods for integrating AWS Glue with Amazon OpenSearch Service for data ingestion, along with best practices and the infrastructure setup needed to build robust data pipelines.
-
Implementing End-to-End Data Lineage for Complex Analytics using AWS Services and dbt
Discover how to build comprehensive data lineage for one-time and complex queries using Amazon Athena, Amazon Redshift, and Amazon Neptune. Learn about unified data modeling with dbt and automated lineage generation on an AWS serverless architecture.
-
Implementing Write-Audit-Publish Pattern with Apache Iceberg and AWS Glue Data Quality
Explore how to implement the Write-Audit-Publish pattern using Apache Iceberg and AWS Glue Data Quality for robust data validation. Learn about efficient data quality management strategies and their practical applications in modern data architectures.
-
AWS Glue Data Catalog Now Automates Table Statistics for Enhanced Query Performance
AWS Glue Data Catalog now generates table statistics automatically, improving query performance in Amazon Redshift Spectrum and Amazon Athena. Statistics are collected across multiple file formats, and collection can be configured at both the catalog and table levels.
-
Streamlining Spark Debugging: AWS Glue Introduces Generative AI Troubleshooting Feature
AWS Glue introduces a generative AI troubleshooting feature for Apache Spark applications. It automates root cause analysis and provides actionable recommendations, cutting debugging time from hours to minutes.
-
AWS Glue Data Catalog Enables VPC-Based Apache Iceberg Table Optimization
Discover how AWS Glue Data Catalog now supports automatic optimization of Apache Iceberg tables through VPC integration, enabling secure table maintenance while meeting strict access control requirements. Learn about key features and implementation details.
-
Enhance AWS Glue Data Catalog with Generative AI and Amazon Bedrock
Learn how to automate metadata generation for AWS Glue Data Catalog using foundation models on Amazon Bedrock. This solution explores both in-context learning and retrieval-augmented generation (RAG) approaches to create comprehensive data descriptions for improved data governance.