Data Engineering Labs

Apply Apache Beam concepts to practical engineering problems. Review skeleton scripts, analyze test inputs, and explore step-by-step solutions.

Foundation Lab | Easy
Employee Analytics
Est: 15 mins

Learn basic transformations by filtering and aggregating employee records.

Foundation Lab | Easy
Student Records
Est: 15 mins

Parse student grades and compute the average score per subject.

Foundation Lab | Easy
Product Catalog
Est: 15 mins

Filter products by price range and list catalog summaries.

Foundation Lab | Easy
Sales Report
Est: 20 mins

Summarize transactions to calculate total sales revenue.

Batch Lab | Medium
Customer Orders
Est: 30 mins

Analyze retail transactions and aggregate total spending per customer.

Batch Lab | Medium
Inventory Analytics
Est: 30 mins

Calculate total stock valuation and identify low-stock items.

Batch Lab | Medium
Banking Transactions
Est: 30 mins

Audit accounts by summing deposits and withdrawals.

Batch Lab | Medium
HR Analytics
Est: 30 mins

Compute average salary by department and identify the highest earner.

Streaming Lab | Hard
Telecom Monitoring
Est: 45 mins

Process streaming cellular events in real time using fixed windows.
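
As a rough preview of the idea behind fixed windows, the sketch below assigns each event to one non-overlapping time bucket and counts events per bucket. It is plain Python rather than the Apache Beam API, and the 60-second window size, the `(timestamp, payload)` event shape, and the function names are all assumptions made for illustration.

```python
from collections import Counter

WINDOW_SIZE_S = 60  # assumed window length in seconds (hypothetical value)


def fixed_window_start(timestamp_s, size_s=WINDOW_SIZE_S):
    """Return the start of the fixed window that contains timestamp_s."""
    return timestamp_s - (timestamp_s % size_s)


def count_per_window(events, size_s=WINDOW_SIZE_S):
    """Count events per fixed window; events are (timestamp_s, payload) pairs."""
    counts = Counter()
    for timestamp_s, _payload in events:
        counts[fixed_window_start(timestamp_s, size_s)] += 1
    return dict(counts)


# Hypothetical cellular events: (event time in seconds, event type).
events = [(5, "call"), (59, "sms"), (60, "call"), (125, "data")]
window_counts = count_per_window(events)
```

Each event lands in exactly one window, which is what distinguishes fixed windows from sliding ones.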

Streaming Lab | Hard
IoT Analytics
Est: 45 mins

Aggregate device temperature readings using sliding windows.
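
To give a feel for sliding windows, the following sketch places each reading into every overlapping window that covers it and then averages per window. It is plain Python, not the Apache Beam API; the 60-second size, 30-second period, sample readings, and function names are assumptions chosen for illustration.

```python
from collections import defaultdict


def sliding_window_starts(timestamp_s, size_s=60, period_s=30):
    """Return the starts of all sliding windows that contain timestamp_s.

    Windows start at multiples of period_s and span size_s seconds.
    """
    start = timestamp_s - (timestamp_s % period_s)  # latest window start
    starts = []
    while start > timestamp_s - size_s:
        starts.append(start)
        start -= period_s
    return sorted(starts)


def average_per_window(readings, size_s=60, period_s=30):
    """Average (timestamp_s, temperature) readings per sliding window."""
    buckets = defaultdict(list)
    for timestamp_s, temperature in readings:
        for start in sliding_window_starts(timestamp_s, size_s, period_s):
            buckets[start].append(temperature)
    return {start: sum(vals) / len(vals) for start, vals in buckets.items()}


# Hypothetical device readings: (event time in seconds, temperature).
readings = [(10, 20.0), (40, 22.0), (70, 24.0)]
averages = average_per_window(readings)
```

Because the period is shorter than the window size, every reading contributes to more than one window, which smooths the averages across adjacent windows.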

Streaming Lab | Hard
Website Clickstream
Est: 45 mins

Count webpage views in real time using streaming windows.

Streaming Lab | Hard
Fraud Detection
Est: 50 mins

Detect banking transactions that occur in rapid succession to flag potential fraud.

Advanced Lab | Hard
Session Analytics
Est: 60 mins

Group website click logs into user session windows to measure engagement.
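
The session-window idea can be previewed with a small sketch: clicks from one user belong to the same session as long as the pause between consecutive clicks stays below a gap threshold. This is plain Python rather than the Apache Beam API, and the 600-second gap, the sample click log, and the function name are assumptions for illustration.

```python
def sessionize(timestamps, gap_s=600):
    """Split click timestamps into sessions separated by pauses of gap_s or more."""
    sessions = []
    for ts in sorted(timestamps):
        if sessions and ts - sessions[-1][-1] < gap_s:
            sessions[-1].append(ts)  # continues the current session
        else:
            sessions.append([ts])  # pause too long: start a new session
    return sessions


# Hypothetical click log: user id -> click times in seconds.
clicks = {"alice": [0, 120, 1000, 1100], "bob": [50]}
sessions_by_user = {user: sessionize(times) for user, times in clicks.items()}

# One simple engagement measure: number of clicks in each session.
clicks_per_session = {
    user: [len(session) for session in sessions]
    for user, sessions in sessions_by_user.items()
}
```

Unlike fixed or sliding windows, session boundaries depend on the data itself, so each user ends up with windows of different lengths.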

Advanced Lab | Hard
Kafka Pipeline
Est: 60 mins

Consume messages from an Apache Kafka topic and output the parsed JSON payloads.

Advanced Lab | Hard
Pub/Sub to BigQuery
Est: 60 mins

Design a streaming ingestion pipeline from Google Cloud Pub/Sub to Google BigQuery.

Advanced Lab | Hard
Real-Time Dashboard Pipeline
Est: 60 mins

Calculate continuous dashboard aggregations with dynamic triggers.
