-
Onboard
For Tech Teams
- Reduce initial time to productivity.
- Increase employee tenure.
- Plug-and-play into HR onboarding and career pathing programs.
- Customize for ad-hoc and cohort-based hiring approaches.
-
Upskill
For Tech Teams
- Upgrade and round out developer skills.
- Tailor to tech stack and specific project.
- Help teams, business units, centers of excellence and corporate tech universities.
-
Reskill
For Tech Teams
- Offer bootcamps to give employees a running start.
- Create immersive and cadenced learning journeys with guaranteed results.
- Supplement limited in-house L&D resources with all-inclusive programs to meet specific business goals.
-
Design
For Tech Teams
- Uplevel your existing tech learning framework.
- Extend HR efforts to provide growth opportunities within the organization.
- Prepare your team for an upcoming tech transformation.
Get your team started on a custom learning journey today!
Our Boulder, CO-based learning experts are ready to help!
Course Summary
The Intermediate Google Cloud for Data Engineers training course is designed to advance the skills of those students who are already familiar with data engineering capabilities of Google Cloud to build specialized types of data pipelines, including those for machine learning, streaming data analytics, and recommendation systems.
The course starts by exploring data engineering with unbounded data sets and how streaming data analytics pipelines built with Apache Beam and DataFlow compare to alternatives, including lambda architecture. After working on a data pipeline using BigTable, DataFlow, and BigQuery, students will learn about what it takes to create data pipelines for machine learning and recommendation systems. The course concludes with covering the importance of reproducibility when creating training, evaluation, and test data and then will use TensorFlow together with Apache Beam for feature engineering of both structured and unstructured data.
Before attending this course, students should take the Google Cloud for Data Engineers course or be familiar with all of the topics listed here: Google Cloud for Data Engineers
- Productivity Objectives:
- Construct data processing pipelines for streaming data analysis and machine learning.
- Create high performance, internet-scale, low-latency data stores with BigTable.
- Develop data pipelines to support machine learning model training and serving.
- Employ TensorFlow, DataFlow, and BigQuery for unstructured and structured data pipelines.
- Design and propose scenarios for large scale data migrations to Google Cloud.
Request Information
Get your team upskilled or reskilled today. Chat with one of our experts to create a custom training proposal. Fully customized at no additional cost.

If you are not completely satisfied with your training class, we'll give you your money back.




about our training
-
Real-World Content
Project-focused demos and labs using your tool stack and environment, not some canned "training room" lab.
-
Expert Practitioners
Industry experts with 15+ years of industry experience that bring their battle scars into the classroom.
-
Experiential Learning
More coding than lecture, coupled with architectural and design discussions.
-
Fully Customized
One-size-fits-all doesn't apply to training teams. That's where we come in!
What You'll Learn
In the Intermediate Google Cloud for Data Engineers training course, you'll learn:
- Data Engineering for Unbounded Datasets
- Bounded vs. Unbounded Datasets
- Data Velocity vs. Volume + Variety
- Challenges and Solutions for Streaming Data Pipelines
- Lambda Architecture
- Apache Beam and DataFlow
- Advanced DataFlow for Streaming Data
- Integration with Cloud Pub/Sub
- Data De-duplication
- Late-arriving and Out-of-order Data
- Session and Sliding Windows
- Watermarks and Triggers
- Pipeline Side Inputs
- DataFlow Templates
- Advanced BigQuery for Streaming Data
- Streaming Data Warehousing
- SQL Analysis of Streaming and Batch Data
- De-duplication and Data Consistency
- Cost Estimation and Planning
- BigTable
- Use Cases for Low-Latency, Internet-Scale Storage
- Wide-Column NoSQL Storage
- Integration with Colossus Storage
- Queries with HBase API
- Key / Schema Design for BigTable
- BigTable Performance Optimizations
- Data Engineering for Machine Learning (ML)
- Machine Learning with Google Cloud
- Data Engineering for the ML Lifecycle
- Introduction to ML Use Cases
- Training, Validation, Test Datasets for ML
- Data Hashing for ML Reproducibility
- Data Engineering for Benchmarks with BigQuery ML
- Machine Learning Model Training vs. Serving
- Feature Engineering from Structured Data for ML
- Motivation for Feature Engineering
- Feature Pre-Processing vs. Feature Creation
- SQL and Apache Beam for Feature Engineering
- TensorFlow Transform API
- Feature Engineering for Unstructured Image Data for ML
- Image Transforms for Data Augmentation
- Google Colaboratory (Colab)
- TensorFlow Image API
- Image Format Conversion
- Image Resizing, Cropping, and Rotation
- Apache Beam for Image Data Augmentation
- Data Engineering for Recommendation Systems
- Recommendation Engines with Transactional Data
- Cloud SQL Databases for Recommendation Data
- Recommendation Engines with Apache Spark MLLib
- Hosting Recommendation Systems with Dataproc
- Data Migration to Google Cloud
- Cloud Data Migration Challenges
- Migration Scenarios and Destinations
Real-world content
Project-focused demos and labs using your tool stack and environment, not some canned "training room" lab.
Expert Practitioners
Industry experts that bring their battle scars into the classroom.
Experiential Learning
More coding than lecture, coupled with architectural and design discussions.
Fully Customized
One-size-fits-all doesn't apply to training teams. That's where we come in!

Elite Instructor Program
We recently launched our internal Elite Instructor Program. The community driven instructor program is designed to support instructors in transforming students’ lives by consistently showing a world-class level of engagement, ability, and teaching prowess. Reach out today to learn more about our instructors.
Customized Technical Learning Solutions to Help Attract and Retain Talented Developers
Let DI help you design solutions to onboard, upskill or reskill your software development organization. Fully customized. 100% guaranteed.
DevelopIntelligence leads technical and software development learning programs for Fortune 500 companies. We provide learning solutions for hundreds of thousands of engineers for over 250 global brands.



“I appreciated the instructor’s technique of writing live code examples rather than using fixed slide decks to present the material.”
VMwareAbout Us
LET’S DISCUSS
DevelopIntelligence has been in the technical/software development learning and training industry for nearly 20 years. We’ve provided learning solutions to more than 48,000 engineers, across 220 organizations worldwide.
Resources
Thank you for everyone who joined us this past year to hear about our proven methods of attracting and retaining tech talent.

- Boulder, Colorado Headquarters: 980 W. Dillon Road, Louisville, CO 80027
© 2013 - 2022 DevelopIntelligence LLC - Privacy Policy