Reliability Engineer, Google Cloud
Xindian District, New Taipei City, Taiwan

Google has one of the largest and most powerful computing infrastructures in the world. Your team is responsible for providing the manufacturing capability to deliver this state-of-the-art physical infrastructure.

As a Manufacturing Engineer, you evaluate the product designs and create the processes, tools and procedures behind Google's powerful search technology.

When vendors build parts for our infrastructure, you're right there alongside ensuring manufacturing processes are repeatable and controlled.

You collaborate with Commodity Managers and Design Engineers to determine Google's infrastructure needs and product specifications.

Your work ensures the various pieces of Google's infrastructure fit together perfectly and keep our systems humming along smoothly for a seamless user experience.

In this role, you will join the Google Cloud Infrastructure team as a hardware reliability engineering professional leading New Product Introduction (NPI) product reliability-related activities between our engineering teams, contract manufacturers and suppliers.

This will include identifying and managing risks and clearly communicating project deliverables and status to stakeholders.

You will break down complex problems into steps that drive product development and manufacturing in a fast-paced environment, and focus on the root cause, not the symptom.

As a Reliability Engineer, you will evaluate the product designs and create the processes, tools and procedures to help continuously improve our technical infrastructure by improving every aspect of product reliability.

You will influence and work very closely with various highly skilled engineering teams, product operations and supply chain teams to ensure the product represents the very best of Google's brand and ensures a reliability of service beyond today’s standards.

Behind everything our users see online is the architecture built by the Technical Infrastructure team to keep it running.

From developing and maintaining our data centers to building the next generation of Google platforms, we make Google's product portfolio possible.

We're proud to be our engineers' engineers and love voiding warranties by taking things apart so we can rebuild them. We keep our networks up and running, ensuring our users have the best and fastest experience possible.


  • Set the proper reliability plans, goals and priorities and put risk mitigation plans in place during NPI, providing design input to improve cloud product reliability.
  • Identify critical failure modes to influence and enable an optimal design and de-risk the product at an early stage of development.
  • Extract root causes from symptoms, create actionable insights from test and FA data.

  • Define and drive reliability test plans and collect / analyze / synthesize test data to enable evaluation, implementation, and verification of the design reliability across multiple functional teams.
  • Maintain relationships with outside testing labs and internal groups, as well as Contract Manufacturer (CM) partners, while developing in-house test and qualification capabilities where needed.
  • Collaborate and work synergistically with several of Google’s cross-functional teams.
  • Minimum qualifications :

  • Bachelor's degree in Electrical Engineering, Industrial Engineering or Mechanical Engineering, related field or equivalent practical experience.
  • 7 years of working experience in quality or reliability engineering of computing or network infrastructure hardware.
  • Experience conducting DFMEA, DOE, derating analysis, test plans, ORT, RIT and system-level reliability analysis.
  • Experience with failure analysis and fault isolation techniques.
  • Preferred qualifications :

  • Master's degree or PhD in Electrical Engineering, Industrial Engineering or Mechanical Engineering or equivalent practical experience.
  • 10 years of working experience in quality or reliability engineering of cloud infrastructure hardware and technology.
  • Experience leading cross-functional problem solving teams using practical approaches (e.g., with demonstrated technical leadership, de-escalation skills and executive communication).
  • Advanced practical understanding and experience with failure analysis and fault isolation techniques and how to apply them to isolate root causes for complex technical problems.
  • Knowledge of Industry Test Standards (JEDEC, ASTM, IEEE).
  • 报告这项工作

    Thank you for reporting this job!

    Your feedback will help us improve the quality of our services.

    通過點擊“持續”,我允許neuvoo同意處理我的數據並向我發送電子郵件提醒,詳見neuvoo的 隱私政策 。我可以隨時撤回我的同意或退訂。