Designing Reliable Systems

lazy-deletion

This article highlights a few proven strategies for designing reliable systems, covering single points of failure, correlated failures, cascading failures, query-of-death overload, and lazy deletion, along with solutions. In the cloud era, designing reliable systems is critical for withstanding system or cloud failures, especially when the number of users grows rapidly because a system gains popularity overnight. These strategies focus mainly on Google Cloud; however, you can apply them in any cloud, including a private cloud.

Google Cloud Professional DevOps Engineer Q120

question-and-answer

You are building and deploying a microservice on Cloud Run for your organization. Your service is used by many applications internally. You are deploying a new release, and you need to test the new version extensively in the staging and production environments. You must minimize user and developer impact. What should you do?

Google Cloud Professional DevOps Engineer Q119

question-and-answer

You work for a global organization and run a service with an availability target of 99% with limited engineering resources. For the current calendar month, you noticed that the service has 99.5% availability. You must ensure that your service meets the defined availability goals and can react to business changes, including the upcoming launch of new features. You also need to reduce technical debt while minimizing operational costs. You want to follow Google-recommended practices. What should you do?

Google Cloud Professional DevOps Engineer Q118

question-and-answer

You are developing the deployment and testing strategies for your CI/CD pipeline in Google Cloud. You must be able to:
– Reduce the complexity of release deployments and minimize the duration of deployment rollbacks.
– Test real production traffic with a gradual increase in the number of affected users.
You want to select a deployment and testing strategy that meets your requirements. What should you do?