Cloud-Native Recovery Tool, BOSH Backup & Restore, Now Available in Public Beta

July 10, 2017 Therese Stowell

The largest companies in the world run their most important apps on Cloud Foundry. Their stories were on display at Cloud Foundry Summit last month.

Operators have a range of approaches for ensuring they can recover Cloud Foundry, apps, and data in case of a disaster. The approaches fall into two categories: backing up the raw data or automating recreation of the data, and both have associated issues and complexity.

We set out to change that. Our recommended solution for the community: BOSH Backup and Restore, now a beta in Pivotal Cloud Foundry 1.11.

BOSH Backup and Restore (BBR) is going to be the way to backup and restore distributed systems on any cloud #cfsummit #cloudfoundry
— Alex Ley (@AlexEvade) June 15, 2017

Burning Down the House: How to Deal with Disaster Recovery in Cloud Foundry

What is BOSH Backup & Restore (BBR)?

BBR is a framework for backing up and restoring BOSH deployments and BOSH directors. It orchestrates triggering the backup or restore process on the deployment or director, and transfers the backup artifact to and from the deployment or director.

Our ideal solution needed to answer two questions. First, how do you take a distributed backup that’s consistent? And second, how do you avoid modifying your backup scripts every time Cloud Foundry changes?

BBR does that by defining a contract between the backup orchestrator and the component to be backed up. The orchestrator calls scripts on the components to be backed up and restored, and the components are responsible for generating the backup and restoring the backup.

The orchestrator is the BBR binary, and the component is a BOSH job.

To enable consistency, the component (a BOSH release) can implement locking with a pre-backup-lock script and a post-backup-unlock script. Backups are triggered per BOSH deployment: the pre-backup-lock scripts on all the jobs in the deployment are called before any backup script is called.

The BOSH Backup and Restore script execution sequence. Note that the order of calling a particular type of script (e.g. pre-backup-check) is not guaranteed across instance groups and instances within a group (e.g. foo/job1 may run before foo/job2). Also, the terminology in this diagram follows BOSH 2.0 conventions.
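The lock, backup, unlock sequence described above can be sketched in a few lines of Python. This is only an illustration of the ordering rule, not how the BBR binary is implemented: the `Job` class, the `run_backup` function, and the job names are hypothetical, and the scripts are stand-in functions rather than real BOSH job scripts.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


# Hypothetical model of a BOSH job and the scripts it ships (not BBR's real types).
@dataclass
class Job:
    name: str
    scripts: Dict[str, Callable[[], None]] = field(default_factory=dict)


# Script types named in the post, in the order the orchestrator calls them.
PHASES = ["pre-backup-lock", "backup", "post-backup-unlock"]


def run_backup(jobs: List[Job]) -> List[str]:
    """Finish each phase across all jobs before starting the next phase."""
    log = []
    for phase in PHASES:
        # The order of jobs inside one phase is not guaranteed by the contract;
        # this sketch simply uses list order.
        for job in jobs:
            script = job.scripts.get(phase)
            if script is None:
                continue  # a job only takes part in the phases it implements
            script()
            log.append(f"{job.name}:{phase}")
    return log


def noop() -> None:
    """Stand-in for a real script."""


jobs = [
    Job("database", {"pre-backup-lock": noop, "backup": noop, "post-backup-unlock": noop}),
    Job("blobstore", {"backup": noop}),
]
log = run_backup(jobs)
print(log)
```

Because the loop over phases is the outer loop, every lock script has finished before the first backup script starts, which is the property that makes a consistent distributed backup possible.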

The authors of the component write and maintain the backup and restore scripts for that component, and the scripts are packaged with the component. As a result, scripts can stay in sync with the component, avoiding compatibility issues. And the scripts can be smart, only backing up / restoring required data and, if necessary, performing processing like encryption or credential generation.

There’s a lot of BOSH here. Isn’t this supposed to be for Cloud Foundry? Well, yes! Consider that:

All components in Cloud Foundry are BOSH deployments
The BOSH director is a BOSH release
For an operator, a BOSH deployment is the logical unit of backup

This is our rationale for BOSH Backup and Restore. The key to making this all work: the responsibility for writing and maintaining backup and restore scripts sits with the BOSH release author.

We gave a talk at CF Summit 2017 on BOSH Backup and Restore. Check it out if you’d like to hear more about the service, and how we got to this point.

We’ve also proposed BOSH Backup and Restore as an open-source Cloud Foundry extension! We’ll keep you posted on our progress.

How It Works: First, Create The Backup Artifact. Then, Put It Back.

To understand how BBR works, let’s look at the steps that happen once an operator initiates a backup. The BBR binary is run from a jumpbox that has access to PCF deployments. The operator triggers a backup for a BOSH deployment (or director) using the CLI. The BBR binary then looks at the jobs in the deployment (or director) for lock / backup / unlock scripts. The binary then triggers those scripts in the prescribed order. The backup artifacts are transferred to the jumpbox. The operator then transfers the artifacts to external storage.
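As a rough sketch of the discovery and transfer steps above, the following Python simulates a deployment with local temporary directories instead of real VMs. The function names, job names, and directory layout are assumptions made for illustration; they are not the BBR binary's actual layout or API.

```python
import shutil
import tempfile
from pathlib import Path
from typing import Dict, List


def find_backup_jobs(job_scripts: Dict[str, List[str]]) -> List[str]:
    """Return the jobs that ship a backup script (job names are illustrative)."""
    return sorted(job for job, scripts in job_scripts.items() if "backup" in scripts)


def collect_artifacts(remote_root: Path, jumpbox_dir: Path, jobs: List[str]) -> List[Path]:
    """Copy each job's artifact directory to the jumpbox, one folder per job."""
    collected = []
    for job in jobs:
        target = jumpbox_dir / job
        shutil.copytree(remote_root / job, target)
        collected.append(target)
    return collected


# Hypothetical deployment: one job with backup scripts, one without any.
job_scripts = {
    "uaa": ["pre-backup-lock", "backup", "post-backup-unlock"],
    "router": [],
}

with tempfile.TemporaryDirectory() as tmp:
    remote = Path(tmp, "deployment")  # stands in for the deployment's VMs
    jumpbox = Path(tmp, "jumpbox")    # stands in for the operator's jumpbox
    (remote / "uaa").mkdir(parents=True)
    (remote / "uaa" / "dump.sql").write_text("-- pretend database dump\n")
    jumpbox.mkdir()

    jobs = find_backup_jobs(job_scripts)
    copied = collect_artifacts(remote, jumpbox, jobs)
    copied_files = sorted(p.name for p in copied[0].iterdir())

print(jobs, copied_files)
```

Jobs without a backup script are skipped, so only the components that opted in contribute to the artifact. Moving the artifact from the jumpbox to external storage is left to the operator, as in the post.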

A restore is the inverse of this process: the backup artifact must be copied onto the jumpbox where the BBR binary is located. Then the restore process is triggered by the operator using the CLI, specifying the deployment or director to restore, as well as the path to the backup artifact. The BBR binary identifies which jobs implement the restore script, copies the matching backup artifact into the job, and triggers the restore script.
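The matching step in a restore can be sketched as below. The `plan_restore` function and the job names are hypothetical, and refusing to continue when a restorable job has no artifact is an assumption of this sketch rather than documented BBR behaviour.

```python
from typing import Dict, List, Set


def plan_restore(job_scripts: Dict[str, Set[str]], artifact_jobs: Set[str]) -> List[str]:
    """Return the jobs to restore: those that ship a restore script."""
    restorable = sorted(job for job, scripts in job_scripts.items() if "restore" in scripts)
    missing = [job for job in restorable if job not in artifact_jobs]
    if missing:
        # Assumption: stop early if the artifact has no data for a restorable job.
        raise ValueError(f"backup artifact has no data for: {', '.join(missing)}")
    return restorable


# Hypothetical deployment: only the uaa job implements backup and restore.
job_scripts = {"uaa": {"backup", "restore"}, "router": set()}
plan = plan_restore(job_scripts, artifact_jobs={"uaa"})
print(plan)
```

Each job in the returned plan would then receive its part of the artifact before its restore script is triggered.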

Beta Testers Wanted!

We’re full-speed ahead on a GA release. To help us get there quickly, sign up to beta test the product! BBR does require Pivotal Cloud Foundry 1.11; it also supports a subset of modules today (CredHub, UAA, the BOSH Director, Elastic Runtime in Pivotal Cloud Foundry). Support for open-source Cloud Foundry and data services is on the roadmap.

One other note: backup & restore needs an ecosystem. So we’re building one! BOSH Backup and Restore solves the core problem of creating a backup artifact, then putting it back. We are leaning on third parties to solve encryption, scheduling, permissions, and secondary backup sites. Watch this space!

About the Author

Therese Stowell is Product Manager at Pivotal. She has worked in the software industry for 20+ years as a programmer, interface designer, and product manager. She worked on Windows, developing the command line environment, founded a successful social enterprise, and was part of a startup team that won a Nesta Open Data Institute £40,000 prize. She also has an MA in Fine Art.
