How I analyzed Backblaze SMART Data

July 8, 2019

This post briefly explains how I took the raw SMART data (drive health data) from Backblaze and turned it into the insights in my previous post.

Backblaze posts all of its SMART data on the following page: https://www.backblaze.com/b2/hard-drive-test-data.html

I downloaded the zip files and imported the data into MySQL. I then used the SELECT query below to create a VIEW or TABLE, connected it to Microsoft Excel using Power Query, and pivoted the data to produce the insights I wrote about.
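To give a feel for this kind of import-then-aggregate step, here is a minimal sketch. It is not the query I used: it swaps MySQL for Python's built-in SQLite so it runs on its own, the table name `drive_stats` is hypothetical, and the sample rows are made up. The column names (`date`, `serial_number`, `model`, `capacity_bytes`, `failure`) follow the Backblaze daily CSV files, which also carry many `smart_*` columns that are left out here.

```python
import csv
import io
import sqlite3

# Tiny stand-in for one Backblaze daily CSV (made-up values, smart_* columns omitted).
SAMPLE_CSV = """date,serial_number,model,capacity_bytes,failure
2019-01-01,A1,MODEL_X,4000787030016,0
2019-01-01,B1,MODEL_Y,8001563222016,0
2019-01-02,A1,MODEL_X,4000787030016,1
2019-01-02,B1,MODEL_Y,8001563222016,0
"""


def load_rows(conn, csv_text):
    """Create the (hypothetical) drive_stats table and insert the CSV rows."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS drive_stats ("
        "date TEXT, serial_number TEXT, model TEXT, "
        "capacity_bytes INTEGER, failure INTEGER)"
    )
    reader = csv.DictReader(io.StringIO(csv_text))
    conn.executemany(
        "INSERT INTO drive_stats VALUES (?, ?, ?, ?, ?)",
        [
            (r["date"], r["serial_number"], r["model"],
             int(r["capacity_bytes"]), int(r["failure"]))
            for r in reader
        ],
    )


def summarize_by_model(conn):
    """Aggregate per model. Each row is one drive on one day, so COUNT(*) is drive days."""
    return conn.execute(
        "SELECT model, COUNT(*) AS drive_days, "
        "COUNT(DISTINCT serial_number) AS drives, "
        "SUM(failure) AS failures "
        "FROM drive_stats GROUP BY model ORDER BY model"
    ).fetchall()


conn = sqlite3.connect(":memory:")
load_rows(conn, SAMPLE_CSV)
summary = summarize_by_model(conn)
for row in summary:
    print(row)
```

A summary table shaped like this (one row per model) is the sort of thing that is easy to pull into Excel and pivot further.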

SQL query used to transform the raw SMART data:

[Image: SQL query]

Additional data sources used:

  • Wayback Machine (to gather previous website claims of data stored)
  • Backblaze blog posts and pictures (to understand rack units, the number of enclosures per rack, and the data protection scheme, which gives the maximum physical-to-stored data ratio)
  • Google searches for average colocation costs per rack unit
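
The arithmetic behind these sources can be sketched roughly as follows. This is an illustration only: the function names are hypothetical, and every number in the example (the 17 data plus 3 parity shard split, the capacity, the enclosure size, and the price per rack unit) is a placeholder assumption rather than a figure from my analysis.

```python
def max_stored_tb(raw_tb, data_shards, parity_shards):
    """Upper bound on stored data for a given raw capacity.

    With an erasure-coding scheme of data_shards + parity_shards,
    only the data_shards share of raw capacity can hold customer data.
    """
    return raw_tb * data_shards / (data_shards + parity_shards)


def monthly_rack_cost(enclosures, rack_units_per_enclosure, cost_per_rack_unit):
    """Colocation cost for a set of enclosures, priced per rack unit."""
    return enclosures * rack_units_per_enclosure * cost_per_rack_unit


# Placeholder inputs (assumptions, not measured values).
stored = max_stored_tb(raw_tb=1000, data_shards=17, parity_shards=3)
cost = monthly_rack_cost(enclosures=10, rack_units_per_enclosure=4,
                         cost_per_rack_unit=25.0)

print(f"max stored: {stored} TB")
print(f"monthly colocation cost: {cost}")
```

Comparing a bound like `stored` against the amount of data a provider claims to store is one way to sanity-check those claims.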

Any questions about this, just ask!
