Chris Essig

Walkthroughs, tips and tricks from a data journalist in eastern Iowa

Archive for June 2012

Spreadsheet tutorials to help journalists clean and format data

leave a comment »

Note: This is cross-posted from Lee’s data journalism blog. Reporters at Lee newspapers can read my blog over there by clicking here.

I was talking to colleague the other day who was interested in getting into computer programming and more data projects. He asked where the best place to start was.

My gut reaction was to tell him to learn the basics of spreadsheets. Almost all of the data I have used with projects — whether they end up being a map or a graphic — were initially set up in spreadsheet form.

Most of us are familiar with spreadsheets and have likely worked with Microsoft Excel. Agencies both big and small often gather spreadsheets of useful information for us to use with our stories.

The problem is most of the time the data isn’t formatted right or contains inaccuracies like misspellings. That is where the magic of spreadsheet formulas can come in to help organize your data.

Here’s a few resources you might find handy for working with spreadsheets.

1. Poynter: How journalists can use Excel to organize data for stories

This walkthrough is great because it is written directly for journalists. It is also intended for beginners so those with no spreadsheet knowledge will be able to keep up. Finally, the walkthrough is intended for both print and online journalists. So journalists who have no intention of making visualizations will still find a handful of features in Excel to help them with their day-to-day reporting.

2. My Favorite (Excel) Things

This PDF provided by Mary Jo Webster at the St. Paul Pioneer Press is a great, concise list of Excel formulas she uses all the time. It includes formulas on how to format dates, run sums and even run if-else statements in Excel. It’s one of my favorite resources for Excel and definitely worth bookmarking.

3. Google Docs – Spreadsheet resources

Not a fan of Excel or don’t have it installed on your work computer? Fortunately you can make a spreadsheet with Google by logging into Google Drive and clicking “Create > Spreadsheet.” The best part is spreadsheets you create with Google can be accessed from any computer as long as you log in with your Google account.

Here’s some resources for getting started with Google spreadsheets:

4. Google Refine resources

Sometimes you need more than just spreadsheet formulas to clean dirty data. That’s where the powerful Google Refine program can come into play. The program was designed to clean dirty data by finding inconsistencies in your spreadsheets. It can also help you sort data, add to data, transform it from one service to another and much, much more.

Here’s some resources you might find handy:

 5. NICAR-L e-mail list

Still stuck? Fortunately, there is a wonderful community of computer-assisted reporters who are more than willing to help others out. If you want great information on spreadsheets or any other data journalism topic, check out the National Institute of Computer-Assisted Reporting’s email list. Questions on Excel come up almost every day.

Have any other useful resources not listed here? E-mail me at chris.essig@wcfcourier.com.

Advertisements

Written by csessig

June 25, 2012 at 9:37 am

Data journalism resources

leave a comment »

Note: This is cross-posted from Lee’s data journalism blog. Reporters at Lee newspapers can read my blog over there by clicking here.

Okay this is going to be a quick blog post since we and every other Lee newspaper are switching over our sites to Templates 2.0. Here’s a couple of neat websites I’ve ran across in the past couple of weeks that may help with your next data story and/or visualization:

1. Data Journalism Handbook

Want to know about the importance of data journalism today and what is possible for you to do at your own newspaper? Then this handbook is a must read. One of the great aspects of the book is it shows how data can be used not only to make impressive graphics but also great stories. And best of all it is free and growing.

2. DataVisualization: Selected tools

Here’s a great resource for many of the tools used by data journalists across the world. For instance, Google Fusion Tables is the go-to service for map-makers, while Google Refine is a great program for cleaning data to use with maps, visualizations or just stories. Also check out ColorBrewer to help you find great color patterns.

One Javascript library worth checking out is D3.js, which has been used by the New York Times to create some jaw-dropping visualizations. Also worth noting are two blog posts from software developer Jim Vallandingham. One deals with making bubble charts with D3.js. The second shows how to use the library to work with past versions of Internet Explorer. This is a must read for sites like ourselves that have a high percentage of readers who use Internet Explorer.

3. Data Stories

If you are like me, you love listening to podcasts, especially after work when your eyes can no longer stare at your computer screen. This site features a series of podcasts on data visualization. Topics include how to learn data visualization, when to use animated graphics and other advice from people smarter than I will ever be. So check it out if you are looking for something new to listen to while you hit the gym, bike trail or couch.

Written by csessig

June 5, 2012 at 2:00 pm