Google 101
Christophe Bisciglia, an engineer at Google, is teaching a course at the University of Washington. The course focuses on problem solving on large-scale clusters. The complete course material is available on the course homepage.
I think there’s huge potential for programs that analyze very large
amounts of data. Most data analysis is done either in Excel or
with command-line tools such as awk and sed. A smart way to
distribute that post-processing or analysis across a cluster would be cool.