The Boston Hack/Reductions

On June 25th Hack/Reduce coders took over the NERD center in Cambridge. It was the biggest Hack/Reduce yet with 90 coders hacking on big data projects on a total of 600 nodes provided by Hopper and Cloudant.

As always, many were humbled by crashing bits of code but that was quickly forgotten while frantically reloading a white screen with some percentages representing your job being processed. An admittedly weird way to spend your Saturday, but you just can’t say no to the opportunity of having the power of hundreds of machines at your fingertips to serve your hastily crafted code.

In the end we let all the participants vote for the best Hack/Reduction, the winner was team 6 whose application visualized words that were most related to a chosen word. The related words were based on an algorithm that they created to calculate related words by processing all of wikipedia . We used polleverywhere from Boston for the voting, it worked great!

Results for Hack/Reduce Boston

The presentations

Team 1 Merging twitter and music data

Team: Pete Kruskall, Cinjon Resnick, Greg Sabo, Thierry Bertin-Mahieux , Bruce spang, Rob Speer, Thor Kell, William Dvorak, Nadav Aharony

Team 1 worked on a couple of different projects with the Million Song Dataset. They created nice looking visualizations of clusters of Musicians by merging the Million Song Dataset and the Twitter social graph dataset.  They also calculated the similarity between twitter handles of musicians based on who they followed and created nice dynamic visualizations of the relationships.

Team 2 How book mentions correlates to stock prices.

Team: Ben Popp, Adam Buggia, Andrew Rollins, Michael Axiak, Manish Maheshwari

Team 2 wanted to calculate how mentions in books correlates to stock prices. They used Google n-grams and NYSE stock data.

The results? “nobody’s gona get rich” (That’s a quote from their presentation, not our evaluation.)

Team 3 – Location popularity in history based on book mentions.

Matt Veitas, Alex Harris, Grace Woo, Pablo Azar, Justin Ryan

Team 3 wanted to see how popular locations have been in literature starting way back in 1700. They did this by calculating the amount of mentions in books for every location. The jobs were still running when we had the presentations so we didn’t get to see the final results… Great effort though from this team that hadn’t worked with Hadoop before.

Team 4 – How much is Tom Cruise worth for a movie?

Tuan Phan, Ekaterina Lesnaia, Jason Nochlin

Team 4 wanted to estimate the value of Tom Cruise by analyzing the IMDB dataset and the value an actor has on the gross sales of a movie. I think the results said that Tom was worth at least 1.6 Million for a movie, which means that the studios might have to rethink the salaries they’re paying.

Team 5 – Federal election donation clusters

Eric Brown-Munoz, Jacob Elder, Ben Darfler, Jim Gammill, Van Simmons

Team 5 wanted to match and cluster campaign donators from federal election data. We feared some government officials might turn up but it all ended ok. It turned out that merging the different types of names used in the dataset to one name was difficult and no politically sensitive results were achieved.

Team 6 – Winner! Clustering related words from wikipedia

Satish Gopalakrishnan, Vineet Manohar

Satish and Vineet wanted to create an application that would find a list of associated words for any chosen word. They created a distance algorithm that ranked words based on how close to the original word they were mentioned in wikipedia articles.  To get the results, they scanned through the wikipedia dataset and looked for the associated words for “McCain”, “Erlang” and “Reebok”.
winning team presenting

Team 7 – EnglishCentral – Which are the most difficult words for english learners?

JM Van Thong , Don McAllaster, Jonathon Marston

Team 7 wanted to analyze a dataset from EnglishCentral to find the words that English learners from specific countries have the most trouble with in spoken language. The dataset had 60 million recordings of learners learning English.

The most challenging word for Japanese English learners was the word “really”.

Next the team wants to find the 100 most difficult words per country for learners from around the world.

The team also discussed that it was very useful to learn how you had to break one task in to small chunks in order to be able to process it with Hadoop.

Team 8 – mappers for freebase dataset

Tom Morris
Tom Morris created mappers for freebase to make it easier at future Hack/Reduce events to use the freebase dataset. Thanks Tom!

Team 9 – Quant Finance

Dhanvi Reddy, Alban Chevignard, Ajit Padukone, Kah Keng Tay

Team 9 wanted to try MapReduce for quant finance. They basically created different portfolio strategies based on historical performance that they could then evaluate. The process they had to go through was to calculate:

  1. Monthly returns from daily prices for all stocks
  2. Create a model from monthly returns (a forecast of returns & risk)
  3. Create and test portfolio weights based on the model created
  4. Analyzing the portfolio return

The portfolio strategies tested were different versions based on historical performance.

Until next time…

We want to thank all of the participants for an amazing event, see you soon again! We also want to thank the sponsors, Hopper and Cloudant and Microsoft for offering us the space!

Then next Hack/Reduce will be organized right after summer, stay tuned!




  1. Trackback: Homepage
  2. Trackback: Where to Get Payday Loans Online
  3. Trackback: Miscelatori verticali
  4. Trackback: Dau
  5. Trackback: Lawyer Divorce Arizona
  6. Trackback: Vemma Reviews
  7. Trackback: John Walmer
  8. Trackback: vancouver bc injury attorney
  9. Trackback: Six flags tickets
  10. Trackback: facebook hack
  11. Trackback: gta
  12. Trackback: portraiture
  13. Trackback: 15m hdmi cable
  14. Trackback: Virtual Tours
  15. Trackback: check this out
  16. Trackback: flachen bauch
  17. Trackback: Calgary SEO Consultant
  18. Trackback: muscle factor x reviews
  19. Trackback: web agency torino
  20. Trackback: find out more
  21. Trackback: Consigli su progettazione elettronica nella zona di Cuneo
  22. Trackback: dc movers
  23. Trackback: start your own web business
  24. Trackback: Peak Life Capsules
  25. Trackback: VitaCore Joint Capsules
  26. Trackback: click here
  27. Trackback: technology blog online
  28. Trackback:
  29. Trackback: playpokemon-games
  30. Trackback:
  31. Trackback: divorce mediation attorneys
  32. Trackback: MLG
  33. Trackback: Simple Sixpack Test
  34. Trackback:
  35. Trackback: calgary Home for sale
  36. Trackback: happywheelsdemo2
  37. Trackback: Calgary Website Design
  38. Trackback: Calgary Web Design
  39. Trackback: Calgary Web Design
  40. Trackback: Calgary Plumbing reviews
  41. Trackback: ceavbjhnmkjnhbgefsdf
  42. Trackback: go right here Tempe DUI lawyers
  43. Trackback: csmrynkmgfgktjglsrtmsgxn
  44. Trackback: order garcinia cambogia max
  45. Trackback:
  46. Trackback: testosterone supplements
  47. Trackback: visit homepage Phoenix DUI lawyers
  48. Trackback: cnrgjhgbvdbhfjngmnhbsgv
  49. Trackback: cara merawat wajah
  50. Trackback: lening simulatie
  51. Trackback: csngrdngthnfgdsfgnsfsd
  52. Trackback: cars
  53. Trackback: pure garcinia cambogia scam
  54. Trackback: garcinia cambogia dr oz
  55. Trackback: Anonymous
  56. Trackback: finanzas
  57. Trackback: what is hosting
  58. Trackback: paintless dent removal training
  59. Trackback: Janitorial Supplies
  60. Trackback: seguros en Mexico
  61. Trackback: pdr-training
  62. Trackback: Oceanfront Suites Floorplan
  63. Trackback: freehold property in singapore
  64. Trackback: supplement to gain muscle
  65. Trackback: paintless dent repair training by
  66. Trackback: cnxdhffnefmnnrarcbfaxxrnf
  67. Trackback: travel club
  68. Trackback: Atlanta home builders
  69. Trackback: home remodeling Atlanta
  70. Trackback: handyman atlanta
  71. Trackback: handyman Marietta Ga
  72. Trackback: commonwealth tower
  73. Trackback: ถ่ายรูป
  74. Trackback:
  75. Trackback: gobiernos de puerto rico
  76. Trackback: dr oz 2 weeks diet
  77. Trackback: ecvxdgnjgvjhmknmbknhbfxb
  78. Trackback: dr oz 2 week diet plan
  79. Trackback: how can i get free coins for slotomania
  80. Trackback: chemicals in vapor cigarettes
  81. Trackback: believing these seven myths about happy wheels keeps you from growing
  82. Trackback: what he wants
  83. Trackback: vue 8 residence
  84. Trackback: new commercial
  85. Trackback:
  86. Trackback: deer antler extract
  87. Trackback: check website
  88. Trackback: melt your mans heart review
  89. Trackback: dr oz garcinia cambogia side effects
  90. Trackback: get ex girlfriend
  91. Trackback: doctor oz garcinia cambogia
  92. Trackback: dr oz weight loss garcinia cambogia
  93. Trackback: garcinia cambogia max free trial
  94. Trackback: testosterone boosters review
  95. Trackback:
  96. Trackback: does deer antler spray really work
  97. Trackback: insanity workout guide
  98. Trackback: garcinia cambogia extract and green coffee bean
  99. Trackback: acceptress beetlestone adglutinate
  100. Trackback: visit this site
  101. Trackback: trawnik
  102. Trackback: best hotels mombasa
  103. Trackback: recommended site
  104. Trackback: walk appendalgia binormal
  105. Trackback: dr oz
  106. Trackback: dr oz show
  107. Trackback: personal loans for veterans