Hack/Reduce Toronto Presentations

All videos from the presentations: Hack/Reduce Vimeo

Hi everyone, this is coming a bit late now since Hack/Reduce Toronto was already over a week ago, but we were too busy setting up for Boston to write about the presentations we saw in Toronto… Anyways, we had an amazing time in Toronto. Thanks to all of the participants who were working hard the whole day! Seeing people working hard, learning and enjoying Hack/Reduce is what makes it all worth while for us!

The day started off with coffee, a short introduction to using the cluster and pitches by the participants. The Hopper team then gave a short tutorial on Hadoop and Map/Reduce.

People got to work really quickly and the amount of noise in the beginning when the teams were discussing their projects was mind-blowing. A lot of buzz. Soon after 1 o’clock all the teams hunkered down to start coding and there was an eerie silence again…

In the end, the teams really got a lot done. We saw some really amazing presentations. I’ll give some short descriptions of the pitches here. I’ve put up all of the presentations we had on our Vimeo channel.

There are some interesting things to learn from the videos, mostly about the technologies used and tested, so I suggest you check them out!

In the end, 10 teams out of 21 ended up presenting:

Team 1

Check out the presentation on Vimeo

Bartek Ciszkowski (@bartek), Ash Christopher (@ashchristopher)

Bartek and Ash analyzed search queries that had been made during the course of one day. They grouped search queries in four categories: travel, sex, nerd and cooking. They then analyzed how the popularity of these categories in searches varied during the day.

Here are two pictures of the results, grouped by 1 minute and 30 minutes:

1 minute:

30 minutes:

The source code is on github at https://github.com/ashchristopher/HackReduceToronto.

Team 2

Check out the presentation on Vimeo

Joel Crocker (@joelcrocker), Johan Harjono (@jharjono), Joey Robert (@joeyrobert), Ian Stevens (@istevens)

Joel, Johan, Joey and Ian used a 10 000 song subset of the million song dataset. They were using the Disco distributed computing framework with Python.

They analyzed:

  • The most romantic year by looking for the word love in song titles.
  • The variation of words in song titles (Only 100 words are used in song titles)
  • Average song tempo per year
  • Song lengths per year
  • Saddest tones (Turns out D is really sad)
  • Recording locations.

The source code can be found on github: Github/joyerobert/hackreduce

Team 3

Check out the presentation on Vimeo

Gar Liu (@lonelydatum), Nathan Rambarran (@wibblz), Khurram Virani (@viranik)

Gar, Nathan and Khurram first wanted to figure out if oil prices affected flight prices. They took oil company stock prices and the average price from all the flights in the dataset. However, the flight dataset was limited and the results ambiguous, so the team changed direction. Next, they wanted to calculate which stocks were the most volatile in the NYSE data. They created a scoring algorithm to determine which stocks are the most volatile.

Team 3 used Mandy, an easy library to use Hadoop with Ruby. They also tried out Wukong, and don’t recommend that. Mandy worked very well though.

Team 4

Check out the presentation on Vimeo

Seak Pek Chhan, Nick Ursa (@nickursa), Athir Nvaimi, Gabe Sawhney

Pek, Nick, Athir and Gabe took a month and a half of Toronto bixi data and wanted to see if bixi data is affected by the weather. The answer is yes. Pek also took the Perl code and turned it into Python for fun.

Team 5

Check out the presentation on Vimeo

Stefan Arentz (@satefan), Olivier Yiptong (@sayhello), David Chang, Mike Pettypiece (@mtpettyp)

Stefan, Olivier, David and Mike had no prior experience with Hadoop. They used Python with mrjob. They analyzed DNS data for various things:

Average number of nameservers (it’s 2.25, max is 6)
Number of domains with a specific number of characters. (11 is the most popular)
Domains for which there exists most numbers of permutations of the same domain (mostly used by spammers. Every permutation of Yahoo and Youtube for example exist)

The team noted that the configuration of number of mappers and reducers is very important to speed up the jobs.

Team 6

Check out the presentation on Vimeo

Jordan Christiansen (Kobo, @thebigjc)

Jordan analyzed the correlations of every single stock pair on NYSE. The data started at 0.5 gb and expanded to 250gb when the pairs and prices had bee created. A linear regression was then run for the dataset ending up with 4M pairs. Some interesting correlations were found and Jordan ended up with a huge list of correlated stocks.

You can find the code on github: github.com/thebigjc/hackreduce

Team 7

Check out the presentation on Vimeo

Cleaver Barnes (@cleaverbarnes), Max Brodie (@maxwellbrodie), Shanly Suepaul, Matt MacLean

Cleaver, Max, Shanly and Matt ran their last job while the pitches were already under way.

They analyzed the “connectedness” of various tech communities based on the twitter social graph. It was done by choosing a couple of influencers per community and a person was determined to be part of the community if he followed any of the influencers of that community. For example, John Resig was a community leader in jQuery. You can check out the results in the video.

Team 8

Check out the presentation on Vimeo

Yong Liang

Yong worked on finding the cheapest flight combinations. He found the cheapest chained flights from Seattle. The projects was limited because of the limitation of the dataset (only flights from Seattle.)

Team 9

Check out the presentation on Vimeo

Christophe Biocca, Akash Vaswani, Jake Nielsen, Drew Gross

Team 9 wanted to   Basically the team ended up workeing on parsing wikipedia and came to the conclusion that it’s painful.

In the end they just calculated which article has the most outbound links, but it was uncertain if it actually worked correctly. The result was some error correction page, for more details, check the video.

Team 10

Check out the presentation on Vimeo

Jamie Wong (@jlfwong), Snady Wu, Wien Leung, Maverick Lee, Christopher Wu, Christopher Cooper

Team number 10 had members that worked on a couple of different projects:

Jamie Wong analyzed what made people notable from specific years based on the year they were born and what they had become famous for.

Snady Wu and Christopher Cooper worked on indound links to articles but were halted by the wikipedia parsing issues.

Christopher Wu worked on figuring out how long before you should by your flight in order to get the cheapest flight.


Thanks a lot for the event everyone. We also want to thank the sponsors, Hopper, Amazon, Kobo, Mantella Venture Partners, Chango, Attachments.me and Startupnorth

333 thoughts on “Hack/Reduce Toronto Presentations”

  1. Pingback: garcinia cambogia
  2. Pingback: onlinefanshop.net
  3. Pingback: hcg-diet
  4. Pingback: Kids
  5. Pingback: link
  6. Pingback: hotels in durban
  7. Pingback: Soccer IQ
  8. Pingback: novedades
  9. Pingback: hotel kenya
  10. Pingback: garcinia cambogia
  11. Pingback: burn fat
  12. Pingback: best e cigarettes
  13. Pingback: wypozycjonowanie
  14. Pingback: vapor cigarette
  15. Pingback: vapor pro e cig
  16. Pingback: identity thieves
  17. Pingback: 3 credit reports
  18. Pingback: ryokans kyoto
  19. Pingback: dr oz garcinia
  20. Pingback: garcinia cambogia
  21. Pingback: bron pneumatyczna
  22. Pingback: cohen team
  23. Pingback: more
  24. Pingback: cohen team
  25. Pingback: Custom Signs
  26. Pingback: Austin Local SEO
  27. Pingback: informacion
  28. Pingback: best burglar alarm
  29. Pingback: my explanation
  30. Pingback: new launch condo
  31. Pingback: H AND R login info
  32. Pingback: irc4.me
  33. Pingback: seo-superior.com
  34. Pingback: pixelitas.com
  35. Pingback: beauty secrets
  36. Pingback: dentrepairhelp.com
  37. Pingback: My Site
  38. Pingback: articulo
  39. Pingback: dieta opinie
  40. Pingback: try this site
  41. Pingback: potencja opinie
  42. Pingback: sam waltonman
  43. Pingback: Trackback
  44. Pingback: vigrx plus canada
  45. Pingback: iherb coupon
  46. Pingback: iherb coupons
  47. Pingback: Follow This URL
  48. Pingback: iherb coupon
  49. Pingback: nose doctor nyc
  50. Pingback: Try These Guys
  51. Pingback: friv games online
  52. Pingback: buy cialis
  53. Pingback: cialis lilly prix
  54. Pingback: more
  55. Pingback: Fundraiser
  56. Pingback: geil contact in
  57. Pingback: Web Site
  58. Pingback: Start
  59. Pingback: plombier paris 6
  60. Pingback: plombier paris 2
  61. Pingback: Advogado familia
  62. Pingback: tenant screening
  63. Pingback: Health
  64. Pingback: lambo door hinges
  65. Pingback: Finance
  66. Pingback: Liberty
  67. Pingback: kolikkopelit
  68. Pingback: iherb coupon code
  69. Pingback: hairy anus
  70. Pingback: Trading
  71. Pingback: north
  72. Pingback: iherb coupon
  73. Pingback: iherb coupon
  74. Pingback: mark cobb
  75. Pingback: Business
  76. Pingback: download songs
  77. Pingback: iherb coupons
  78. Pingback: iherb coupons
  79. Pingback: iherb coupons
  80. Pingback: Camp
  81. Pingback: château
  82. Pingback: cypress ac repair
  83. Pingback: product number
  84. Pingback: Technology
  85. Pingback: Home
  86. Pingback: buy original art
  87. Pingback: Diindolylmethane
  88. Pingback: Fashion
  89. Pingback: led bulbs
  90. Pingback: Adelgaza20
  91. Pingback: pressure cooker
  92. Pingback: click here
  93. Pingback: cigar of the month
  94. Pingback: linked web-site
  95. Pingback: Financial
  96. Pingback: direct payday loan
  97. Pingback: brazzers
  98. Pingback: smoking
  99. Pingback: Movemynt Products
  100. Pingback: selcal
  101. Pingback: t shirt
  102. Pingback: Learn More
  103. Pingback: kunststoff firmen
  104. Pingback: LINK M88
  105. Pingback: check this out
  106. Pingback: payday loans
  107. Pingback: go right here
  108. Pingback: kangan water
  109. Pingback: safety inspectors
  110. Pingback: haters
  111. Pingback: cialise 20
  112. Pingback: laser liposuction
  113. Pingback: dallas
  114. Pingback: DVLA Number Plates
  115. Pingback: Make money online
  116. Pingback: Zwiastuny filmowe
  117. Pingback: Gabriel
  118. Pingback: Energy Booster
  119. Pingback: acné quístico
  120. Pingback: birmingham hotels
  121. Pingback: canola
  122. Pingback: start-ups
  123. Pingback: cleaning
  124. Pingback: animacje 3d
  125. Pingback: Akcesoria dachowe
  126. Pingback: przylacza gazowe
  127. Pingback: seo
  128. Pingback: laptop backpack
  129. Pingback: Lingerie
  130. Pingback: slots
  131. Pingback: M88
  132. Pingback: Sword Art Online
  133. Pingback: Wp Weekend Phoenix
  134. Pingback: stripper denver
  135. Pingback: Green tea
  136. Pingback: Jewellery
  137. Pingback: from this source
  138. Pingback: paintings
  139. Pingback: Number plates
  140. Pingback: Number Plates
  141. Pingback: best payday loans
  142. Pingback: flights boracay
  143. Pingback: men fashion wear
  144. Pingback: scripts
  145. Pingback: Dallas remodeling
  146. Pingback: jak dbac o cialo
  147. Pingback: wino wino wino
  148. Pingback: ksiegowosc kraków
  149. Pingback: artykuly biurowe
  150. Pingback: best shift knobs
  151. Pingback: gcsrdsmgkrnggkxrv
  152. Pingback: Pompy zatapialne

Comments are closed.