Big Data Montreal #35 – Tuesday April 7th

Tickets: bdm35.eventbrite.ca

Big Data Montreal would like to invite you to its thirty-fifth meeting!

Join us on Tuesday April 7th at 6pm to attend the presentations and to network with other Big Data enthusiasts from Montreal!

The meeting will take place at the Centre Cloud.ca (formerly RPM Startup Centre), located at 420 Guy Street.

All are welcome, whether you already have some experience with Big Data technologies or are simply curious to learn more.

We have 2 flash presentations and 1 full presentation scheduled:

  • Flash presentation on Hive Transform vs Hadoop Streaming by Francis Pieraut
  • Flash presentation on Mediative’s Experience with Databricks Cloud: an Initial Report by Xavier Clements
    Databricks Cloud (DBCloud) promises to be “a single platform for big data processing” and to make its users’ lives easier in 4 areas: data visualization, distributed computation, third-party integration and production pipeline creation. At Mediative we have worked with this product for the last 3 months, and in this talk we will report on our experiences thus far. We have found that (1) the cluster management capabilities offered by DBCloud are very useful, (2) data exploration is fast and intuitive (thanks to a complete REPL that allows for short feedback loops and quick visualisations), and (3) the Databricks support is very professional and timely. We have also found some limitations that undermine the power of the tool. We will present examples supporting these findings, as well as some avenues of inquiry that we will be exploring in the future.
  • Full presentation on Big Data Warehousing at Shopify by Harry Brundage
    While many companies have focused on powering products with their big data, Shopify has instead focused first on building a well-modelled data warehouse to power better decisions across its rather wild business. This seems like a terribly uncool thing to do these days: it doesn’t involve much machine learning or advanced math, but it has been wildly successful at Shopify, and has laid a fantastic foundation for data scientists to start doing the fancy stuff. Shopify has built a highly structured, clean, and useful warehouse using new ideas like Apache Spark and old ideas like dimensional modelling to achieve technical and organizational scalability. This talk will be about how we model our data, why we think it is the right thing to do, and the hard-won lessons learned in deploying Spark and Hadoop for such a task.

N.B.: This edition will be in English.

There will be food and drinks courtesy of Shopify, our sponsor for this event.

Finally, you are also welcome to join us at the nearby Brasseurs de Montreal, after the presentations, for some casual networking (please use the appropriate ticket so we know how many people to expect).

Please tell your friends and colleagues :) !

P.S.: We are (as always) looking for speakers for future editions of BDM, so if you’re interested in presenting (or if you know people who are), please don’t hesitate to write to us at bdm-admin@googlegroups.com :) !



 

appaquet@gmail.com
