Big Data Montréal #42 – Tuesday November 3rd

Register here: http://bdm42.eventbrite.ca

Big Data Montreal would like to invite you to its 42rd meeting!

Join us on Tuesday November 3rd at 18h00 to attend a conference, as well as to network with other Big Data enthusiasts from Montreal!

The meeting will take place at the Cloud.ca Center (formerly RPM Startup Centre), which is located at 420 Guy street.

All are welcome, no matter if you already have some experience with Big Data technologies or if you’re simply curious to learn more.

We have two flash presentations and one main presentation scheduled:

  • Got BigData? Watch your server clock! by Nikolai Grigoriev, Principal Software Engineer at SocialLabs
    Everyone understands NTP and how simply is to run it. Yet, many people do not understand the importance of having perfectly synchronized server time in distributed systems. Misaligned clocks on the servers may result in some funny behavior or can cause a disaster in some use cases.
  • FinTech MTL 2015 big-data highlights
    Finance field as 2 choices: self-disruption or be disrupted. Highlights about why and opportunities/chanllenge for the big-data community. We will conclude about the projected impact of IOT and market opportunity.
  • The Machine Learning Database by Nicolas Kruchten from Daracratic
    Nicolas Kruchten will present Datacratic’s new built-in-Montreal Big Data product: the Machine Learning Database (MLDB). In this presentation, we will see how MLDB running on a single node can be used to operationalize machine learning over datasets for which a distributed system would normally be the go-to solution.

Finally, you are also welcome to join us for some casual networking, in the same room, after the presentations, followed by a bear at Brasseurs de Montreal.

Please tell your friends and colleagues :) !

 


 

Réservez vos billets ici: http://bdm42.eventbrite.ca

Big Data Montréal vous invite à sa 42e rencontre!

Joignez-vous à nous le mardi 3 novembre à 18h00 pour assister à une conférence, ainsi que pour réseauter avec les autres enthousiastes montréalais du Big Data!

La rencontre aura lieu au Centre Cloud.ca (anciennement le RPM Startup Centre), qui est situé au 420 rue Guy.

Tous sont bienvenus, que vous ayez déjà de l’expérience avec les technologies de Big Data ou que vous soyez simplement curieux d’en apprendre plus.

Nous avons 2 présentations rapides et une présentation complète à l’horaire:

  • Got BigData? Watch your server clock! by Nikolai Grigoriev, Principal Software Engineer at SocialLabs
    Everyone understands NTP and how simply is to run it. Yet, many people do not understand the importance of having perfectly synchronized server time in distributed systems. Misaligned clocks on the servers may result in some funny behavior or can cause a disaster in some use cases.
  • FinTech MTL 2015 big-data highlights
    Finance field as 2 choices: self-disruption or be disrupted. Highlights about why and opportunities/chanllenge for the big-data community. We will conclude about the projected impact of IOT and market opportunity.
  • The Machine Learning Database by Nicolas Kruchten from Daracratic
    Nicolas Kruchten will present Datacratic’s new built-in-Montreal Big Data product: the Machine Learning Database (MLDB). In this presentation, we will see how MLDB running on a single node can be used to operationalize machine learning over datasets for which a distributed system would normally be the go-to solution.

Finalement, vous êtes invités à vous joindre à nous après les présentations, dans la même salle, pour continuer à réseauter, ensuite de quoi nous pourrons aller prendre un verre aux aux Brasseurs de Montréal.

Passez le mot et venez en grand nombre :) !

Big Data Montréal #40 – Tuesday September 1st

Register here: http://bdm40.eventbrite.ca

Big Data Montreal would like to invite you to its fortieth meeting!

Join us on Tuesday September 1st at 6pm to attend a conference, as well as to network with other Big Data enthusiasts from Montreal!

The meeting will take place at the Centre Cloud.ca (formerly RPM Startup Centre), which is located at 420 Guy street.

All are welcome, no matter if you already have some experience with Big Data technologies or if you’re simply curious to learn more.

We have two presentations scheduled so far:

  • How Autodesk Implemented Scalable Eventing Over Mesos by Olivier Paugam, Software Architect at Autodesk (SLIDES)
    The world of Cloud engineering has been rapidly transformed last year with the advent of container technologies. The impact rippled through pretty much any facet of how we deal with complex distributed back-ends while scalability reached new unforeseen peaks. Coming up with a watertight architecture and new software tools is paramount to harnessing what is a huge potential for Autodesk. We will focus on the latest clustering technologies, see how they intersect with containers and finally propose a whole new stack. Cheaper, Bigger and Easier will be the motto of this talk.
  • Apache Phoenix: Bringing SQL and secondary indexes to HBase by Jean-Marc Spaggiari, Senior Solution Architect at Cloudera
    HBase is a well known very efficient and scalable NoSQL Database. However it doesn’t talk SQL and only have a primary index on the Key. Apache Phoenix brings to HBase what is missing. From types to secondary indexes through SQL queries, Phoenix allows you to get data from HBase at the speed of the light with the flexibly of the SQL language. In this presentation, we will go over all the main Phoenix features and we will do a small demonstration of the application.

N.B.: This edition will be in English.

Finally, you are also welcome to join us at the nearby Brasseurs de Montreal, after the presentations, for some casual networking (please use the appropriate ticket so we know how many people to expect).

Please tell your friends and colleagues :) !

P.S.: We are (as always) looking for speakers for future editions of BDM, so if you’re interested in presenting (or if you know people who are), please don’t hesitate to write to us at bdm-admin@googlegroups.com :) !


Big Data Montréal vous invite à sa quarantième édition!

Joignez-vous à nous le mardi 1er septembre à 18h00 pour assister à une conférence, ainsi que pour réseauter avec les autres enthousiastes montréalais du Big Data!

La rencontre aura lieu au Centre Cloud.ca (formerly RPM Startup Centre), qui est situé au 420 rue Guy.

Tous sont bienvenus, que vous ayez déjà de l’expérience avec les technologies de Big Data ou que vous soyez simplement curieux d’en apprendre plus.

Nous avons deux présentations à l’horaire jusqu’à maintenant:

  • How Autodesk Implemented Scalable Eventing Over Mesos by Olivier Paugam, Software Architect at Autodesk (SLIDES)
    The world of Cloud engineering has been rapidly transformed last year with the advent of container technologies. The impact rippled through pretty much any facet of how we deal with complex distributed back-ends while scalability reached new unforeseen peaks. Coming up with a watertight architecture and new software tools is paramount to harnessing what is a huge potential for Autodesk. We will focus on the latest clustering technologies, see how they intersect with containers and finally propose a whole new stack. Cheaper, Bigger and Easier will be the motto of this talk.
  • Apache Phoenix: Bringing SQL and secondary indexes to HBase by Jean-Marc Spaggiari, Senior Solution Architect at Cloudera
    HBase is a well known very efficient and scalable NoSQL Database. However it doesn’t talk SQL and only have a primary index on the Key. Apache Phoenix brings to HBase what is missing. From types to secondary indexes through SQL queries, Phoenix allows you to get data from HBase at the speed of the light with the flexibly of the SQL language. In this presentation, we will go over all the main Phoenix features and we will do a small demonstration of the application.

N.B.: Cette édition sera en anglais.

Finalement, vous êtes invités à vous joindre à nous aux Brasseurs de Montréal, où la soirée se poursuivra après les présentations, pour continuer à réseauter (svp, utiliser le billet approprié pour que nous ayons une idée du nombre de personnes)

Passez le mot et venez en grand nombre :) !

P.S.: Nous sommes (comme toujours) à la recherche de présentateurs pour les éditions futures de BDM, donc n’hésitez pas à nous écrire à bdm-admin@googlegroups.com si vous êtes intéressés à présenter, ou si vous connaissez des gens qui le sont :) !

Big Data Montreal #35 – Tuesday April 7th

Tickets: bdm35.eventbrite.ca

Big Data Montreal would like to invite you to its thirty-fifth meeting!

Join us on Tuesday April 7th at 6pm to attend a conference, as well as to network with other Big Data enthusiasts from Montreal!

The meeting will take place at the Centre Cloud.ca (formerly RPM Startup Centre), which is located at 420 Guy street.

All are welcome, no matter if you already have some experience with Big Data technologies or if you’re simply curious to learn more.

We have 2 flash presentations and 1 full presentation scheduled:

  • Flash presentation on Hive Transform vs Hadoop Streaming by Francis Pieraut
  • Flash presentation on Mediative’s Experience with Databricks Cloud: an Initial Report by Xavier Clements
    Databricks Cloud (DBCloud) promises to be “a single platform for big data processing“ and to make its users’ lives easier in 4 aspects: data visualization, distributed computation, third-party integration and production pipeline creation. At Mediative we have worked with this product for the last 3 months, and in this talk we will report on our experiences thus far. We have found that (1) the cluster management capabilities offered by DBCloud are very useful, (2) data exploration is fast and intuitive (thanks to a complete REPL that allows for short feedback loops and quick visualisations) and, (3) the Databricks support is very professional and timely. In addition to the above we have also found some limitations that undermine the power of the tool. We will present examples of our assertions as well as some avenues of inquiry that we will be exploring in the future.
  • Full presentation on Big Data Warehousing at Shopify by Harry Brundage
    While many companies have focused on powering products with their big data, Shopify has instead focused on building a well modelled data warehouse to power better decisions across it’s rather wild business first. This seems like a terribly uncool thing to do these days: it doesn’t involve much machine learning or advanced math, but it has been wildly successful at Shopify, and laid a fantastic foundation for data scientists to start doing the fancy stuff. Shopify has built a highly structured, clean, and useful warehouse using new ideas like Apache Spark and old ideas like dimensional modelling to achieve technical and organizational scalability. This talk will be about how we model our data, why we think it is the right thing to do, and the hard-won lessons learned in deploying Spark and Hadoop for such a task.

N.B.: This edition will be in English.

There will be food and drinks courtesy of Shopify, our sponsors for this event.

Finally, you are also welcome to join us at the nearby Brasseurs de Montreal, after the presentations, for some casual networking (please use the appropriate ticket so we know how many people to expect).

Please tell your friends and colleagues :) !

P.S.: We are (as always) looking for speakers for future editions of BDM, so if you’re interested in presenting (or if you know people who are), please don’t hesitate to write to us at bdm-admin@googlegroups.com :) !


Billets: bdm35.eventbrite.ca

Big Data Montréal vous invite à sa trente cinquième édition!

Joignez-vous à nous le mardi 7 avril à 18h00 pour assister à une conférence, ainsi que pour réseauter avec les autres enthousiastes montréalais du Big Data!

La rencontre aura lieu au Centre Cloud.ca (formerly RPM Startup Centre), qui est situé au 420 rue Guy.

Tous sont bienvenus, que vous ayez déjà de l’expérience avec les technologies de Big Data ou que vous soyez simplement curieux d’en apprendre plus.

Nous avons 2 présentations courtes à l’horaire, ainsi qu’une présentation longue:

  • Flash presentation on Hive Transform vs Hadoop Streaming by Francis Pieraut
  • Flash presentation on Mediative’s Experience with Databricks Cloud: an Initial Report by Xavier Clements
    Databricks Cloud (DBCloud) promises to be “a single platform for big data processing“ and to make its users’ lives easier in 4 aspects: data visualization, distributed computation, third-party integration and production pipeline creation. At Mediative we have worked with this product for the last 3 months, and in this talk we will report on our experiences thus far. We have found that (1) the cluster management capabilities offered by DBCloud are very useful, (2) data exploration is fast and intuitive (thanks to a complete REPL that allows for short feedback loops and quick visualisations) and, (3) the Databricks support is very professional and timely. In addition to the above we have also found some limitations that undermine the power of the tool. We will present examples of our assertions as well as some avenues of inquiry that we will be exploring in the future.
  • Full presentation on Big Data Warehousing at Shopify by Harry Brundage
    While many companies have focused on powering products with their big data, Shopify has instead focused on building a well modelled data warehouse to power better decisions across it’s rather wild business first. This seems like a terribly uncool thing to do these days: it doesn’t involve much machine learning or advanced math, but it has been wildly successful at Shopify, and laid a fantastic foundation for data scientists to start doing the fancy stuff. Shopify has built a highly structured, clean, and useful warehouse using new ideas like Apache Spark and old ideas like dimensional modelling to achieve technical and organizational scalability. This talk will be about how we model our data, why we think it is the right thing to do, and the hard-won lessons learned in deploying Spark and Hadoop for such a task.

N.B.: Cette édition sera en anglais.

Nous servirons nourriture et breuvages, gracieuseté de Shopify, qui parraine cet événement.

Finalement, vous êtes invités à vous joindre à nous aux Brasseurs de Montréal, où la soirée se poursuivra après les présentations, pour continuer à réseauter (svp, utiliser le billet approprié pour que nous ayons une idée du nombre de personnes)

Passez le mot et venez en grand nombre :) !

P.S.: Nous sommes (comme toujours) à la recherche de présentateurs pour les éditions futures de BDM, donc n’hésitez pas à nous écrire à bdm-admin@googlegroups.com si vous êtes intéressés à présenter, ou si vous connaissez des gens qui le sont :) !

 

appaquet@gmail.com

Big Data Montréal #29 – Tuesday October 7th 6:30pm at the RPM Startup Centre

Tickets: bdm29.eventbrite.ca

Big Data Montreal would like to invite you to its twenty-eight meeting!

Join us on Tuesday October 7th at 6:30pm to attend a conference, as well as to network with other Big Data enthusiasts from Montreal!

The meeting will take place at the RPM Startup Centre, which is located at 420 Guy street.

All are welcome, no matter if you already have some experience with Big Data technologies or if you’re simply curious to learn more.

We have one scheduled presentation:

  • ElasticSearch ELK stack by Colin Surprenant, Software Engineer at ElasticSearch
    By combining the massively popular Elasticsearch, Logstash and Kibana, the Elasticsearch company created an end-to-end stack that delivers actionable insights in real-time from almost any type of structured and unstructured data source. This talk will present the ELK stack and how it can be used to search and analyse data easier than before.

Finally, you are also welcome to join us at the nearby Brasseurs de Montreal, after the presentations, for some casual networking (please use the appropriate ticket so we know how many people to expect).

Please tell your friends and colleagues :) !

 

P.S.: We are (as always) looking for speakers for future editions of BDM, so if you’re interested in presenting (or if you know people who are), please don’t hesitate to write to us at bdm-admin@googlegroups.com :) !


Billets: bdm29.eventbrite.ca

Big Data Montréal vous invite à sa vingt-huitième rencontre!

Joignez-vous à nous le mardi 7 octobre à 18h30 pour assister à une conférence, ainsi que pour réseauter avec les autres enthousiastes montréalais du Big Data!

La rencontre aura lieu au RPM Startup Centre, qui est situé au 420 rue Guy.

Tous sont bienvenus, que vous ayez déjà de l’expérience avec les technologies de Big Data ou que vous soyez simplement curieux d’en apprendre plus.

Nous avons une présentation à l’horaire:

  • ElasticSearch ELK stack by Colin Surprenant, Software Engineer at ElasticSearch
    By combining the massively popular Elasticsearch, Logstash and Kibana, the Elasticsearch company created an end-to-end stack that delivers actionable insights in real-time from almost any type of structured and unstructured data source. This talk will present the ELK stack and how it can be used to search and analyse data easier than before.

Finalement, vous êtes invités à vous joindre à nous aux Brasseurs de Montréal, où la soirée se poursuivra après les présentations, pour continuer à réseauter (svp, utiliser le billet approprié pour que nous ayons une idée du nombre de personnes)

Passez le mot et venez en grand nombre :) !

 

P.S.: Nous sommes (comme toujours) à la recherche de présentateurs pour les éditions futures de BDM, donc n’hésitez pas à nous écrire à bdm-admin@googlegroups.com si vous êtes intéressés à présenter, ou si vous connaissez des gens qui le sont :) !

Big Data Montreal #25 – Tuesday June 3rd 18h30 at the RPM Startup Centre

Tickets: http://bdm25.eventbrite.ca


Big Data Montreal would like to invite you to its twenty-fifth meeting!

Join us on Tuesday June 3rd at 18h30 to attend a conference, as well as to network with other Big Data enthusiasts from Montreal!

The meeting will take place at the RPM Startup Centre, which is located at 420 Guy street.

All are welcome, no matter if you already have some experience with Big Data technologies or if you’re simply curious to learn more.

We have two scheduled presentations:

  • Spark Runtime Internal, by Nan Zhu, Contributor to the Apache Spark project. This talk introduces the Spark system design. It will start from the eco-system and the features of Spark which facilitates the data processing task. Then it will go through the process in which the Spark application is translated from the source code to the distributed tasks across the nodes in the Spark cluster. It will also cover the distributed memory abstraction in Spark at the end of the talk, which enables high-performance and the reliable data processing in Spark. (slides)
  • Migrating to the Berkeley Data Analytics Stack, by Olivier Mirandette, Lead Software Architect at Guavus. Olivier will present how Guavus is planning the migration of a data warehouse to the Berkely Data Analytics Stack (which includes Spark, Spark streaming and Shark).

 

N.B.: This edition of BDM will be in English.

Finally, you are also welcome to join us at the nearby Brasseurs de Montreal, after the presentations, for some casual networking.

Please tell your friends and colleagues :) !

 

 

P.S.: We are (as always) looking for speakers for future editions of BDM, so if you’re interested in presenting (or if you know people who are), please don’t hesitate to write to us at bdm-admin@googlegroups.com :) !


Big Data Montréal vous invite à sa vingt-cinquième rencontre!

Joignez-vous à nous le mardi 3 juin à 18h30 pour assister à une conférence, ainsi que pour réseauter avec les autres enthousiastes montréalais du Big Data!

La rencontre aura lieu au RPM Startup Centre, qui est situé au 420 rue Guy.

Tous sont bienvenus, que vous ayez déjà de l’expérience avec les technologies de Big Data ou que vous soyez simplement curieux d’en apprendre plus.

Il y a une présentation à l’horaire:

  • Spark Runtime Internal, by Nan Zhu, Contributor to the Apache Spark project. This talk introduces the Spark system design. It will start from the eco-system and the features of Spark which facilitates the data processing task. Then it will go through the process in which the Spark application is translated from the source code to the distributed tasks across the nodes in the Spark cluster. It will also cover the distributed memory abstraction in Spark at the end of the talk, which enables high-performance and the reliable data processing in Spark. (slides)
  • Migrating to the Berkeley Data Analytics Stack, by Olivier Mirandette, Lead Software Architect at Guavus. Olivier will present how Guavus is planning the migration of a data warehouse to the Berkely Data Analytics Stack (which includes Spark, Spark streaming and Shark).

 

N.B.: Cette édition de BDM sera en anglais.

Finalement, vous êtes invités à vous joindre à nous aux Brasseurs de Montréal, où la soirée se poursuivra après les présentations, pour continuer à réseauter.

Passez le mot et venez en grand nombre :) !

 

 

P.S.: Nous sommes (comme toujours) à la recherche de présentateurs pour les éditions futures de BDM, donc n’hésitez pas à nous écrire à bdm-admin@googlegroups.com si vous êtes intéressés à présenter, ou si vous connaissez des gens qui le sont :) !