Big Data Montréal #51

Register here: https://www.meetup.com/fr-FR/Big-Data-Montreal/events/237551533

Big Data Montreal would like to invite you to its 51st meeting!

Join us on Monday February 13th 2017 at 18h00 to attend a conference, as well as to network with other Big Data enthusiasts from Montreal!

The meeting will take place at the Crew Collective & Cafe 360 rue St. Jacques, Montréal, QC

All are welcome, no matter if you already have some experience with Big Data technologies or if you’re simply curious to learn more.

We have one presentation scheduled (and a few flash talks):

  • Learn firsthand how Cloudera Accelerates Customer Success by Enabling Production-Ready Big Data Analytics Across Multiple Cloud and Hybrid Cloud Environments by Doneyli De Jesus, Pre-Sales Engineer at Cloudera

    In significant numbers enterprise companies including Adecco, Airbnb, GoPro, Nielsen, Novantasand others – are running Cloudera Enterprise on public cloud infrastructure. Reasons for deploying in a hybrid, multi-cloud, or single cloud service often include the desire to do the following:
    Reduce the cost associated with purchasing, configuring, and maintaining on-premises hardware required to run big data applications. Increase the ability for data engineers and data analysts to respond to business problems through self-service provisioning. Meet strategic objectives to “move to the cloud” to reduce a company’s owned data center footprint

Unsplash is generously sponsoring this meetup and providing pizza and drinks

Finally, you are also welcome to join us for some casual networking, in the same room, after the presentations, followed by a bear in a closeby location.

Please tell your friends and colleagues :) !

 


 

Réservez vos billets ici:

https://www.meetup.com/fr-FR/Big-Data-Montreal/events/237551533

Big Data Montréal vous invite à sa 51ieme rencontre!

Joignez-vous à nous le lundi 13 février 2017 à 18h00 pour assister à une conférence, ainsi que pour réseauter avec les autres enthousiastes montréalais du Big Data!

La rencontre aura lieu au Crew Collective & Cafe 360 rue St. Jacques, Montréal, QC

Tous sont bienvenus, que vous ayez déjà de l’expérience avec les technologies de Big Data ou que vous soyez simplement curieux d’en apprendre plus.

Nous avons une présentation complète à l’horaire (et quelques flash-talks):

  • Learn firsthand how Cloudera Accelerates Customer Success by Enabling Production-Ready Big Data Analytics Across Multiple Cloud and Hybrid Cloud Environments by Doneyli De Jesus, Pre-Sales Engineer at Cloudera

    In significant numbers enterprise companies including Adecco, Airbnb, GoPro, Nielsen, Novantasand others – are running Cloudera Enterprise on public cloud infrastructure. Reasons for deploying in a hybrid, multi-cloud, or single cloud service often include the desire to do the following:
    Reduce the cost associated with purchasing, configuring, and maintaining on-premises hardware required to run big data applications. Increase the ability for data engineers and data analysts to respond to business problems through self-service provisioning. Meet strategic objectives to “move to the cloud” to reduce a company’s owned data center footprint

Finalement, vous êtes invités à vous joindre à nous après les présentations, dans la même salle, pour continuer à réseauter, ensuite de quoi nous pourrons aller prendre un verre.

Passez le mot et venez en grand nombre :) !

Big Data Montréal #43 – Tuesday December 1st

Register here: http://www.meetup.com/fr/Big-Data-Montreal/events/227066630/

Big Data Montreal would like to invite you to its 43rd meeting!

Join us on Tuesday December 3rd at 18h30 to attend a conference, as well as to network with other Big Data enthusiasts from Montreal!

The meeting will take place at the Cloud.ca Center (formerly RPM Startup Centre), which is located at 420 Guy street.

All are welcome, no matter if you already have some experience with Big Data technologies or if you’re simply curious to learn more.

We have two main presentation scheduled:

  • Hadoop Streaming Architecture in Action to Power Decision Making in Real-time by Dmitry Baev, Manager Solutions Engineering, Canada, Hortonworks Inc.
    A Demonstration of leveraging the power of a common Hadoop Data Lake to power realtime decision making. In this presentation Dmitry will walk through a Streaming Architecture built on Hortonworks Data Platform (HDP) based on a Lambda architecture  — HBase, Kafka, and Storm — to ingest realtime streaming events into the Data Lake thus allowing through YARN for access to the data both Interactively — SQL Queries through Hive on Tez — and realtime analytics through Spark Machine Learning.
  • Get the best from your cluster using YARN by Jean-Louis Quéguiner Senior Big Data Developer Wajam
    Having a big cluster in nice, but are you sure you are using it as efficiently as you can ? Yarn is a map reduce manager, that we help you to schedule, manage the memory and CPU allocated to your Map Reduce jobs. However in a distributed environment a cluster can be composed of different nodes that have different architecture (variable CPU, Memory or even Vendor). We will present how to tune up your cluster using YARN on a real life use case and how appropriate YARN tuning solved pipeline delays following major increase in data flow size.

Finally, you are also welcome to join us for some casual networking, in the same room, after the presentations, followed by a bear at Brasseurs de Montreal.

Please tell your friends and colleagues :) !

 


 

Réservez vos billets ici: http://www.meetup.com/fr/Big-Data-Montreal/events/227066630/

Big Data Montréal vous invite à sa 43e rencontre!

Joignez-vous à nous le mardi 1er décembre à 18h30 pour assister à une conférence, ainsi que pour réseauter avec les autres enthousiastes montréalais du Big Data!

La rencontre aura lieu au Centre Cloud.ca (anciennement le RPM Startup Centre), qui est situé au 420 rue Guy.

Tous sont bienvenus, que vous ayez déjà de l’expérience avec les technologies de Big Data ou que vous soyez simplement curieux d’en apprendre plus.

Nous avons 2 présentations complètes à l’horaire:

  • Hadoop Streaming Architecture in Action to Power Decision Making in Real-time by Dmitry Baev, Manager Solutions Engineering, Canada, Hortonworks Inc.
    A Demonstration of leveraging the power of a common Hadoop Data Lake to power realtime decision making. In this presentation Dmitry will walk through a Streaming Architecture built on Hortonworks Data Platform (HDP) based on a Lambda architecture  — HBase, Kafka, and Storm — to ingest realtime streaming events into the Data Lake thus allowing through YARN for access to the data both Interactively — SQL Queries through Hive on Tez — and realtime analytics through Spark Machine Learning.
  • Get the best from your cluster using YARN by Jean-Louis Quéguiner Senior Big Data Developer Wajam
    Having a big cluster in nice, but are you sure you are using it as efficiently as you can ? Yarn is a map reduce manager, that we help you to schedule, manage the memory and CPU allocated to your Map Reduce jobs. However in a distributed environment a cluster can be composed of different nodes that have different architecture (variable CPU, Memory or even Vendor). We will present how to tune up your cluster using YARN on a real life use case and how appropriate YARN tuning solved pipeline delays following major increase in data flow size.

Finalement, vous êtes invités à vous joindre à nous après les présentations, dans la même salle, pour continuer à réseauter, ensuite de quoi nous pourrons aller prendre un verre aux aux Brasseurs de Montréal.

Passez le mot et venez en grand nombre :) !

Big Data Montréal #41 – Tuesday October 6th

Register here: http://bdm41.eventbrite.ca

Big Data Montreal would like to invite you to its fortieth meeting!

Join us on Tuesday October 6th at 6pm to attend a conference, as well as to network with other Big Data enthusiasts from Montreal!

The meeting will take place at the Centre Cloud.ca (formerly RPM Startup Centre), which is located at 420 Guy street.

All are welcome, no matter if you already have some experience with Big Data technologies or if you’re simply curious to learn more.

We have two presentations scheduled so far:

  • Apache NiFi: Hortonworks DataFlowI (by Dmitry Baev, Manager Solutions Engineering for Canada). Massive data streams that originate from connected yet disparate sources including sensors, machines, geo-location devices, social feeds, web, clicks, server logs and more, are forming the Internet of Anything (IoAT). This new data paradigm is driving the need for trusted insights from data at the very edge to the data lake and in real-time with full fidelity and security.To derive value and real-time insights, data in motion from IoAT must be treated as dataflows—from source to destination—so that modern analytical applications can collect, conduct and curate the data in a secure, scalable and reliable manner. The Hortonworks DataFlow powered by Apache NiFi, a top-level open source project, enables customers to securely and easily collect, conduct and curate any type of data from any origin. Traditional data at rest as well as real time data in motion can now be blended to provide historical and perishable insights.
  • The Machine Learning Database (by Nicolas Kruchten from Daracratic). Nicolas Kruchten will present Datacratic’s new built-in-Montreal Big Data product: the Machine Learning Database (MLDB). In this presentation, we will see how MLDB running on a single node can be used to operationalize machine learning over datasets for which a distributed system would normally be the go-to solution.

N.B.: This edition will be in English.

 There will be food and drinks served, courtesy of Hortonworks!

Finally, you are also welcome to join us at the nearby Brasseurs de Montreal, after the presentations, for some casual networking (please use the appropriate ticket so we know how many people to expect).

Please tell your friends and colleagues :) !

P.S.: We are (as always) looking for speakers for future editions of BDM, so if you’re interested in presenting (or if you know people who are), please don’t hesitate to write to us at bdm-admin@googlegroups.com :) !


Big Data Montréal vous invite à sa quarante-et-unième édition!

Joignez-vous à nous le mardi 6 octobre à 18h00 pour assister à une conférence, ainsi que pour réseauter avec les autres enthousiastes montréalais du Big Data!

La rencontre aura lieu au Centre Cloud.ca (formerly RPM Startup Centre), qui est situé au 420 rue Guy.

Tous sont bienvenus, que vous ayez déjà de l’expérience avec les technologies de Big Data ou que vous soyez simplement curieux d’en apprendre plus.

Nous avons deux présentations à l’horaire jusqu’à maintenant:

  • Apache NiFi: Hortonworks DataFlowI (by Dmitry Baev, Manager Solutions Engineering for Canada). Massive data streams that originate from connected yet disparate sources including sensors, machines, geo-location devices, social feeds, web, clicks, server logs and more, are forming the Internet of Anything (IoAT). This new data paradigm is driving the need for trusted insights from data at the very edge to the data lake and in real-time with full fidelity and security.To derive value and real-time insights, data in motion from IoAT must be treated as dataflows—from source to destination—so that modern analytical applications can collect, conduct and curate the data in a secure, scalable and reliable manner. The Hortonworks DataFlow powered by Apache NiFi, a top-level open source project, enables customers to securely and easily collect, conduct and curate any type of data from any origin. Traditional data at rest as well as real time data in motion can now be blended to provide historical and perishable insights.
  • The Machine Learning Database (by Nicolas Kruchten from Daracratic). Nicolas Kruchten will present Datacratic’s new built-in-Montreal Big Data product: the Machine Learning Database (MLDB). In this presentation, we will see how MLDB running on a single node can be used to operationalize machine learning over datasets for which a distributed system would normally be the go-to solution.

N.B.: Cette édition sera en anglais.

Il y aura de quoi manger et des boissons de services, gracieuseté de Hortonworks!

Finalement, vous êtes invités à vous joindre à nous aux Brasseurs de Montréal, où la soirée se poursuivra après les présentations, pour continuer à réseauter (svp, utiliser le billet approprié pour que nous ayons une idée du nombre de personnes)

Passez le mot et venez en grand nombre :) !

P.S.: Nous sommes (comme toujours) à la recherche de présentateurs pour les éditions futures de BDM, donc n’hésitez pas à nous écrire à bdm-admin@googlegroups.com si vous êtes intéressés à présenter, ou si vous connaissez des gens qui le sont :) !

Big Data Montréal #38 – Tuesday July 7th

Register here: http://bdm38.eventbrite.ca

Big Data Montreal would like to invite you to its thirty-height meeting!

Join us on Tuesday July 7th at 6pm to attend a conference, as well as to network with other Big Data enthusiasts from Montreal!

The meeting will take place at the Centre Cloud.ca (formerly RPM Startup Centre), which is located at 420 Guy street.

All are welcome, no matter if you already have some experience with Big Data technologies or if you’re simply curious to learn more.

We have one flash-talk and one presentation scheduled:

  • Data Lineage – Overview of Cloudera Navigator, the first fully integrated data management application for Apache Hadoop, by Joon Kim – Cloudera Systems Engineer and Jean-Marc Spaggiari, Cloudera Senior Solution Architect

    Designed to provide all the capabilities to secure, govern, classify and explore the large amount of diverse data that land in Hadoop, Navigator provides three fundamental components for data management:
    - Maintain full audit history of access and manipulation of data in order to meet compliance requirements.
    - Automatic metadata tagging and manual annotation tools.

    - Maintain data lineage as data sets progress in Hadoop providing reliability of data

  • Spark Summit – Olivier Mirandette (Engineering Manager at Guavus), Alain April (ETS teacher) and David Lauzon will do a summary of Spark Summit 2015 in this flash presentation.

    Performance optimization

    New performance interface
    Data frame
    Spark on the market

    IndexedRD

    RadixTree

  • HBaseCon 2015 – What’s new, what’s hot – Jean-Marc Spaggiari, HBase contributor (If time permit).
    Flash presentation to come back on HBaseCon 2015 and present what’s new and what’s hot in last HBase releases.

N.B.: This edition will be in English.

There will be food and drinks courtesy of Cloudera Inc. , our sponsors for this event.

Finally, you are also welcome to join us at the nearby Brasseurs de Montreal, after the presentations, for some casual networking (please use the appropriate ticket so we know how many people to expect).

Please tell your friends and colleagues :) !

P.S.: We are (as always) looking for speakers for future editions of BDM, so if you’re interested in presenting (or if you know people who are), please don’t hesitate to write to us at bdm-admin@googlegroups.com :) !


Enregistrez-vous ici : http://bdm38.eventbrite.ca

Big Data Montréal vous invite à sa trente huitieme édition!

Joignez-vous à nous le mardi 7 juillet à 18h00 pour assister à une conférence, ainsi que pour réseauter avec les autres enthousiastes montréalais du Big Data!

La rencontre aura lieu au Centre Cloud.ca (formerly RPM Startup Centre), qui est situé au 420 rue Guy.

Tous sont bienvenus, que vous ayez déjà de l’expérience avec les technologies de Big Data ou que vous soyez simplement curieux d’en apprendre plus.

Nous avons une présentation-éclair et une présentation étoffée à l’horaire:

 

  • Data Lineage – Overview of Cloudera Navigator, the first fully integrated data management application for Apache Hadoop, by Joon Kim – Cloudera Systems Engineer and Jean-Marc Spaggiari, Cloudera Senior Solution Architect

    Designed to provide all the capabilities to secure, govern, classify and explore the large amount of diverse data that land in Hadoop, Navigator provides three fundamental components for data management:
    - Maintain full audit history of access and manipulation of data in order to meet compliance requirements.
    - Automatic metadata tagging and manual annotation tools.

    - Maintain data lineage as data sets progress in Hadoop providing reliability of data

  • Spark Summit – Olivier Mirandette (Engineering Manager at Guavus), Alain April (ETS teacher) and David Lauzon will do a summary of Spark Summit 2015 in this flash presentation.

    Performance optimization

    New performance interface
    Data frame
    Spark on the market

    IndexedRD

    RadixTree

  • HBaseCon 2015 – What’s new, what’s hot – Jean-Marc Spaggiari, HBase contributor (If time permit).
    Flash presentation to come back on HBaseCon 2015 and present what’s new and what’s hot in last HBase releases.

N.B.: Cette édition sera en anglais.

Nous servirons nourriture et breuvages, gracieuseté de Cloudera Inc. , qui parraine cet événement.

Finalement, vous êtes invités à vous joindre à nous aux Brasseurs de Montréal, où la soirée se poursuivra après les présentations, pour continuer à réseauter (svp, utiliser le billet approprié pour que nous ayons une idée du nombre de personnes)

Passez le mot et venez en grand nombre :) !

P.S.: Nous sommes (comme toujours) à la recherche de présentateurs pour les éditions futures de BDM, donc n’hésitez pas à nous écrire à bdm-admin@googlegroups.com si vous êtes intéressés à présenter, ou si vous connaissez des gens qui le sont :) !

Big Data Montréal #37 – Tuesday June 2nd

Big Data Montreal would like to invite you to its thirty-sixth meeting!

Join us on Tuesday June 2nd at 6pm to attend a conference, as well as to network with other Big Data enthusiasts from Montreal!

The meeting will take place at the Centre Cloud.ca (formerly RPM Startup Centre), which is located at 420 Guy street.

All are welcome, no matter if you already have some experience with Big Data technologies or if you’re simply curious to learn more.

We have one flash-talk and one presentation scheduled:

  • Hadoop in production – the war stories by Nikolaï Grigoriev, Principal Software Engineer, SociableLabs
    Sharing of Hadoop cluster deployment experience in production from scratch on real hardware. Brief overview of Hadoop stack, its components, major deployment and configuration challenges, performance tuning and application tuning experience. Some “war stories” about the issues we have faced while operating, the benefits of DevOps approach for running Hadoop apps. (SLIDES)
  • Scaling an API proxy in OCaml by Simon Grondin, ÉTS undergrad student, backend engineer at Mashape, creator of http://unblock.us.org and core developer of DNSChain.
    How I scaled up (not out!) a fault tolerant, high performance proxy built in OCaml. How the language helped, what worked, what didn’t and the lessons learned

N.B.: This edition will be in English.

Finally, you are also welcome to join us at the nearby Brasseurs de Montreal, after the presentations, for some casual networking (please use the appropriate ticket so we know how many people to expect).

Please tell your friends and colleagues :) !

P.S.: We are (as always) looking for speakers for future editions of BDM, so if you’re interested in presenting (or if you know people who are), please don’t hesitate to write to us at bdm-admin@googlegroups.com :) !


Big Data Montréal vous invite à sa trente sixième édition!

Joignez-vous à nous le mardi 2 juin à 18h00 pour assister à une conférence, ainsi que pour réseauter avec les autres enthousiastes montréalais du Big Data!

La rencontre aura lieu au Centre Cloud.ca (formerly RPM Startup Centre), qui est situé au 420 rue Guy.

Tous sont bienvenus, que vous ayez déjà de l’expérience avec les technologies de Big Data ou que vous soyez simplement curieux d’en apprendre plus.

Nous avons une présentation-éclair et une présentation étoffée à l’horaire:

  • Hadoop in production – the war stories by Nikolaï Grigoriev, Principal Software Engineer, SociableLabs
    Sharing of Hadoop cluster deployment experience in production from scratch on real hardware. Brief overview of Hadoop stack, its components, major deployment and configuration challenges, performance tuning and application tuning experience. Some “war stories” about the issues we have faced while operating, the benefits of DevOps approach for running Hadoop apps. (SLIDES)
  • Scaling an API proxy in OCaml by Simon Grondin, ÉTS undergrad student, backend engineer at Mashape, creator of http://unblock.us.org and core developer of DNSChain.
    How I scaled up (not out!) a fault tolerant, high performance proxy built in OCaml. How the language helped, what worked, what didn’t and the lessons learned

N.B.: Cette édition sera en anglais.

Finalement, vous êtes invités à vous joindre à nous aux Brasseurs de Montréal, où la soirée se poursuivra après les présentations, pour continuer à réseauter (svp, utiliser le billet approprié pour que nous ayons une idée du nombre de personnes)

Passez le mot et venez en grand nombre :) !

P.S.: Nous sommes (comme toujours) à la recherche de présentateurs pour les éditions futures de BDM, donc n’hésitez pas à nous écrire à bdm-admin@googlegroups.com si vous êtes intéressés à présenter, ou si vous connaissez des gens qui le sont :) !