Big Data Montreal #48

Inscrivez-vous ici / Sign-up here: https://www.meetup.com/Big-Data-Montreal/events/233197738/

Big Data Montreal would like to invite you to its 48th meeting!

Join us on Tuesday August 16th 2016 at 6:00PM to attend a conference, as well as to network with other Big Data enthusiasts from Montreal!

All are welcome, no matter if you already have some experience with Big Data technologies or if you’re simply curious to learn more.

We have 1 presentation scheduled:

• Now that I’ve crunched some Big Data, how can I serve it online? 
By Felix GV from LinkedIn’s Data Infrastructure.

You’ve deployed Hadoop, some ETL pipeline, a job scheduler and crunched a bunch of data, what are you going to do with the fruit of your labor? If you intend to serve this data back to your online users, then you might need a derived data serving system such as Voldemort Read-Only. Voldemort is a key value store capable of bulk loading large data sets from Hadoop. It has been battle tested at LinkedIn for many years, and it currently ingests more than 100 TB of freshly computed data every day while serving requests at sub millisecond latency. In this talk, Felix will tell you about the recent improvements that went into Voldemort, as well as a preview of the future of derived data serving at LinkedIn.

Finally, you are also welcome to join us for some casual networking, in the same room, after the presentations, followed by a beer in a close by location.

Please tell your friends and colleagues :) !

=====================================

Big Data Montréal vous invite à sa 48ième rencontre!

Joignez-vous à nous le mardi 16 août 2016 à 18h00 pour assister à une conférence, ainsi que pour réseauter avec les autres enthousiastes montréalais du Big Data!

Tous sont bienvenus, que vous ayez déjà de l’expérience avec les technologies de Big Data ou que vous soyez simplement curieux d’en apprendre plus.

Nous avons 1 présentation complète à l’horaire:

• Now that I’ve crunched some Big Data, how can I serve it online? 
By Felix GV from LinkedIn’s Data Infrastructure.

You’ve deployed Hadoop, some ETL pipeline, a job scheduler and crunched a bunch of data, what are you going to do with the fruit of your labor? If you intend to serve this data back to your online users, then you might need a derived data serving system such as Voldemort Read-Only. Voldemort is a key value store capable of bulk loading large data sets from Hadoop. It has been battle tested at LinkedIn for many years, and it currently ingests more than 100 TB of freshly computed data every day while serving requests at sub millisecond latency. In this talk, Felix will tell you about the recent improvements that went into Voldemort, as well as a preview of the future of derived data serving at LinkedIn.

Finalement, vous êtes invités à vous joindre à nous après les présentations, dans la même salle, pour continuer à réseauter, ensuite de quoi nous pourrons aller prendre un verre proche.

Passez le mot et venez en grand nombre :) !

Big Data Montréal #45 – Roaring bitmaps and HBase

Register here: http://www.meetup.com/Big-Data-Montreal/events/227426150/

Big Data Montreal would like to invite you to its 45th meeting!

Join us on Tuesday February 2nd 2016 at 18h30 to attend a conference, as well as to network with other Big Data enthusiasts from Montreal!

The meeting will take place at the Cloud.ca Center (formerly RPM Startup Centre), which is located at 420 Guy street.

All are welcome, no matter if you already have some experience with Big Data technologies or if you’re simply curious to learn more.

We have one presentation scheduled (and a few flash talks):

  • Achieving low latency for exact population count by Olivier Mirandette, Lead Software Architect at Guavus.In this presentation, we will present a scalable solution to solve a complex problem where functionality, latency, exactitude and volume of data are a challenge. The talk will include presentation of the problem, explanation of two failed proof of concept and the final solution (HBase, Coprocessor, Spark and RoaringBitMap). 

Finally, you are also welcome to join us for some casual networking, in the same room, after the presentations, followed by a bear at Brasseurs de Montreal.

Please tell your friends and colleagues :) !

 


 

Réservez vos billets ici: http://www.meetup.com/Big-Data-Montreal/events/227426150/

Big Data Montréal vous invite à sa 45e rencontre!

Joignez-vous à nous le mardi 2 février 2016 à 18h30 pour assister à une conférence, ainsi que pour réseauter avec les autres enthousiastes montréalais du Big Data!

La rencontre aura lieu au Centre Cloud.ca (anciennement le RPM Startup Centre), qui est situé au 420 rue Guy.

Tous sont bienvenus, que vous ayez déjà de l’expérience avec les technologies de Big Data ou que vous soyez simplement curieux d’en apprendre plus.

Nous avons une présentation complète à l’horaire (et quelques flash-talks):

  • Achieving low latency for exact population count by Olivier Mirandette, Lead Software Architect at Guavus.In this presentation, we will present a scalable solution to solve a complex problem where functionality, latency, exactitude and volume of data are a challenge. The talk will include presentation of the problem, explanation of two failed proof of concept and the final solution (HBase, Coprocessor, Spark and RoaringBitMap). 

Finalement, vous êtes invités à vous joindre à nous après les présentations, dans la même salle, pour continuer à réseauter, ensuite de quoi nous pourrons aller prendre un verre aux aux Brasseurs de Montréal.

Passez le mot et venez en grand nombre :) !

Big Data Montréal #43 – Tuesday December 1st

Register here: http://www.meetup.com/fr/Big-Data-Montreal/events/227066630/

Big Data Montreal would like to invite you to its 43rd meeting!

Join us on Tuesday December 3rd at 18h30 to attend a conference, as well as to network with other Big Data enthusiasts from Montreal!

The meeting will take place at the Cloud.ca Center (formerly RPM Startup Centre), which is located at 420 Guy street.

All are welcome, no matter if you already have some experience with Big Data technologies or if you’re simply curious to learn more.

We have two main presentation scheduled:

  • Hadoop Streaming Architecture in Action to Power Decision Making in Real-time by Dmitry Baev, Manager Solutions Engineering, Canada, Hortonworks Inc.
    A Demonstration of leveraging the power of a common Hadoop Data Lake to power realtime decision making. In this presentation Dmitry will walk through a Streaming Architecture built on Hortonworks Data Platform (HDP) based on a Lambda architecture  — HBase, Kafka, and Storm — to ingest realtime streaming events into the Data Lake thus allowing through YARN for access to the data both Interactively — SQL Queries through Hive on Tez — and realtime analytics through Spark Machine Learning.
  • Get the best from your cluster using YARN by Jean-Louis Quéguiner Senior Big Data Developer Wajam
    Having a big cluster in nice, but are you sure you are using it as efficiently as you can ? Yarn is a map reduce manager, that we help you to schedule, manage the memory and CPU allocated to your Map Reduce jobs. However in a distributed environment a cluster can be composed of different nodes that have different architecture (variable CPU, Memory or even Vendor). We will present how to tune up your cluster using YARN on a real life use case and how appropriate YARN tuning solved pipeline delays following major increase in data flow size.

Finally, you are also welcome to join us for some casual networking, in the same room, after the presentations, followed by a bear at Brasseurs de Montreal.

Please tell your friends and colleagues :) !

 


 

Réservez vos billets ici: http://www.meetup.com/fr/Big-Data-Montreal/events/227066630/

Big Data Montréal vous invite à sa 43e rencontre!

Joignez-vous à nous le mardi 1er décembre à 18h30 pour assister à une conférence, ainsi que pour réseauter avec les autres enthousiastes montréalais du Big Data!

La rencontre aura lieu au Centre Cloud.ca (anciennement le RPM Startup Centre), qui est situé au 420 rue Guy.

Tous sont bienvenus, que vous ayez déjà de l’expérience avec les technologies de Big Data ou que vous soyez simplement curieux d’en apprendre plus.

Nous avons 2 présentations complètes à l’horaire:

  • Hadoop Streaming Architecture in Action to Power Decision Making in Real-time by Dmitry Baev, Manager Solutions Engineering, Canada, Hortonworks Inc.
    A Demonstration of leveraging the power of a common Hadoop Data Lake to power realtime decision making. In this presentation Dmitry will walk through a Streaming Architecture built on Hortonworks Data Platform (HDP) based on a Lambda architecture  — HBase, Kafka, and Storm — to ingest realtime streaming events into the Data Lake thus allowing through YARN for access to the data both Interactively — SQL Queries through Hive on Tez — and realtime analytics through Spark Machine Learning.
  • Get the best from your cluster using YARN by Jean-Louis Quéguiner Senior Big Data Developer Wajam
    Having a big cluster in nice, but are you sure you are using it as efficiently as you can ? Yarn is a map reduce manager, that we help you to schedule, manage the memory and CPU allocated to your Map Reduce jobs. However in a distributed environment a cluster can be composed of different nodes that have different architecture (variable CPU, Memory or even Vendor). We will present how to tune up your cluster using YARN on a real life use case and how appropriate YARN tuning solved pipeline delays following major increase in data flow size.

Finalement, vous êtes invités à vous joindre à nous après les présentations, dans la même salle, pour continuer à réseauter, ensuite de quoi nous pourrons aller prendre un verre aux aux Brasseurs de Montréal.

Passez le mot et venez en grand nombre :) !

Big Data Montréal #42 – Tuesday November 3rd

Register here: http://bdm42.eventbrite.ca

Big Data Montreal would like to invite you to its 42rd meeting!

Join us on Tuesday November 3rd at 18h00 to attend a conference, as well as to network with other Big Data enthusiasts from Montreal!

The meeting will take place at the Cloud.ca Center (formerly RPM Startup Centre), which is located at 420 Guy street.

All are welcome, no matter if you already have some experience with Big Data technologies or if you’re simply curious to learn more.

We have two flash presentations and one main presentation scheduled:

  • Got BigData? Watch your server clock! by Nikolai Grigoriev, Principal Software Engineer at SocialLabs
    Everyone understands NTP and how simply is to run it. Yet, many people do not understand the importance of having perfectly synchronized server time in distributed systems. Misaligned clocks on the servers may result in some funny behavior or can cause a disaster in some use cases.
  • FinTech MTL 2015 big-data highlights
    Finance field as 2 choices: self-disruption or be disrupted. Highlights about why and opportunities/chanllenge for the big-data community. We will conclude about the projected impact of IOT and market opportunity.
  • The Machine Learning Database by Nicolas Kruchten from Daracratic
    Nicolas Kruchten will present Datacratic’s new built-in-Montreal Big Data product: the Machine Learning Database (MLDB). In this presentation, we will see how MLDB running on a single node can be used to operationalize machine learning over datasets for which a distributed system would normally be the go-to solution.

Finally, you are also welcome to join us for some casual networking, in the same room, after the presentations, followed by a bear at Brasseurs de Montreal.

Please tell your friends and colleagues :) !

 


 

Réservez vos billets ici: http://bdm42.eventbrite.ca

Big Data Montréal vous invite à sa 42e rencontre!

Joignez-vous à nous le mardi 3 novembre à 18h00 pour assister à une conférence, ainsi que pour réseauter avec les autres enthousiastes montréalais du Big Data!

La rencontre aura lieu au Centre Cloud.ca (anciennement le RPM Startup Centre), qui est situé au 420 rue Guy.

Tous sont bienvenus, que vous ayez déjà de l’expérience avec les technologies de Big Data ou que vous soyez simplement curieux d’en apprendre plus.

Nous avons 2 présentations rapides et une présentation complète à l’horaire:

  • Got BigData? Watch your server clock! by Nikolai Grigoriev, Principal Software Engineer at SocialLabs
    Everyone understands NTP and how simply is to run it. Yet, many people do not understand the importance of having perfectly synchronized server time in distributed systems. Misaligned clocks on the servers may result in some funny behavior or can cause a disaster in some use cases.
  • FinTech MTL 2015 big-data highlights
    Finance field as 2 choices: self-disruption or be disrupted. Highlights about why and opportunities/chanllenge for the big-data community. We will conclude about the projected impact of IOT and market opportunity.
  • The Machine Learning Database by Nicolas Kruchten from Daracratic
    Nicolas Kruchten will present Datacratic’s new built-in-Montreal Big Data product: the Machine Learning Database (MLDB). In this presentation, we will see how MLDB running on a single node can be used to operationalize machine learning over datasets for which a distributed system would normally be the go-to solution.

Finalement, vous êtes invités à vous joindre à nous après les présentations, dans la même salle, pour continuer à réseauter, ensuite de quoi nous pourrons aller prendre un verre aux aux Brasseurs de Montréal.

Passez le mot et venez en grand nombre :) !

Big Data Montréal #41 – Tuesday October 6th

Register here: http://bdm41.eventbrite.ca

Big Data Montreal would like to invite you to its fortieth meeting!

Join us on Tuesday October 6th at 6pm to attend a conference, as well as to network with other Big Data enthusiasts from Montreal!

The meeting will take place at the Centre Cloud.ca (formerly RPM Startup Centre), which is located at 420 Guy street.

All are welcome, no matter if you already have some experience with Big Data technologies or if you’re simply curious to learn more.

We have two presentations scheduled so far:

  • Apache NiFi: Hortonworks DataFlowI (by Dmitry Baev, Manager Solutions Engineering for Canada). Massive data streams that originate from connected yet disparate sources including sensors, machines, geo-location devices, social feeds, web, clicks, server logs and more, are forming the Internet of Anything (IoAT). This new data paradigm is driving the need for trusted insights from data at the very edge to the data lake and in real-time with full fidelity and security.To derive value and real-time insights, data in motion from IoAT must be treated as dataflows—from source to destination—so that modern analytical applications can collect, conduct and curate the data in a secure, scalable and reliable manner. The Hortonworks DataFlow powered by Apache NiFi, a top-level open source project, enables customers to securely and easily collect, conduct and curate any type of data from any origin. Traditional data at rest as well as real time data in motion can now be blended to provide historical and perishable insights.
  • The Machine Learning Database (by Nicolas Kruchten from Daracratic). Nicolas Kruchten will present Datacratic’s new built-in-Montreal Big Data product: the Machine Learning Database (MLDB). In this presentation, we will see how MLDB running on a single node can be used to operationalize machine learning over datasets for which a distributed system would normally be the go-to solution.

N.B.: This edition will be in English.

 There will be food and drinks served, courtesy of Hortonworks!

Finally, you are also welcome to join us at the nearby Brasseurs de Montreal, after the presentations, for some casual networking (please use the appropriate ticket so we know how many people to expect).

Please tell your friends and colleagues :) !

P.S.: We are (as always) looking for speakers for future editions of BDM, so if you’re interested in presenting (or if you know people who are), please don’t hesitate to write to us at bdm-admin@googlegroups.com :) !


Big Data Montréal vous invite à sa quarante-et-unième édition!

Joignez-vous à nous le mardi 6 octobre à 18h00 pour assister à une conférence, ainsi que pour réseauter avec les autres enthousiastes montréalais du Big Data!

La rencontre aura lieu au Centre Cloud.ca (formerly RPM Startup Centre), qui est situé au 420 rue Guy.

Tous sont bienvenus, que vous ayez déjà de l’expérience avec les technologies de Big Data ou que vous soyez simplement curieux d’en apprendre plus.

Nous avons deux présentations à l’horaire jusqu’à maintenant:

  • Apache NiFi: Hortonworks DataFlowI (by Dmitry Baev, Manager Solutions Engineering for Canada). Massive data streams that originate from connected yet disparate sources including sensors, machines, geo-location devices, social feeds, web, clicks, server logs and more, are forming the Internet of Anything (IoAT). This new data paradigm is driving the need for trusted insights from data at the very edge to the data lake and in real-time with full fidelity and security.To derive value and real-time insights, data in motion from IoAT must be treated as dataflows—from source to destination—so that modern analytical applications can collect, conduct and curate the data in a secure, scalable and reliable manner. The Hortonworks DataFlow powered by Apache NiFi, a top-level open source project, enables customers to securely and easily collect, conduct and curate any type of data from any origin. Traditional data at rest as well as real time data in motion can now be blended to provide historical and perishable insights.
  • The Machine Learning Database (by Nicolas Kruchten from Daracratic). Nicolas Kruchten will present Datacratic’s new built-in-Montreal Big Data product: the Machine Learning Database (MLDB). In this presentation, we will see how MLDB running on a single node can be used to operationalize machine learning over datasets for which a distributed system would normally be the go-to solution.

N.B.: Cette édition sera en anglais.

Il y aura de quoi manger et des boissons de services, gracieuseté de Hortonworks!

Finalement, vous êtes invités à vous joindre à nous aux Brasseurs de Montréal, où la soirée se poursuivra après les présentations, pour continuer à réseauter (svp, utiliser le billet approprié pour que nous ayons une idée du nombre de personnes)

Passez le mot et venez en grand nombre :) !

P.S.: Nous sommes (comme toujours) à la recherche de présentateurs pour les éditions futures de BDM, donc n’hésitez pas à nous écrire à bdm-admin@googlegroups.com si vous êtes intéressés à présenter, ou si vous connaissez des gens qui le sont :) !

Big Data Montréal #40 – Tuesday September 1st

Register here: http://bdm40.eventbrite.ca

Big Data Montreal would like to invite you to its fortieth meeting!

Join us on Tuesday September 1st at 6pm to attend a conference, as well as to network with other Big Data enthusiasts from Montreal!

The meeting will take place at the Centre Cloud.ca (formerly RPM Startup Centre), which is located at 420 Guy street.

All are welcome, no matter if you already have some experience with Big Data technologies or if you’re simply curious to learn more.

We have two presentations scheduled so far:

  • How Autodesk Implemented Scalable Eventing Over Mesos by Olivier Paugam, Software Architect at Autodesk (SLIDES)
    The world of Cloud engineering has been rapidly transformed last year with the advent of container technologies. The impact rippled through pretty much any facet of how we deal with complex distributed back-ends while scalability reached new unforeseen peaks. Coming up with a watertight architecture and new software tools is paramount to harnessing what is a huge potential for Autodesk. We will focus on the latest clustering technologies, see how they intersect with containers and finally propose a whole new stack. Cheaper, Bigger and Easier will be the motto of this talk.
  • Apache Phoenix: Bringing SQL and secondary indexes to HBase by Jean-Marc Spaggiari, Senior Solution Architect at Cloudera
    HBase is a well known very efficient and scalable NoSQL Database. However it doesn’t talk SQL and only have a primary index on the Key. Apache Phoenix brings to HBase what is missing. From types to secondary indexes through SQL queries, Phoenix allows you to get data from HBase at the speed of the light with the flexibly of the SQL language. In this presentation, we will go over all the main Phoenix features and we will do a small demonstration of the application.

N.B.: This edition will be in English.

Finally, you are also welcome to join us at the nearby Brasseurs de Montreal, after the presentations, for some casual networking (please use the appropriate ticket so we know how many people to expect).

Please tell your friends and colleagues :) !

P.S.: We are (as always) looking for speakers for future editions of BDM, so if you’re interested in presenting (or if you know people who are), please don’t hesitate to write to us at bdm-admin@googlegroups.com :) !


Big Data Montréal vous invite à sa quarantième édition!

Joignez-vous à nous le mardi 1er septembre à 18h00 pour assister à une conférence, ainsi que pour réseauter avec les autres enthousiastes montréalais du Big Data!

La rencontre aura lieu au Centre Cloud.ca (formerly RPM Startup Centre), qui est situé au 420 rue Guy.

Tous sont bienvenus, que vous ayez déjà de l’expérience avec les technologies de Big Data ou que vous soyez simplement curieux d’en apprendre plus.

Nous avons deux présentations à l’horaire jusqu’à maintenant:

  • How Autodesk Implemented Scalable Eventing Over Mesos by Olivier Paugam, Software Architect at Autodesk (SLIDES)
    The world of Cloud engineering has been rapidly transformed last year with the advent of container technologies. The impact rippled through pretty much any facet of how we deal with complex distributed back-ends while scalability reached new unforeseen peaks. Coming up with a watertight architecture and new software tools is paramount to harnessing what is a huge potential for Autodesk. We will focus on the latest clustering technologies, see how they intersect with containers and finally propose a whole new stack. Cheaper, Bigger and Easier will be the motto of this talk.
  • Apache Phoenix: Bringing SQL and secondary indexes to HBase by Jean-Marc Spaggiari, Senior Solution Architect at Cloudera
    HBase is a well known very efficient and scalable NoSQL Database. However it doesn’t talk SQL and only have a primary index on the Key. Apache Phoenix brings to HBase what is missing. From types to secondary indexes through SQL queries, Phoenix allows you to get data from HBase at the speed of the light with the flexibly of the SQL language. In this presentation, we will go over all the main Phoenix features and we will do a small demonstration of the application.

N.B.: Cette édition sera en anglais.

Finalement, vous êtes invités à vous joindre à nous aux Brasseurs de Montréal, où la soirée se poursuivra après les présentations, pour continuer à réseauter (svp, utiliser le billet approprié pour que nous ayons une idée du nombre de personnes)

Passez le mot et venez en grand nombre :) !

P.S.: Nous sommes (comme toujours) à la recherche de présentateurs pour les éditions futures de BDM, donc n’hésitez pas à nous écrire à bdm-admin@googlegroups.com si vous êtes intéressés à présenter, ou si vous connaissez des gens qui le sont :) !

Big Data Montréal #39 – Wednesday August 5th

Register here: http://bdm39.eventbrite.ca

Big Data Montreal would like to invite you to its thirthy-ninth meeting!

Join us on Wednesday August 5th at 18h00 to attend a conference, as well as to network with other Big Data enthusiasts from Montreal!

The meeting will take place at the Cloud.ca Center (formerly RPM Startup Centre), which is located at 420 Guy street.

All are welcome, no matter if you already have some experience with Big Data technologies or if you’re simply curious to learn more.

We have two scheduled presentations:

  • HP Vertica BI: Sub-second big data analytics your users and developers can truly appreciate (by Mina Naguib, Technical Director of Platform Engineering at AdGear). Despite how fantastic pigs look with lipstick on and how magical elephants look with wings attached, there remains a large gap between what popular big data stacks offer and what end users demand in terms of reporting agility and speed. Join us to learn how Montreal-based AdGear, an advertising technology company, faced challenges as its data volume increased.  You will hear how AdGear’s data stack evolved to meet these challenges, and how HP Vertica’s architecture and features changed the game. This talk will be in English.
  • How MindGeek’s Ad Network uses Big Data Technologies to push Billions of Impressions per Day (by Olivier H. Beauchesne, Lead Data Scientist at MindGeek). MindGeek’s Ad Network pushes Ads to some of the biggest sites on the Internet. This produces a massive amount of data that must be analysed in real time to adjust bidding patterns, detect fraud, debug issues, and bill customers. To do so, we use standard Open Source technologies such as Kafka, Samza, Hive, etc. In this talk, we’ll present our technical architecture and how it’s being used by our different teams (Data Science, Sales, Monetization and Fraud Detection) at MindGeek.
  • You? We are (as always) looking for speakers for future editions of BDM, so if you’re interested in presenting (or if you know people who are), please don’t hesitate to write to us at bdm-admin@googlegroups.com :) !

There will be food and drinks served, courtesy of HP!

Finally, you are also welcome to join us for some casual networking, in the same room, after the presentations. (For the regulars, we won’t be going to Brasseurs de Montreal).

Please tell your friends and colleagues :) !


Enregistrez-vous ici : http://bdm39.eventbrite.ca

Big Data Montréal vous invite à sa trente-neuvième rencontre!

Joignez-vous à nous le mercredi 5 août à 18h00 pour assister à une conférence, ainsi que pour réseauter avec les autres enthousiastes montréalais du Big Data!

La rencontre aura lieu au Centre Cloud.ca (anciennement le RPM Startup Centre), qui est situé au 420 rue Guy.

Tous sont bienvenus, que vous ayez déjà de l’expérience avec les technologies de Big Data ou que vous soyez simplement curieux d’en apprendre plus.

Il y a deux présentations à l’horaire:

  • HP Vertica BI: Sub-second big data analytics your users and developers can truly appreciate (by Mina Naguib, Technical Director of Platform Engineering at AdGear). Despite how fantastic pigs look with lipstick on and how magical elephants look with wings attached, there remains a large gap between what popular big data stacks offer and what end users demand in terms of reporting agility and speed. Join us to learn how Montreal-based AdGear, an advertising technology company, faced challenges as its data volume increased.  You will hear how AdGear’s data stack evolved to meet these challenges, and how HP Vertica’s architecture and features changed the game. This talk will be in English.
  • How MindGeek’s Ad Network uses Big Data Technologies to push Billions of Impressions per Day (by Olivier H. Beauchesne, Lead Data Scientist at MindGeek). MindGeek’s Ad Network pushes Ads to some of the biggest sites on the Internet. This produces a massive amount of data that must be analysed in real time to adjust bidding patterns, detect fraud, debug issues, and bill customers. To do so, we use standard Open Source technologies such as Kafka, Samza, Hive, etc. In this talk, we’ll present our technical architecture and how it’s being used by our different teams (Data Science, Sales, Monetization and Fraud Detection) at MindGeek.
  • Vous? Nous sommes (comme toujours) à la recherche de présentateurs pour les éditions futures de BDM, donc n’hésitez pas à nous écrire à bdm-admin@googlegroups.com si vous êtes intéressés à présenter, ou si vous connaissez des gens qui le sont :) !

Nous servirons nourriture et breuvages, gracieuseté de HP!

Finalement, vous êtes invités à vous joindre à nous après les présentations, dans la même salle, pour continuer à réseauter. (Pour les habitués, nous n’irons pas aux Brasseurs de Montréal).

Passez le mot et venez en grand nombre :) !

 

Big Data Montréal #38 – Tuesday July 7th

Register here: http://bdm38.eventbrite.ca

Big Data Montreal would like to invite you to its thirty-height meeting!

Join us on Tuesday July 7th at 6pm to attend a conference, as well as to network with other Big Data enthusiasts from Montreal!

The meeting will take place at the Centre Cloud.ca (formerly RPM Startup Centre), which is located at 420 Guy street.

All are welcome, no matter if you already have some experience with Big Data technologies or if you’re simply curious to learn more.

We have one flash-talk and one presentation scheduled:

  • Data Lineage – Overview of Cloudera Navigator, the first fully integrated data management application for Apache Hadoop, by Joon Kim – Cloudera Systems Engineer and Jean-Marc Spaggiari, Cloudera Senior Solution Architect

    Designed to provide all the capabilities to secure, govern, classify and explore the large amount of diverse data that land in Hadoop, Navigator provides three fundamental components for data management:
    - Maintain full audit history of access and manipulation of data in order to meet compliance requirements.
    - Automatic metadata tagging and manual annotation tools.

    - Maintain data lineage as data sets progress in Hadoop providing reliability of data

  • Spark Summit – Olivier Mirandette (Engineering Manager at Guavus), Alain April (ETS teacher) and David Lauzon will do a summary of Spark Summit 2015 in this flash presentation.

    Performance optimization

    New performance interface
    Data frame
    Spark on the market

    IndexedRD

    RadixTree

  • HBaseCon 2015 – What’s new, what’s hot – Jean-Marc Spaggiari, HBase contributor (If time permit).
    Flash presentation to come back on HBaseCon 2015 and present what’s new and what’s hot in last HBase releases.

N.B.: This edition will be in English.

There will be food and drinks courtesy of Cloudera Inc. , our sponsors for this event.

Finally, you are also welcome to join us at the nearby Brasseurs de Montreal, after the presentations, for some casual networking (please use the appropriate ticket so we know how many people to expect).

Please tell your friends and colleagues :) !

P.S.: We are (as always) looking for speakers for future editions of BDM, so if you’re interested in presenting (or if you know people who are), please don’t hesitate to write to us at bdm-admin@googlegroups.com :) !


Enregistrez-vous ici : http://bdm38.eventbrite.ca

Big Data Montréal vous invite à sa trente huitieme édition!

Joignez-vous à nous le mardi 7 juillet à 18h00 pour assister à une conférence, ainsi que pour réseauter avec les autres enthousiastes montréalais du Big Data!

La rencontre aura lieu au Centre Cloud.ca (formerly RPM Startup Centre), qui est situé au 420 rue Guy.

Tous sont bienvenus, que vous ayez déjà de l’expérience avec les technologies de Big Data ou que vous soyez simplement curieux d’en apprendre plus.

Nous avons une présentation-éclair et une présentation étoffée à l’horaire:

 

  • Data Lineage – Overview of Cloudera Navigator, the first fully integrated data management application for Apache Hadoop, by Joon Kim – Cloudera Systems Engineer and Jean-Marc Spaggiari, Cloudera Senior Solution Architect

    Designed to provide all the capabilities to secure, govern, classify and explore the large amount of diverse data that land in Hadoop, Navigator provides three fundamental components for data management:
    - Maintain full audit history of access and manipulation of data in order to meet compliance requirements.
    - Automatic metadata tagging and manual annotation tools.

    - Maintain data lineage as data sets progress in Hadoop providing reliability of data

  • Spark Summit – Olivier Mirandette (Engineering Manager at Guavus), Alain April (ETS teacher) and David Lauzon will do a summary of Spark Summit 2015 in this flash presentation.

    Performance optimization

    New performance interface
    Data frame
    Spark on the market

    IndexedRD

    RadixTree

  • HBaseCon 2015 – What’s new, what’s hot – Jean-Marc Spaggiari, HBase contributor (If time permit).
    Flash presentation to come back on HBaseCon 2015 and present what’s new and what’s hot in last HBase releases.

N.B.: Cette édition sera en anglais.

Nous servirons nourriture et breuvages, gracieuseté de Cloudera Inc. , qui parraine cet événement.

Finalement, vous êtes invités à vous joindre à nous aux Brasseurs de Montréal, où la soirée se poursuivra après les présentations, pour continuer à réseauter (svp, utiliser le billet approprié pour que nous ayons une idée du nombre de personnes)

Passez le mot et venez en grand nombre :) !

P.S.: Nous sommes (comme toujours) à la recherche de présentateurs pour les éditions futures de BDM, donc n’hésitez pas à nous écrire à bdm-admin@googlegroups.com si vous êtes intéressés à présenter, ou si vous connaissez des gens qui le sont :) !

Big Data Montréal #37 – Tuesday June 2nd

Big Data Montreal would like to invite you to its thirty-sixth meeting!

Join us on Tuesday June 2nd at 6pm to attend a conference, as well as to network with other Big Data enthusiasts from Montreal!

The meeting will take place at the Centre Cloud.ca (formerly RPM Startup Centre), which is located at 420 Guy street.

All are welcome, no matter if you already have some experience with Big Data technologies or if you’re simply curious to learn more.

We have one flash-talk and one presentation scheduled:

  • Hadoop in production – the war stories by Nikolaï Grigoriev, Principal Software Engineer, SociableLabs
    Sharing of Hadoop cluster deployment experience in production from scratch on real hardware. Brief overview of Hadoop stack, its components, major deployment and configuration challenges, performance tuning and application tuning experience. Some “war stories” about the issues we have faced while operating, the benefits of DevOps approach for running Hadoop apps. (SLIDES)
  • Scaling an API proxy in OCaml by Simon Grondin, ÉTS undergrad student, backend engineer at Mashape, creator of http://unblock.us.org and core developer of DNSChain.
    How I scaled up (not out!) a fault tolerant, high performance proxy built in OCaml. How the language helped, what worked, what didn’t and the lessons learned

N.B.: This edition will be in English.

Finally, you are also welcome to join us at the nearby Brasseurs de Montreal, after the presentations, for some casual networking (please use the appropriate ticket so we know how many people to expect).

Please tell your friends and colleagues :) !

P.S.: We are (as always) looking for speakers for future editions of BDM, so if you’re interested in presenting (or if you know people who are), please don’t hesitate to write to us at bdm-admin@googlegroups.com :) !


Big Data Montréal vous invite à sa trente sixième édition!

Joignez-vous à nous le mardi 2 juin à 18h00 pour assister à une conférence, ainsi que pour réseauter avec les autres enthousiastes montréalais du Big Data!

La rencontre aura lieu au Centre Cloud.ca (formerly RPM Startup Centre), qui est situé au 420 rue Guy.

Tous sont bienvenus, que vous ayez déjà de l’expérience avec les technologies de Big Data ou que vous soyez simplement curieux d’en apprendre plus.

Nous avons une présentation-éclair et une présentation étoffée à l’horaire:

  • Hadoop in production – the war stories by Nikolaï Grigoriev, Principal Software Engineer, SociableLabs
    Sharing of Hadoop cluster deployment experience in production from scratch on real hardware. Brief overview of Hadoop stack, its components, major deployment and configuration challenges, performance tuning and application tuning experience. Some “war stories” about the issues we have faced while operating, the benefits of DevOps approach for running Hadoop apps. (SLIDES)
  • Scaling an API proxy in OCaml by Simon Grondin, ÉTS undergrad student, backend engineer at Mashape, creator of http://unblock.us.org and core developer of DNSChain.
    How I scaled up (not out!) a fault tolerant, high performance proxy built in OCaml. How the language helped, what worked, what didn’t and the lessons learned

N.B.: Cette édition sera en anglais.

Finalement, vous êtes invités à vous joindre à nous aux Brasseurs de Montréal, où la soirée se poursuivra après les présentations, pour continuer à réseauter (svp, utiliser le billet approprié pour que nous ayons une idée du nombre de personnes)

Passez le mot et venez en grand nombre :) !

P.S.: Nous sommes (comme toujours) à la recherche de présentateurs pour les éditions futures de BDM, donc n’hésitez pas à nous écrire à bdm-admin@googlegroups.com si vous êtes intéressés à présenter, ou si vous connaissez des gens qui le sont :) !

Big Data Montréal #36 – Tuesday May 5th

Register here: bdm36.eventbrite.ca

Big Data Montreal would like to invite you to its thirty-sixth meeting!

Join us on Tuesday May 5th at 6pm to attend a conference, as well as to network with other Big Data enthusiasts from Montreal!

The meeting will take place at the Centre Cloud.ca (formerly RPM Startup Centre), which is located at 420 Guy street.

All are welcome, no matter if you already have some experience with Big Data technologies or if you’re simply curious to learn more.

We have one presentation scheduled:

  • NoSQL, Hadoop, and Beyond the Hype by Craig Vitter
    Hadoop and NoSQL. They are similar only in that they are both classified as software and do not use SQL. It is often helpful when learning about new technology to juxtapose two things side by side to better understand the differences, the job they are trying to get done, and how they often are seen fitting together within the wild. This talk is an architect’s perspective of what these technologies solve, tools that make using them easier to live with, and suggestions for how to fit them together within your business.

N.B.: This edition will be in English.

There will be food and drinks courtesy of Basho, our sponsors for this event.

Finally, you are also welcome to join us at the nearby Brasseurs de Montreal, after the presentations, for some casual networking (please use the appropriate ticket so we know how many people to expect).

Please tell your friends and colleagues :) !

P.S.: We are (as always) looking for speakers for future editions of BDM, so if you’re interested in presenting (or if you know people who are), please don’t hesitate to write to us at bdm-admin@googlegroups.com :) !


Inscrivez-vous ici: bdm36.eventbrite.ca

Big Data Montréal vous invite à sa trente sixième édition!

Joignez-vous à nous le mardi 5 mai à 18h00 pour assister à une conférence, ainsi que pour réseauter avec les autres enthousiastes montréalais du Big Data!

La rencontre aura lieu au Centre Cloud.ca (formerly RPM Startup Centre), qui est situé au 420 rue Guy.

Tous sont bienvenus, que vous ayez déjà de l’expérience avec les technologies de Big Data ou que vous soyez simplement curieux d’en apprendre plus.

Nous avons une présentation à l’horaire:

  • NoSQL, Hadoop, and Beyond the Hype by Craig Vitter
    Hadoop and NoSQL. They are similar only in that they are both classified as software and do not use SQL. It is often helpful when learning about new technology to juxtapose two things side by side to better understand the differences, the job they are trying to get done, and how they often are seen fitting together within the wild. This talk is an architect’s perspective of what these technologies solve, tools that make using them easier to live with, and suggestions for how to fit them together within your business.

N.B.: Cette édition sera en anglais.

Nous servirons nourriture et breuvages, gracieuseté de Basho, qui parraine cet événement.

Finalement, vous êtes invités à vous joindre à nous aux Brasseurs de Montréal, où la soirée se poursuivra après les présentations, pour continuer à réseauter (svp, utiliser le billet approprié pour que nous ayons une idée du nombre de personnes)

Passez le mot et venez en grand nombre :) !

P.S.: Nous sommes (comme toujours) à la recherche de présentateurs pour les éditions futures de BDM, donc n’hésitez pas à nous écrire à bdm-admin@googlegroups.com si vous êtes intéressés à présenter, ou si vous connaissez des gens qui le sont :) !