Online practical conference about ML, AI and Deep Learning applications

Machine Learning Prague

February 26 – 28, 2021

We can't fix 2020, but a better ML Prague? That we can do!

ML Prague will run online to ensure your attendance is 100% safe. This also allows us to bring you even more practical content! At the same time, this will be the most interactive ML Prague ever, including deeper discussions with our speakers after each talk, mastermind sessions, networking activities with peer experts from around the world, and a hackathon before the conference. Stay tuned for more information on what's coming!

Note: If you registered for ML Prague 2020, your ticket is still valid for our online conference on February 26-28, 2021. You'll find your gift below, under our conference program section.

1000 Attendees
3 Days
45 Speakers
10 Workshops
1 Hackathon

Phenomenal Speakers

Ashish Kapoor

Partner Research Manager, Microsoft

Ashish Kapoor leads the Aerial Robotics and Informatics group at Microsoft, Redmond. Currently, his research focuses on building intelligent and autonomous flying agents that are safe and enable applications that can positively influence our society. The research builds upon cutting-edge research in machine intelligence, robotics and human-centered computation in order to enable an entire fleet of flying robots that range from micro-UAVs to commercial jetliners. Application scenarios include weather sensing, monitoring for precision agriculture, and safe cyber-physical systems. Ashish received his PhD from the MIT Media Laboratory in 2006.

Hava Siegelmann

Professor and Lab director, University of Massachusetts Amherst

Dr. Siegelmann, a recognized expert in Complex Systems and Neural Networks, focuses on theoretical computational neuroscience, computation in and modeling of natural systems and their application to intelligent systems. Of particular research interest are intelligence vis-a-vis adaptive memory, advanced models of cognition, and evolving, intelligent interfaces for robotics and other intelligent systems. Her studies often involve multi-scale modeling and system-level analysis of major disorders such as cancer. The creator of a new field of computer science, Super-Turing computation, Dr. Siegelmann is applying the theory to biological systems and exploring them in connection with a new generation of analog computers.

Haifeng Jin

Software engineer, Google

Haifeng is a member of the Keras team at Google and a PhD candidate in DATA Lab at Texas A&M University. His research interests are AutoML and deep learning. He is the creator and project lead of AutoKeras, which aims to make deep learning more accessible with AutoML techniques.

Tomas Mikolov

Senior Researcher, CIIRC CTU Prague

Tomas Mikolov has been a research scientist at Facebook AI Research since May 2014, where he led the popular fastText project. He is joining CIIRC and the Prague ELLIS unit full-time from April 2020. Previously, he was a member of the Google Brain team, where he developed and implemented efficient algorithms for computing distributed representations of words (word2vec project). He obtained his PhD from Brno University of Technology (Czech Republic) for his work on recurrent neural network based language models (RNNLM project). His long-term research goal is to develop intelligent machines capable of learning to communicate with people using natural language.

Vojta Jína

Privacy Enthusiast, Apple

Vojta is a privacy enthusiast. While at Google, he helped to create AngularJS to simplify web development and make testing easier. These days, he is on a quest to solve machine learning with user privacy in mind, building intelligent products at Apple.

Karthikeyan Natesan Ramamurthy

Research Staff Member, IBM Research AI

Karthikeyan Natesan Ramamurthy is a research staff member at IBM Research. His broad interests include understanding the geometry and topology of high-dimensional data and developing theory and methods for efficiently modeling the data. He has also been intrigued by the interplay between humans, machines, and data and the societal implications of machine learning. He holds a PhD in electrical engineering from Arizona State University.

Serg Masís

Machine Learning Engineer, Syngenta

Serg Masís has been at the confluence of the internet, application development, and analytics for the last two decades. Currently, he's a Climate and Agronomic Data Scientist at Syngenta, a leading agribusiness company with a mission to improve global food security. Before that role, he co-founded a search engine startup, incubated by Harvard Innovation Labs, that combined the power of cloud computing and machine learning with principles in decision-making science to expose users to new places and events efficiently. Whether it pertains to leisure activities, plant diseases, or customer lifetime value, Serg is passionate about providing the often-missing link between data and decision-making — and machine learning interpretation helps bridge this gap more robustly. His book titled "Interpretable Machine Learning with Python" is scheduled to be released in early 2021 by UK-based publisher Packt.

Or Herman-Saffar

Senior Data Scientist, Dell

Or Herman-Saffar is a Senior Data Scientist at Dell. As part of her role, she has designed various data science projects, from exploratory data analysis to the application of machine learning models. She focuses mainly on feature engineering, time-series analysis, and classification models. Or holds an MSc in biomedical engineering, where her research focused on breast cancer detection using breath signals and machine learning algorithms, and a BS in biomedical engineering specializing in signal processing.

Matthieu Cord

Principal scientist, Valeo

Matthieu Cord has been a Full Professor at the Computer Science Laboratory (LIP6) of Sorbonne University, Paris, since 2006. He is also a part-time Principal Scientist at the Valeo.ai research laboratory. He is a laureate of a chair of research and teaching in artificial intelligence from the national French government program on AI 2020, entitled VISA-DEEP: Towards visual reasoning in deep learning. He is an honorary member of the Institut Universitaire de France (junior 2009) and served from 2015 to 2018 as an AI expert at CNRS and the French National Research Agency. His research expertise includes computer vision, machine learning, and artificial intelligence. He is the author of more than 150 international scientific publications on deep learning, computer vision, and multimodal vision and language understanding.

Uri Eliabayev

AI Consultant, Founder, Machine and Deep Learning Israel

Uri Eliabayev is a business consultant in the field of AI. Uri has worked with many consulting companies and organizations and helped them choose and implement the best AI solution for their needs. Moreover, Uri founded the biggest AI community in Israel, called “Machine and Deep Learning Israel”.

Kirill Maiantsev

Senior Data Scientist, Broadcom

Kirill Maiantsev is a Senior Data Scientist at Broadcom working on their AIOps solutions. Kirill and the AIOps machine learning team are focused on building intelligent automation systems that are self-healing with minimal human intervention.

Kirill completed a PhD program in Mathematics and Computer Science at Lomonosov Moscow State University. His studies focused on differential equations, dynamical systems, and optimal control.

During his professional career, Kirill has gathered extensive experience in machine learning and quantitative finance, developing algorithmic trading strategies. Kirill has been with Broadcom since 2019, where he focuses primarily on anomaly detection in time-series data.

Nik Vostrosablin

Machine Learning Engineer, MSD IT

Nik Vostrosablin is a Python/Machine Learning Engineer in the MSD Artificial Intelligence group. In this group, he works mostly on projects related to computational and molecular biology.

Prior to joining MSD, Nik worked mostly in academic research at several universities (Moscow State University, Palacky University in Olomouc, Denmark Technical University).

He holds a master's degree in Physics with honors from Lomonosov Moscow State University and is currently finalizing his PhD in quantum physics.

Jeremy Jonas

Senior Product Manager, McKinsey & Company

Jeremy Jonas oversees ‘KNOW’ Profiles and Expertise Search, the most-used product family at McKinsey & Company, with over 3 million internal profile views annually. These applications show professional profiles and help teams find appropriate colleagues for specific needs, much like an internal LinkedIn.

Working with the Firm’s Prague-based Data Science team, Jeremy oversees the development of innovative ML-driven approaches to enhancing Profiles. This includes suggesting topics of expertise to add to profiles, now being extended into recommending colleagues to the Firm’s many Practices for leadership roles.

He is also overseeing experimentation with feedback-focused chatbots, leading so far to 10x higher feedback rates than any approach previously used with the product family.

Felipe Vianna

Data Science Specialist, McKinsey & Company

Felipe is a Data Scientist engaging with McKinsey internal teams to develop Machine Learning components for their products. He is mainly involved in NLP and retrieval projects, including the development of models for Expert Profiling. Being an engineer, he also takes care of full production deployment and scalability of the models developed.

Filip Dousek

Senior Director of Augmented Analytics, Workday

Filip was the CEO at Stories.bi (Gartner Cool Vendor, acquired by Workday). Now he leads augmented analytics development at Workday. He was previously an SAP Solution Architect and is an analytics pioneer and published author (Flock Without Birds).

Filip Plešinger

Artificial Intelligence and Medical Technologies, Institute of Scientific Instruments of the Czech Academy of Sciences

Filip received his M.Sc. degree (2003) and Ph.D. degree (2008) at the Brno University of Technology. He worked at the company Evektor (2006-2012) and then moved to the Institute of Scientific Instruments of the Czech Academy of Sciences in 2012, where he has worked ever since. He has received several international awards (Boston, USA, 2014; Nice, France, 2015; Rennes, France, 2017) for cardiology-related algorithms and software. Since 2020, he has been the head of the scientific group "Artificial Intelligence and Medical Technologies" at the Medical Signals department, Institute of Scientific Instruments of the CAS, v.v.i.

Tomas Pevny

Consulting Scientist, Avast

Tomas received his PhD in 2008 at the University of Binghamton, SUNY, USA, where he pioneered the use of Machine Learning techniques in Steganography and Steganalysis, for which he received an award from the IEEE Signal Processing Society. After a one-year postdoc in Grenoble, France, he returned to the Artificial Intelligence Center at Czech Technical University, where he extended his interests to machine learning problems in Cybersecurity. He worked closely with the startup Cognitive Security, acquired in 2013 by Cisco Systems Inc. Since September 2019, he has been with Avast and the Artificial Intelligence Center at CTU.

Radovan Parrák

Product owner & ModelOps, Credo

Rado is a seasoned data scientist with a background in quantitative finance. After graduating as a financial economist at Maastricht University, he worked as a number cruncher in data science and quantitative finance departments at banks across Europe. More than ten years later and with dozens of models under his belt, the sheer pain of productionalising them into model-driven applications turned him into a devotee of ModelOps - an emerging field focused on governance and lifecycle of model-driven applications. Rado currently heads the development of Credo Software's ModelOps platform 'YQ'.

Petr Schwarz

CTO and co-founder, Phonexia

Petr Schwarz, PhD, is the CTO and co-founder of Phonexia. He helped to build the well-known research group Speech@FIT at Brno University of Technology, Czech Republic, worked as a researcher at Oregon Graduate Institute in Portland, OR, USA, and founded Phonexia in 2006. He participated in the development of multiple speaker recognition and language identification systems evaluated by the United States National Institute of Standards and Technology. Petr was also a team member on several Johns Hopkins University summer research workshops in the field of human language processing, and he is the co-author of several open source software projects. He has worked on several European, USA, and Czech research projects, and is the author or co-author of dozens of impactful research articles.

Krzysztof Rojek

CTO, byteLAKE

Krzysztof is CTO at byteLAKE and an associate professor at the Czestochowa University of Technology, Poland. He links byteLAKE’s business with the research and academic world. Krzysztof is a huge fan and promoter of ideas that start their life in the research space and eventually land in practical, real-life business applications. He holds PhD and DSc degrees in Computer Science (Parallel Computing, GPGPU, self-adaptable codes, AI applications).

Adam Blažek

CEO and co-founder, Iterait

Adam is the CEO and co-founder of Iterait, a company delivering computer vision AI solutions. As a leader of research teams at IBM and Cognexa, he gained experience primarily in healthcare-oriented projects. Adam has been publishing articles in scientific journals since his university studies at Charles University, where he graduated in Artificial Intelligence & Theoretical Computer Science. His diploma thesis received multiple awards, including from the faculty’s Dean and in the IT SPY competition.

Silvestr Stanko

ML Analytics Team Lead, Qminers

Silvestr Stanko is a Machine Learning Analyst and Team Lead at Qminers, where he mostly focuses on time-series regression in the financial markets. Silvestr has previously worked for a large logistics company, where he led and completed multiple ML and analytical projects, with topics ranging from Natural Language Processing to Operations Research. His research interests include Risk-averse Reinforcement Learning and Optimization.

Paweł Redzyński

Software Engineer, dvc.org

An electronics engineer by education, a software developer by profession, and a deep learning enthusiast at heart. After a few years of software development, Paweł switched to work in the field of data science. He spent one year helping a Warsaw-based startup (Sports Algorithmics and Gaming) with video analysis of football training sessions. Now he is somewhere in between both fields, creating tools for machine learning practitioners at Iterative.ai (creators of dvc.org). When he is not working, he can be found trekking.

Aleš Horák

Associate Professor, Informatics at Masaryk University

Aleš Horák is an Associate Professor of Informatics at Masaryk University, Brno, Czech Republic. His research concentrates on natural language processing, knowledge representation and reasoning, e-lexicography and corpus linguistics.

Adam Rambousek

Research Assistant, Faculty of Informatics at Masaryk University

Adam Rambousek is a Research Assistant at the Faculty of Informatics at Masaryk University, Brno. His main research topics include computational lexicography, corpus linguistics, ontologies, and semantic networks.

David Vrba

Data Scientist, Socialbakers

David works as a data scientist at Socialbakers. He uses Spark on a daily basis for processing data on different scales, from a few GBs up to tens of TBs. He also does query optimizations and helps productionalize various ETL pipelines. David enjoys preparing and delivering Spark trainings and workshops and has already trained several teams in Spark, including data engineers, analysts, and researchers. David received his Ph.D. from Charles University in Prague in 2015.

Václav Pavlín

Architect/Principal Software Engineer, Red Hat

Vašek is now part of the Office of the CTO team at Red Hat, working on the enablement of AI/ML workloads on Kubernetes, where he leads the Open Data Hub project. He has extensive experience with building, deploying and managing containerized applications on OpenShift/Kubernetes. He loves open source and openness as well as meeting new people and arguing about technologies.

Francesco Murdaca

Senior Software Engineer, Red Hat

Francesco is a Senior Data Scientist/Senior Software Engineer at Red Hat working in the AI Centre of Excellence and Office of the CTO. He works on Project Thoth, an open source project that develops tools that enhance the day-to-day life of developers and data scientists using bots and machine learning. He is passionate about AI, space and technologies, all open source. He loves traveling and learning about new cultures.

Michal Pleva

Data Science Team Lead, Dataclair.ai, O2 Czech Republic

Michal is leading a team of data scientists delivering enhanced customer value through effective lifecycle management.

He has 5+ years of expertise in Telecommunications. His team utilizes various techniques of machine learning, mostly deep learning, to build prediction models of customers’ behavior employing event-based data. He is currently a Ph.D. candidate in the field of Economics, connecting social network analysis with utility functions.

Petr Stanislav

Head of Engineering, Dataclair.ai, O2 Czech Republic

Petr's mission is to make the life of the data scientist a little bit easier. He is responsible for the development of the Data and Machine Learning platform in Dataclair.ai, O2 Czech Republic. He also leads the data and machine learning engineering team. Machine learning and data are his passions.

Until the end of 2019, he also served as a researcher and teacher at the Department of Cybernetics of the Faculty of Applied Science. There he worked for more than 8 years on research and development in artificial intelligence, speech technologies, natural language processing, and web technologies.

In June 2020, he successfully defended his Ph.D. in Artificial Intelligence.

Ivan Kasanicky

Data Scientist, SAS

Ivan is a Data Scientist with 9 years of experience. He obtained a Ph.D. degree in Probability and Mathematical Statistics from Charles University. During his career, he has worked on different projects for, e.g., utility, transportation and automotive companies. He is the author of many advanced analytical models, such as a predictive model for highway parking lot occupancy and a renewable energy forecasting model. Ivan joined SAS with a mission to help its customers uncover how modern analytical and AI solutions can speed up their business. He focuses on understanding SAS customers' business needs and on showcasing how these needs and problems can be addressed with SAS advanced solutions.

Jordan Bakerman

Sr. Analytical Training Consultant, SAS

Jordan Bakerman holds a Ph.D. in statistics from North Carolina State University. His dissertation centered on using social media to forecast real-world events, such as civil unrest and influenza rates. As an intern at SAS, Jordan wrote the SAS Programming for R Users course to help students transition efficiently from R to SAS using a cookbook-style approach. As an employee, Jordan has developed courses demonstrating how to integrate open source software within SAS products. He is passionate about statistics, programming, and helping others become better statisticians.

Jo-fai (Joe) Chow

Data Science Evangelist, H2O

Jo-fai (or Joe) has multiple roles (data scientist / evangelist / community manager / customer success manager) at H2O.ai. He is best known as the H2O #360Selfie guy nowadays. On Twitter, he sounds like a die-hard MATLAB fanboy with the handle @matlabulous (because MATLAB was his favourite tool at Uni). Since joining H2O.ai in 2016, Joe has delivered H2O talks/workshops in 40+ cities around Europe, the US, and Asia. He is the organizer of the London Artificial Intelligence & Deep Learning meetup, one of the biggest data science communities in Europe with 9500+ members.

Kevin O'Brien

Data Scientist, Coillte

Kevin O'Brien is Coillte's Forestry Resource Modeller, based in their offices in Limerick. Kevin has been very active in the data science community over the past decade, and is now a director of Python Ireland, the Community lead for Forwards: The R Foundation taskforce on women and other under-represented groups, a European R User Meeting conference committee member, and Social media chair of JuliaCon. He was formerly a Mathematics and Statistics lecturer at the University of Limerick.

Avik Sengupta

VP Engineering, Julia Computing

Avik Sengupta is VP Engineering and head of Julia Computing's European headquarters in London. Avik is the head of product development and software engineering at Julia Computing, contributor to open source Julia and maintainer of several Julia packages. Avik is the author of Julia High Performance, co-founder of two artificial intelligence startups in the financial services sector and creator of large complex trading systems for the world's leading investment banks. Prior to Julia Computing, Avik was co-founder and CTO at AlgoCircle and at Itellix, director at Lab49 and head of algorithmic solutions at Decimal Point Analytics. Avik earned his MS in Computational Finance at Carnegie Mellon and MBA Finance at the Indian Institute of Management in Bangalore.

Jon McLoone

Director of Technical Communication & Strategy, Wolfram Research

Jon McLoone is central to driving the company's technical business strategy and leading the consulting solutions team. With over 25 years of experience working with Wolfram Technologies, Jon has helped in directing software development, system design, technical marketing, corporate policy, business strategies and much more. Jon gives regular keynote appearances and media interviews on topics such as the Future of AI, Enterprise Computation Strategies and Education Reform, across multiple fields including healthcare, fintech and data science. He holds a degree in mathematics from the University of Durham. Jon is also Co-founder and Director of Development for computerbasedmath.org, an organisation dedicated to fundamental reform of maths education and the introduction of computational thinking. The movement is now a worldwide force in re-engineering the STEM curriculum with early projects in Estonia, Sweden and Africa.

Marek "Marx" Grac

Founder, Phalanx

Marek is the founder of Phalanx, a startup focusing on applied research in NLP for Slavic languages. He is a lecturer at Masaryk University, having finished his PhD at the institution, researching cheap and fast data annotation.

Practical & Inspiring Program

Friday
Workshops

09:00 – 12:30

Agile Data Annotation

Room 103

Marek "Marx" Grac, Phalanx

Come join us for our workshop and get hands-on experience with data annotation. The main goal of data annotation in machine learning algorithms is to make the implicit explicit so that the learning process can be improved. Even though many people see data annotation as a mundane task, the process of creating guidelines and processes can be very interesting. In this workshop, you will test various data annotation techniques, mainly application-driven and low-cost approaches. We will also focus on how to measure the quality of the resulting data, as well as test various UX principles and see how much they impact cost-efficiency. Finally, when you get bored of doing the manual part of data annotation yourself, we will go through the basic legal aspects of outsourcing it.

Automatic and Explainable Machine Learning with H2O

Room 106

Jo-fai (Joe) Chow, H2O

The General Data Protection Regulation (GDPR) is now in place. Are you ready to explain your models? This is a hands-on tutorial for beginners. I will demonstrate the use of the open-source H2O platform (https://www.h2o.ai/products/h2o/) with both Python and R for automatic and interpretable machine learning. Participants will be able to follow along and build regression and classification models quickly with H2O's AutoML. They will then be able to explain the model outcomes with various methods.

Machine Learning in Julia

Room 203

Kevin O'Brien, Coillte
Avik Sengupta, Julia Computing

Julia was designed from its conception as a language for high-performance computation that is at the same time highly interactive. To achieve this, Julia is one of the few modern languages that relies on just-in-time (JIT) compilation via LLVM to make its code run as fast as or faster than statically compiled C and Fortran code. Its modern language design has the following features: multiple dispatch, Lisp-like macros, dynamic types, type inference, built-in parallel/distributed computing, lightweight threads, and elegant high-level language constructs. Outline: Introduction to Julia; The Julia Language; Julia in Data Science; Julia Interfacing with Python and R; Machine Learning in Julia; High-performance computing in Julia.

Data Analysis in Big Data Environment with Apache Spark and Python

Room 205

David Vrba, Socialbakers
Peter Vasko, Socialbakers
Jiri Harazim, Databricks

Apache Spark has become a standard for data processing and machine learning in big data environments and is especially popular for its high-level DataFrame API, which allows working with structured data in a very efficient way. In the first part of this workshop, we will get familiar with the DataFrame API of Spark and see some challenges that you might face when processing large datasets. We will explore some advanced optimization techniques and see how to apply them to compose efficient analytical queries. In the second part of the workshop, we will see how Spark can be used for machine learning, and deep learning in particular. We will explore Deep Learning Pipelines, a library that integrates Spark with deep learning frameworks such as TensorFlow and Keras.

Programming the Pepper Robot

Room 206

Aleš Horák, Informatics at Masaryk University
Adam Rambousek, Faculty of Informatics at Masaryk University
Zuzana Nevěřilová, Informatics at Masaryk University
Marek Medved, Informatics at Masaryk University

The social robot by Softbank Robotics called Pepper will be introduced. The robot's hardware capabilities, as well as examples of natural human-machine interaction in English and Czech (which are being developed by the team at FI MU), will be presented in detail, including a tutorial on programming a virtual or a real Pepper robot yourself. The 1.2-m-tall robot is designed for social interactions with people, and it is equipped with an extensive API set to detect faces, mood, or age and to react to their values.

12:30 – 14:00

Lunch

14:00 – 17:30

Zero to AI: Workshop on the Wolfram Language

Room 103

Jon McLoone, Wolfram Research

Designed by Wolfram data science experts, this workshop will provide an introduction to machine learning techniques, illustrated with live, dynamic examples using the Wolfram Language. The workshop will walk you step by step through the basics of machine learning methodologies and techniques and how to apply them using the Wolfram Language. Upon completion, you will come away with enough practical knowledge to immediately use the Wolfram Language for your own machine learning tasks on text, data, or images, including supervised classification and prediction, unsupervised feature identification, sequence prediction, and computer vision.

Developing Autonomous Vehicles with High Fidelity Simulation

Room 106

Ashish Kapoor, Microsoft

High-fidelity simulations can provide a rich platform to develop autonomy by enabling the use of AI technologies such as deep learning, computer vision, reinforcement learning, etc. We have developed AirSim, a simulator for autonomous vehicles built on the Unreal Engine. It is open-source and cross-platform, and it supports hardware-in-loop simulation, thus allowing rapid development and testing of the system. The simulation is developed as a plugin and can simply be dropped into any Unreal environment. AirSim supports AI development capabilities by exposing APIs to enable data logging and controlling vehicles in a platform-independent manner. We will give an overview of how to use AirSim for building realistic simulation environments and doing development for quadrotors that use popular flight controllers such as Pixhawk. We will also showcase how the system can be used to incorporate machine learning components useful for building such autonomous systems.

Cloud-native AI on OpenShift

Room 203

Václav Pavlín, Red Hat
Francesco Murdaca, Red Hat

Ever thought of doing cloud-native AI work? What does that even mean? This workshop will introduce you to running AI-related services like Spark, Seldon, or Jupyter on Kubernetes as part of the Open Data Hub project. You will learn how to move your AI workloads to the cluster and implement a basic data science workflow. As Jupyter notebooks have become the de facto standard in data science, we will show you how to use them and adopt some of the best practices that we’ve developed over time.

How to Make Data-Driven Decisions: The Case for Contextual Multi-armed Bandits

Room 205

Michal Pleva, Dataclair.ai, O2 Czech Republic
Petr Stanislav, Dataclair.ai, O2 Czech Republic

Supervised learning has done wonders, but it’s fundamentally limited. A good prediction of customers' churn or the likelihood of a new acquisition may not always help you do what is best in a given situation. By attending our workshop, you will get hands-on experience with algorithms for direct optimization of decision-making under uncertainty. We will focus on the special case of reinforcement learning known as contextual multi-armed bandit problems. These problems arise frequently in important industrial applications, played a role in AlphaGo's success, and are very often adopted by industry leaders such as Google and Netflix. Decision-making under uncertainty is a challenge, so we will show you how to effectively balance trying new things to find better solutions against repeating the behavior that works well. During the workshop, you will have an opportunity to play with a linear algorithm to solve a simple problem, as well as with a more advanced solution involving a deep neural network that learns a latent representational feature space for a problem.

SAS Viya & Open Source Integration focus on Python

Room 206

Ivan Kasanicky, SAS
Jordan Bakerman, SAS

In this course, you will learn to use the Python API to take control of SAS Cloud Analytic Services (CAS) actions. You will also learn to upload data into the in-memory distributed environment, analyze data, and create predictive models in CAS using familiar Python functionality via the SWAT (SAS Wrapper for Analytics Transfer) package. You will then learn to download results to the client and use native Python syntax to compare models.

Room 103

Room 106

Room 203

Room 205

Room 206

09:00 – 12:30

Agile Data Annotation

Room 103

Marek "Marx" Grac, Phalanx

Come join us for our workshop and get hands-on experience with data annotation. The main goal of data annotation in Machine Learning algorithms is to make the implicit explicit so that the learning process can be improved. Even though many people see data annotation as a mundane task the process of creating guidelines and processes can be very interesting. In this workshop you will test various data annotation techniques mainly application-driven and low-cost approaches. We will also focus on how to measure the quality of the resulting data as well as test various UX principles and see how much they impact the cost-efficiency. Finally when you get bored of doing the manual part of data annotation yourself we will go through the basic legal aspects of outsourcing it.  

Automatic and Explainable Machine Learning with H2O

Room 106

Jo-fai (Joe) Chow, H2O

The General Data Protection Regulation (GDPR) is now in place. Are you ready to explain your models? This is a hands-on tutorial for beginners. I will demonstrate the use of the open-source H2O platform (https://www.h2o.ai/products/h2o/) with both Python and R for automatic and interpretable machine learning. Participants will be able to follow along and quickly build regression and classification models with H2O's AutoML. They will then be able to explain the model outcomes with various methods.

Machine Learning in Julia

Room 203

Kevin O'Brien, Coillte
Avik Sengupta, Julia Computing

Julia was designed from its conception as a language for high-performance computation that is at the same time highly interactive. To achieve this, Julia is one of the few modern languages that relies on just-in-time (JIT) compilation via LLVM to make its code run as fast as, or faster than, statically compiled C and Fortran code. Its modern language design has the following features: multiple dispatch, Lisp-like macros, dynamic types, type inference, built-in parallel/distributed computing, lightweight threads, and elegant high-level language constructs. Outline: Introduction to Julia; The Julia Language; Julia in Data Science; Julia Interfacing with Python and R; Machine Learning in Julia; High-performance computing in Julia.

Data Analysis in Big Data Environment with Apache Spark and Python

Room 205

David Vrba, Socialbakers
Peter Vasko, Socialbakers
Jiri Harazim, Databricks

Apache Spark has become a standard for data processing and machine learning in big data environments. It is especially popular for its high-level DataFrame API, which makes working with structured data both convenient and efficient. In the first part of this workshop, we will get familiar with the DataFrame API of Spark and see some challenges that you might face when processing large datasets. We will explore some advanced optimization techniques and see how to apply them to compose efficient analytical queries. In the second part of the workshop, we will see how Spark can be used for machine learning, and deep learning in particular. We will explore Deep Learning Pipelines - a library that integrates Spark with deep learning frameworks such as TensorFlow and Keras.

Programming the Pepper Robot

Room 206

Aleš Horák, Faculty of Informatics at Masaryk University
Adam Rambousek, Faculty of Informatics at Masaryk University
Zuzana Nevěřilová, Faculty of Informatics at Masaryk University
Marek Medved, Faculty of Informatics at Masaryk University

This workshop introduces Pepper, the social robot by Softbank Robotics. The robot's hardware capabilities, as well as examples of natural human-machine interaction in English and Czech (which are being developed by the team at FI MU), will be presented in detail, including a tutorial on writing your own programs for a virtual or a real Pepper robot. The 1.2-m-tall robot is designed for social interactions with people, and it is equipped with an extensive API set to detect faces, mood, or age and to react to their values.

12:30 – 14:00

Lunch

14:00 – 17:30

Zero to AI: Workshop on the Wolfram Language

Room 103

Jon McLoone, Wolfram Research

Designed by Wolfram data science experts, this workshop will provide an introduction to machine learning techniques, illustrated with live, dynamic examples using the Wolfram Language. The workshop will walk you step-by-step through the basics of machine learning methodologies and techniques and how to apply them using the Wolfram Language. Upon completion, you will come away with enough practical knowledge to immediately use the Wolfram Language for your own machine learning tasks on text, data, or images, including supervised classification and prediction, unsupervised feature identification, sequence prediction, and computer vision.

Developing Autonomous Vehicles with High Fidelity Simulation

Room 106

Ashish Kapoor, Microsoft

High-fidelity simulations can provide a rich platform to develop autonomy by enabling the use of AI technologies such as deep learning, computer vision, reinforcement learning, etc. We have developed AirSim, a simulator for autonomous vehicles built on the Unreal Engine. It is open-source, cross-platform, and supports hardware-in-loop simulation, thus allowing rapid development and testing of the system. The simulation is developed as a plugin and can simply be dropped into any Unreal environment. AirSim supports AI development by exposing APIs to enable data logging and controlling vehicles in a platform-independent manner. We will give an overview of how to use AirSim for building realistic simulation environments and doing development for quadrotors that use popular flight controllers such as Pixhawk. We will also showcase how the system can be used to incorporate machine learning components useful for building such autonomous systems.

Cloud-native AI on OpenShift

Room 203

Václav Pavlín, Red Hat
Francesco Murdaca, Red Hat

Ever thought of doing cloud-native AI work? What does that even mean? This workshop will introduce you to running AI-related services like Spark, Seldon, or Jupyter on Kubernetes as part of the Open Data Hub project. You will learn how to move your AI workloads to the cluster and implement a basic data science workflow. As Jupyter notebooks have become the de facto standard in data science, we will show you how to use them and adopt some of the best practices that we’ve developed over time.

How to Make Data-Driven Decisions: The Case for Contextual Multi-armed Bandits

Room 205

Michal Pleva, Dataclair.ai, O2 Czech Republic
Petr Stanislav, Dataclair.ai, O2 Czech Republic

Supervised learning has done wonders, but it’s fundamentally limited. A good prediction of customers' churn or the likelihood of a new acquisition may not always help you do what is best in a given situation. By attending our workshop, you will get hands-on experience with algorithms for direct optimization of decision-making under uncertainty. We will be focusing on the special case of reinforcement learning known as contextual multi-armed bandit problems. Those problems arise frequently in important industrial applications, played a role in AlphaGo's success, and are very often adopted by industry leaders such as Google and Netflix. Decision-making under uncertainty is a challenge, so we will show you how to effectively balance between trying new things to find better solutions and repeating the behavior that works well. During the workshop, you will have an opportunity to play with a linear algorithm to solve a simple problem, as well as with a more advanced solution involving a deep neural network to learn a latent representational feature space for a problem.
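To make the explore/exploit trade-off in this abstract concrete, here is a minimal sketch of a contextual bandit with one linear reward model per arm and an epsilon-greedy policy. It is not the workshop's material: the class name, the toy environment, the reward probabilities (0.8 and 0.2), and all hyperparameters are assumptions chosen purely for illustration.

```python
import random


class EpsilonGreedyLinearBandit:
    """Hypothetical contextual bandit: one linear reward model per arm."""

    def __init__(self, n_arms, n_features, epsilon=0.1, lr=0.05, seed=0):
        self.epsilon = epsilon  # probability of trying a random arm
        self.lr = lr            # step size of the per-arm SGD update
        self.rng = random.Random(seed)
        self.weights = [[0.0] * n_features for _ in range(n_arms)]

    def predict(self, arm, context):
        # Estimated reward of `arm` for this context (dot product).
        return sum(w * x for w, x in zip(self.weights[arm], context))

    def choose(self, context):
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(self.weights))  # explore
        scores = [self.predict(a, context) for a in range(len(self.weights))]
        return scores.index(max(scores))                  # exploit

    def update(self, arm, context, reward):
        # One SGD step on the squared error of the chosen arm only.
        error = reward - self.predict(arm, context)
        self.weights[arm] = [
            w + self.lr * error * x for w, x in zip(self.weights[arm], context)
        ]


def simulate(rounds=5000, seed=0):
    """Toy environment (assumed): arm k pays off best for context type k."""
    rng = random.Random(seed)
    bandit = EpsilonGreedyLinearBandit(n_arms=2, n_features=2, seed=seed + 1)
    hits = 0
    for t in range(rounds):
        kind = rng.randrange(2)
        context = [1.0, 0.0] if kind == 0 else [0.0, 1.0]  # one-hot context
        arm = bandit.choose(context)
        p_reward = 0.8 if arm == kind else 0.2
        reward = 1.0 if rng.random() < p_reward else 0.0
        bandit.update(arm, context, reward)
        if t >= rounds - 1000:
            hits += arm == kind  # count best-arm choices in the last 1000 rounds
    return bandit, hits / 1000
```

Calling `simulate()` returns the trained bandit and the share of the last 1000 rounds in which it picked the better arm. Raising `epsilon` makes the policy explore more at the cost of short-term reward.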

SAS Viya & Open Source Integration focus on Python

Room 206

Ivan Kasanicky, SAS
Jordan Bakerman, SAS

In this course, you will learn to use the Python API to take control of SAS Cloud Analytic Services (CAS) actions. You will also learn to upload data into the in-memory distributed environment, analyze data, and create predictive models in CAS using familiar Python functionality via the SWAT (SAS Wrapper for Analytics Transfer) package. You will then learn to download results to the client and use native Python syntax to compare models.

Saturday, March 21
Workshops

08:50 – 09:00

Welcome to ML Prague

09:00 – 09:30

Autonomous driving: few insights on perception and explainability

Matthieu Cord, Valeo

Self-driving is a safety-critical application. In this talk, I first present the machine learning framework used for autonomous driving, gathering contributions from the computer vision, deep learning, and autonomous robotics research fields. I then discuss some of the main challenges we face at valeo.ai to improve advanced driver-assistance systems. I will give some examples, such as unsupervised domain adaptation for visual segmentation or a driving behavior explanation system using natural language processing.

09:30 – 10:00

Confidence Estimation Learning for Production-ready Neural Networks

Adam Blažek, Iterait

Deploying your ML model to production may bring you headaches for many reasons, e.g. out-of-distribution input data or low-quality user input. Recognizing those cases is a crucial step for providing actionable feedback and handling those cases properly. This talk uncovers our simple yet effective recipe for integrated confidence estimation learning alongside a practical example used in a production environment.

10:00 – 10:30

AI-accelerated Computational Fluid Dynamics (CFD)

Krzysztof Rojek, byteLAKE

Computational Fluid Dynamics (CFD) comprises numerical methods and algorithms for solving fluid flow problems. They help model fluid density, velocity, pressure, temperature, and chemical concentrations in relation to time and space. Many industries, such as automotive, chemical, aerospace, biomedical, power and energy, and construction, rely on fast CFD analysis turnaround time. Typical applications include weather simulations, aerodynamic characteristics modeling and optimization, and petroleum mass flow rate assessment.

byteLAKE has been working on leveraging Artificial Intelligence (AI) and Deep Learning to significantly accelerate CFD simulations. These typically take anywhere from hours to days or weeks. byteLAKE's CFD Suite, a collection of AI models, helps predict accurate results within minutes. During his presentation, Krzysztof Rojek will share more details about the solution, its scalability, and its compatibility with CAE tools and OpenFOAM solvers, and present benchmarks for commercial simulations. Krzysztof will also present how to get started with CFD Suite and accelerate your simulations.

10:30 – 11:00

COFFEE BREAK & EXPO

11:00 – 11:30

AI in Cardiology: detecting heart dysfunctions

Filip Plešinger, Institute of Scientific Instruments of the Czech Academy of Sciences

Regardless of your job, you need your heart working. Early diagnostics, available through many wearable devices on the market, can capture diseases before they progress to a severe form. But we do not want to scare you in this talk; we will iterate from simple to more complicated ML/DL methods and their application in early diagnostics using ECG signals from telemedicine data.

11:30 – 12:00

Modular MLOps architecture built to last

Radovan Parrák, Credo

Every company that takes machine learning seriously needs to ‘productionalize’ its ML pipelines efficiently, robustly, and at scale. The emerging methodology called Machine Learning Operations (MLOps) comes to the rescue.

However, there are already hundreds of convenient, stand-alone, and overlapping ML tools, workflow managers, and automation and orchestration frameworks, developed by both vendors and the open-source community striving to put this methodology into practice. New ones keep on appearing (and disappearing). As a result, many companies are contemplating whether to buy an MLOps platform or to build one internally. And if the latter, they hope to postpone the architectural decisions until the sheer number of available options shrinks to a widely accepted set of tooling. But will it ever?

In this talk, I will share some of Credo’s experience on how to design a modular and future-proof MLOps platform, based on open-source tooling, that hits the ground running today and still survives tomorrow in the ever-changing zoo of ML tooling.

12:00 – 12:30

Continuous Machine Learning

Paweł Redzyński, dvc.org

In the software engineering world, CI/CD practices have proven to be a reliable and effective approach to automating recurring tasks, like running tests, code analysis checks, and even delivering final products to production. In this talk, we will present how to automate ML processes using GitHub Actions or GitLab CI/CD and the Continuous Machine Learning (CML) library, which will take care of:
• transferring large datasets to CI runners
• managing GPU/CPU resources for computations
• generating an ML model report with metrics and plots right in a GitHub Pull Request
so that ML specialists can focus on research.

12:30 – 13:30

POSTER SESSION & LUNCH

Martin Holeček: Table understanding in structured documents

Arun Mathew: SAP Behavioral Insights

Dominik Krzemiński: U-Net for Automated Segmentation of Knee Cartilage Imaging

Jakub Slovan, Jan Rus, Luboš Andert and Petr Jančařík: Bayesian Social Media Content Inspiration

Sebastian Eresheim: Cybersecurity Containment Agent

Rafał Bachorz, Małgorzata Mochol-Grzelak, Grzegorz Miebs: Efficient strategies of static features incorporation into the Recurrent Neural Network

Martin Plajner: Generic system for promotional sales prediction from time series data and individual observations.

Jakub Bartel, Matej Choma, Vojtěch Rybář, Petr Šimánek: ML for High-Resolution Rainfall Forecast

13:30 – 14:30

MASTERMIND SESSION: AI Safety and Value Alignment

Jan Romportl, Dataclair.ai, O2 Czech Republic
Jan Kulveit, Future of Humanity Institute, University of Oxford
Ondřej Bajgar, Future of Humanity Institute, University of Oxford

This panel discussion will feature three panelists from world-renowned research groups where the issues of AI value alignment are taken very seriously. It's a topic inherently related to AI ethics, safety, risks, benefits, and future potential. But the goal is to show, in a very open discussion with the audience, that it really should concern every ML practitioner.

13:30 – 14:30

MASTERMIND SESSION: Deep Learning vs. Rule-based Systems in Practical Applications

Petr Somol, Avast
Viliam Lisy, Avast

Deep learning has achieved unprecedented performance in a wide range of domains, from computer vision, speech recognition, and natural language processing to game playing. However, many industrial systems still rely on human-written and human-maintained rule-based systems to perform classification. The reasons include better explainability of the rule-based systems and their modularity, which is crucial in dealing with non-stationary problems. We will discuss each approach's advantages and disadvantages and the possibilities of getting the best of both worlds.

13:30 – 14:30

MASTERMIND SESSION: From knowledge graphs to drug development

Jakub Kotowski, MSD IT
Pavel Vacha, MSD IT
Michael Wurst, MSD IT
Petr Mejzlik, MSD IT
Nik Vostrosablin, MSD IT

In popular news, AI in Pharma and Life Sciences is connected predominantly with early drug discovery. There are many more opportunities to apply both classical and modern AI in Pharma. In this session, we will present an overview of drug discovery and development phases together with examples of how AI applies to them. We will also mention a couple of selected examples from our own work: Reaction PathFinder - computation of optimal synthesis routes based on a graph of chemical reactions, Mutation Maker - a tool for designing optimized proteins, CAKE - an evaluation and recommendation engine for streamlining pharma manufacturing change requests (also presented at Amazon Re:Invent), and several Natural Language Processing use cases. The panelists are hands-on experts looking forward to an engaging discussion with you.

13:30 – 14:30

MASTERMIND SESSION: Learning predictors with limited labels

Jan Brabec, Cisco
Pavel Procházka, Cisco
Tomáš Jirsík, Cisco

Most successful industrial ML systems of today require large amounts of labeled data for training to perform well. Yet high-quality labeled data are a scarce resource. This is especially true in cybersecurity and other domains that deal with difficult-to-label, severely class-imbalanced, and highly confidential data. We will discuss approaches to learning practical classification systems with a limited amount of ground truth. We would like to focus on concrete learning approaches and also on the broader challenges related to obtaining and working with labeled data in practical classification systems.

13:30 – 14:30

MASTERMIND SESSION: Artificial Intelligence (AI) accelerating industrial Computational Fluid Dynamics (CFD) simulations

Marcin Rojek, byteLAKE
Mariusz Kolanko, byteLAKE
Damo Vedapuri, Tridiagonal Solutions
Robert Daigle, Lenovo Data Center
Andrzej Jankowski, Intel Corporation
Valerio Rizzo, Lenovo Data Center
Ashish Kulkarni, Tridiagonal Solutions

Computational Fluid Dynamics (CFD) refers to numerical methods used across many industries (chemical, pharma, automotive, construction, and oil & gas, to name a few) to model fluid pressure, velocity, temperature, etc. Typical applications include modelling aerodynamics, chemical mixing, air flows around buildings, etc. CFD simulations usually take anywhere from many hours to several days, depending on the amount of information that needs to be processed, i.e. geometry, boundary conditions, and initial parameters like velocities, viscosity, etc. byteLAKE, a company specializing in machine and deep learning, has been developing a collection of Artificial Intelligence (AI) models that aim to significantly reduce time to results for such simulations.

We invite you to a moderated panel discussion where byteLAKE co-founders, together with Tridiagonal Solutions, the producer of the leading CFD tool for enterprise mixing analysis (MixIT), will discuss how Deep Learning models accelerate complex chemical mixing simulations. Panelists will talk about how such simulations help address various industries' challenges, explain how AI helps reduce the cost of trial-and-error experiments, and discuss the future of AI in the CFD space. We will also have representatives from Lenovo Data Center and Intel Corporation, who will weigh in on the scalability of the technology and on how various hardware configurations can deliver maximum value for AI+CFD adopters.

14:30 – 15:00

COFFEE BREAK & EXPO

15:00 – 15:30

Challenges of Machine Learning Under Distribution Shift

Silvestr Stanko, Qminers

Most machine learning algorithms depend on the assumption that training and testing data are sampled independently from the same distribution. But what happens when this assumption doesn't hold? At Qminers, we are facing this problem constantly, since data from financial markets are notoriously non-stationary.
In this talk, we will discuss the different faces of distribution shift and how to fight it, both in theory and practice. Topics we will touch on include Robustness, Risk-Aversion and Invariant Risk Minimization. I will show that distribution shift is a real problem faced by ML practitioners, and that solutions exist.

15:30 – 16:00

AIOPS, Machine Learning and Anomaly detection, our experience implementing a virtual assistant engine to detect and triage anomalous behavior in a data center

Kirill Maiantsev, Broadcom

Join this session to learn about our experience and challenges in designing a virtual assistant engine used in many of our IT data center monitoring products. We will discuss the use case we have solved, its evolution as we encountered challenges, and some of the machine learning models implemented to solve these problems. We will also provide an in-depth review of the Kernel Density Estimation model we used to study time series in order to obtain the expected value ranges. When anomalies are detected in the expected values, we perform some additional learning on these anomalies to detect patterns and auto-correlate live events in the data center. During the session, we will share both our learnings and our challenges in building this enterprise-grade virtual assistant.
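As a rough illustration of how Kernel Density Estimation can yield an expected value range for a metric, the sketch below fits a Gaussian KDE to historical values and flags a new value as anomalous when its estimated density falls below a low quantile of the densities seen in the history. This is not the speakers' implementation: the function names, the normal-reference bandwidth rule, and the 5% quantile are assumptions.

```python
import math


def rule_of_thumb_bandwidth(samples):
    # Normal-reference rule of thumb: 1.06 * std * n^(-1/5) (an assumption).
    n = len(samples)
    mean = sum(samples) / n
    std = math.sqrt(sum((s - mean) ** 2 for s in samples) / (n - 1))
    return 1.06 * std * n ** (-1 / 5)


def gaussian_kde(samples, bandwidth):
    """Return a function that estimates the probability density at x."""
    norm = 1.0 / (len(samples) * bandwidth * math.sqrt(2.0 * math.pi))

    def density(x):
        return norm * sum(
            math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in samples
        )

    return density


def fit_expected_range(history, quantile=0.05):
    """Fit the KDE and pick a density threshold from the history itself."""
    density = gaussian_kde(history, rule_of_thumb_bandwidth(history))
    scores = sorted(density(x) for x in history)
    threshold = scores[int(quantile * len(scores))]
    return density, threshold


def is_anomalous(value, density, threshold):
    # Values in low-density regions fall outside the expected range.
    return density(value) < threshold
```

In practice the history would be a sliding window over the monitored metric. Comparing densities instead of fixed bounds lets the expected range follow multi-modal behavior, such as separate day and night load levels.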

16:00 – 16:30

Expertise recommendations - A supervised approach that surmounts incomplete datasets

Jeremy Jonas, McKinsey & Company
Felipe Vianna, McKinsey & Company

For knowledge-based organizations, finding precise expertise to address specific projects is increasingly important. At McKinsey, we’ve been improving our internal expertise search capability by enriching colleague profiles in various ways, including ML-driven recommendations for ‘Topics to call me about’. Through a number of innovations, our Prague-based Data Science team has created a highly effective prediction model.

Traditionally, expert profiling and retrieval are based on document retrieval approaches. But can the information available in profiles be used to train a supervised model? As with many retrieval applications, our challenges began with the limited amount of data available, as well as its format, which at McKinsey is mostly PowerPoint files. Several well-known approaches were combined to perform a Document Classification step in an unsupervised fashion, providing data to create the expert-candidate representations. In a later step, profiling of the experts was achieved despite noisy label data (incomplete profiles) and a large number of features (compared to the number of samples available).

Despite these challenges, the model is now achieving 80% acceptance of recommendations. In turn this is materially helping the Firm find appropriate experts when needed.

16:30 – 17:00

COFFEE BREAK & EXPO

17:00 – 17:30

Understanding and mitigating unwanted bias in Artificial Intelligence

Karthikeyan Natesan Ramamurthy, IBM Research AI

AI and machine learning models are increasingly used to inform high-stakes decisions. Discrimination by AI becomes objectionable when it places certain privileged groups at systematic advantage and certain unprivileged groups at systematic disadvantage. In this talk, we will discuss the sources of unwanted bias in AI, and how it manifests along various points in the AI pipeline. We will also explore several methods of bias mitigation. Finally, we will discuss how bias can be measured and mitigated using the open source AI Fairness 360 toolkit.

17:30 – 18:00

Ensuring Machine Learning Fairness with Monotonic Constraints

Serg Masís, Syngenta

The first part of the session underpins the importance of Machine Learning Interpretation. Fundamentally, it is needed because machine learning by itself is incomplete as a solution. After all, the problems it solves are not deterministic, so a solution, being an optimization, cannot cover every case. One of the most significant issues that AI faces today is overconfidence. Given the high accuracy of AI solutions, we tend to increase our confidence level to the point where we believe we fully understand the problem. Then, we are misled into thinking our solution covers all of it! The machine learning interpretability toolkit can help us first learn from our models. Then we can leverage what was learned, or our domain knowledge, to place guardrails, mitigate bias, and enhance model reliability, making models safe to use even in rare and unexpected situations and free from discriminatory practices. One of the ways in which fairness can be ensured is through monotonic constraints. We will discuss several scenarios in which this may be needed.

During the second part, we will dive into a law school scholarship problem. Let’s suppose a law school wants to hand out merit-based scholarships to those students most likely to pass the bar exam. To that end, they want to train classifiers that score students on this probability. However, the classifier must be consistent with merit-based norms, such as having the highest grades in other examinations. Employing monotonic constraints in XGBoost and TensorFlow will place the guardrails so that students with high examination scores are never unfairly penalized. We will walk through the code that assesses, establishes, and confirms model fairness.

18:00 – 18:30

Private Federated Learning

Vojta Jína, Apple

Federated Learning is a new approach that is picking up steam in the machine learning community as a way to improve global models by leveraging on-device training on user data. At WWDC 2019, Apple announced Private Federated Learning by combining Federated Learning with Differential Privacy. We have started to use this technology in iOS 13 for a variety of use cases, including QuickType keyboard, Found in Apps, and Smart Replies. In this talk, Vojta will provide more details about this technique.

18:30 – 18:50

Announcement of the hackathon winners

Ivan Kasanicky, SAS

Sunday, March 22
Conference day 1

09:00 – 09:30

Antibiotics discovery and design of mRNA- and protein-based therapeutics by Machine Learning and Optimization strategies

Nik Vostrosablin, MSD IT

Join my session to learn about published and unpublished studies in which we developed and applied: (i) a deep learning and NLP strategy to mine bacterial genomes in order to identify natural products with antibacterial activity (Hannigan et al., 2019, Nucleic Acids Research); (ii) a bundle of constraint-satisfaction, optimization, heuristics, and backtracking algorithms to enable the design of novel proteins that can be used in research, therapeutics, and industrial processes (Hiraga et al., 2021, ACS Synthetic Biology); (iii) an integrated constraint-satisfaction approach to design, optimize, and visualise mRNA-based therapeutics (Vostrosablin et al., 2021, in preparation).

09:30 – 10:00

Harnessing relational learning for explainable learning

Tomas Pevny, Avast

While most of the machine learning methods assume that samples are vectors, matrices, or sequences, in many real-world problems they have a rich structure. While this structure makes the manual design of features non-trivial, I see it as an inductive bias that should drive the design of models. In this talk, I will introduce a simple, yet powerful framework for learning on structured data. A side, yet important feature is the explainability of decisions, which is the result of ingesting data as-is instead of devising artificial features. A concrete implementation of the framework will be demoed on data from various stages of analysis of malware.

10:00 – 10:30

How to build the perfect model of a human according to their voice

Petr Schwarz, Phonexia

Voice biometry is a technology that outperforms humans. Petr Schwarz will present how modern voice biometry systems are built and how they are deployed. The key issues are how to collect data, what input features describe the human vocal tract, what machine learning techniques are used for modeling, how to train the models, and how to deliver the model to its user while keeping the best accuracy.

10:30 – 11:00

COFFEE BREAK & EXPO

11:00 – 11:30

The Ethical aspects of Machine Learning

Uri Eliabayev, Machine and Deep Learning Israel

Machine Learning has become a major part of our lives. As more and more companies and organizations implement ML-based solutions, we need to better understand the ethical aspects of Machine Learning algorithms.

In this talk, we will speak about the key elements of this field (fairness, explainability, bias, and more) and give some past examples of ethical problems in the ML field. Alongside that, we will suggest ways to solve or reduce the ethical problems in each ML project, and finally, we will learn how companies like Google and Microsoft make their algorithms fairer.

11:30 – 12:00

ML powered Crime Prediction

Or Herman-Saffar, Dell

What if we could predict when and where the next crimes will be committed? Crimes in Chicago is a publicly published dataset which reflects the reported incidents of crime that occurred in Chicago since 2001. Using this data, we would like not only to be able to explore specific crimes to find interesting trends, but also predict how many crimes will be taking place next week, and even next month.

12:00 – 12:30

How We Foster Superhuman Analysts

Filip Dousek, Workday

One year ago, Prague-based Stories.bi was acquired by Workday. Today, the same team is building its ML-driven augmented analytics for Workday's largest customers. Filip will talk about the concept behind Stories.bi, how it is different, and why it's called the next generation of BI & analytics.

12:30 – 13:30

POSTER SESSION & LUNCH

Martin Holeček: Table understanding in structured documents

Arun Mathew: SAP Behavioral Insights

Dominik Krzemiński: U-Net for Automated Segmentation of Knee Cartilage Imaging

Jakub Slovan, Jan Rus, Luboš Andert and Petr Jančařík: Bayesian Social Media Content Inspiration

Sebastian Eresheim: Cybersecurity Containment Agent

Rafał Bachorz, Małgorzata Mochol-Grzelak, Grzegorz Miebs: Efficient strategies of static features incorporation into the Recurrent Neural Network

Martin Plajner: Generic system for promotional sales prediction from time series data and individual observations.

Jakub Bartel, Matej Choma, Vojtěch Rybář, Petr Šimánek: ML for High-Resolution Rainfall Forecast

13:30 – 14:30

MASTERMIND SESSION: Recent advancements in Speech and Language processing. How research is being applied in commercial projects today

Dima Turchyn, Microsoft
Dmitry Soshnikov, Microsoft
Mikhail Burtsev, Moscow Institute for Physics and Technology
Ádám Feldmann, University of Pecs
Panos Periorellis, Microsoft
Kshama Pawar, Microsoft

During the panel, our speakers will share their views on recent advancements in the speech and language processing field and their personal experience in training large-scale natural language models. We will also discuss how the accelerating pace of innovation and adoption in the field of AI leads to fast productization of research results. Panel speakers include researchers and representatives of the Speech and Language Product Groups from Microsoft, as well as researchers from organizations across Central and Eastern Europe, who will share their experience from recent projects as well as their thinking on the next innovations in that area.

13:30 – 14:30

MASTERMIND SESSION: Operationalizing Analytics & ModelOps

Ivan Kasanicky, SAS
Jan Černý, SAS
Dalibor Šrámek, SAS
Ľubomír Boďa, SAS

ModelOps is a holistic approach for rapidly and iteratively moving models through the analytics life cycle so they are deployed faster and deliver expected business value. ModelOps is based on the application development community's DevOps approach. But where DevOps focuses on application development, ModelOps focuses on getting models from the lab through validation, testing and deployment phases as quickly as possible, while ensuring quality results. It also focuses on ongoing monitoring and retraining of models to ensure peak performance.

To help you cross the "last mile" of deployment much faster, and ensure that your analytic models deliver expected value, ModelOps defines people (or culture), process and technology changes that facilitate smooth, efficient and continuous development and deployment of high-impact analytic models.

Join us for this Mastermind session, where experts from different areas will discuss current challenges that often prevent organizations from realizing the full potential of their analytics. In the second half of the session, we will also invite people from the audience to come on stage and share their experience or ask questions.

13:30 – 14:30

MASTERMIND SESSION: How to control and achieve Data Quality

Lukáš Matějka, Lundegaard
Petr Šmíd, PEKAT VISION
Michal Štefánik, Gauss Algorithmic
Lukáš Vrábel, thevertical.ai, keyless.io

A critical success factor is having data in good shape. Let’s discuss good practices for supporting data quality, from both an organizational and a technical-tooling point of view. What does data quality mean in particular? We would also like to focus on effectiveness within small start-up product teams without a dedicated "quality department" and on what can be done on a good-enough basis. Let’s discuss techniques for properly monitoring incoming data or the model itself, setting up smart alerts based on comparisons between time periods, production data issues and how to avoid them, and validation automation.

13:30 – 14:30

MASTERMIND SESSION: Computer Vision applications in Manufacturing Industry

Pavel Dvořák, Konica Minolta
Lukas Havlicek, Konica Minolta
Matej Dusik, Konica Minolta
Martin Jahoda, Konica Minolta
Branislav Hesko, Konica Minolta

Machine Learning technology has become an important part of the ongoing Fourth Industrial Revolution. This revolution is moving the manufacturing industry into a new era through the digitization and automation of various processes. Computer Vision-based systems in particular have already brought significant benefits to companies. These benefits include, for example, increased efficiency and decreased waste, which have an impact not only on the company itself but also on society as a whole. Let’s discuss both the applications and implications of ML technology in the manufacturing domain.

13:30 – 14:30

MASTERMIND SESSION: Machine learning in the field of social media

Peter Krejzl, Socialbakers
Jan Rus, Socialbakers
Jakub Slovan, Socialbakers
Lenka Šimková, Socialbakers
Simona Kolenčíková, Socialbakers

At Socialbakers, we analyze around 100M texts every day. We process them using our natural language processing (sentiment, NER, ...) or computer vision (image & video classification) systems. In the session, a large part of our research team will be available to share their knowledge and answer any questions you might have.

14:30 – 15:00

COFFEE BREAK & EXPO

15:00 – 15:30

Complex Systems for AI

Tomas Mikolov, CIIRC CTU Prague

Machine learning has been tremendously successful in the last decade. The core concepts for training the models are supervision, error backpropagation and stochastic gradient descent. However, many scientists believe that to make steps towards more autonomous AI systems, we need to discover learning approaches that are fundamentally less supervised than the current ones. In this talk, I will describe a project where we attempted to define a system which can evolve indefinitely, possibly reach arbitrary complexity, and use no supervision. It is based on the old idea of a cellular automaton, which can be seen as a special type of recurrent-convolutional network. I will show examples of interesting behavior that we observed in automatically constructed models. We were able to discover these interesting automata using a novel metric which measures structured complexity growth in time. This work could be a basis of a new generation of machine learning models which can continue learning in interesting ways in situations where no supervision or even rewards are available.

15:30 – 16:00

AutoML with Keras Ecosystem

Haifeng Jin, Google

The Keras ecosystem now has two new members, Keras Tuner and AutoKeras. They are built with AutoML techniques to dramatically reduce the manual work of designing and training deep learning models. They work seamlessly with Keras and TensorFlow for model export, saving, and deployment. The talk covers not only how to use them but also their underlying mechanisms.

16:00 – 16:30

Deep Neural Networks Abstract Like Humans

Hava Siegelmann, University of Massachusetts Amherst

Deep neural networks (DNNs) have revolutionized AI due to their remarkable performance in pattern recognition, comprising both memorizing complex training sets and demonstrating intelligence by generalizing to previously unseen data (test sets). The high generalization performance in DNNs has been explained by several mathematical tools, including optimization, information theory, and resilience analysis. In humans, it is the ability to abstract concepts from examples that facilitates generalization; this presentation describes DNN generalization from that perspective. A recent computational neuroscience study revealed a correlation between abstraction and particular neural firing patterns. We express these brain patterns in a closed-form mathematical expression, termed the “Cognitive Neural Activation metric” (CNA), and apply it to DNNs. Our findings reveal parallels between the mechanisms underlying abstraction in DNNs and those in the human brain. Beyond simply measuring similarity to human abstraction, the CNA is able to predict and rate how well a DNN will perform on test sets, and to determine the better network architectures for a given task in a manner not possible with extant tools. These results were validated on a broad range of datasets (including ImageNet and randomly labeled datasets) and neural architectures.

16:30 – 17:00

COFFEE BREAK & EXPO

17:00 – 17:30

Building Safety Mechanisms in Autonomous Systems

Ashish Kapoor, Microsoft

Machine Learning is one of the key components that enable systems to operate under uncertainty. For example, AI systems and robots might employ sensors together with a machine-learned system to identify obstacles. However, such data-driven systems are far from perfect and can result in failure cases that jeopardize safety. In this talk we will explore a framework that aims to preserve safety invariants despite the uncertainties in the environment arising from incomplete information. We will describe various methods to reason about safe plans and control strategies despite perceiving the world through noisy sensors and machine learning systems. We will also consider extensions of these ideas, using high-fidelity simulation, to a sequential decision-making framework that considers the trade-off between risk and reward in a near-optimal manner.

17:30 – 18:30

Panel Discussion

Ashish Kapoor, Microsoft
Karthikeyan Natesan Ramamurthy, IBM Research AI
Tomas Mikolov, CIIRC CTU Prague
Hava Siegelmann, University of Massachusetts Amherst

18:30 – 18:35

Closing Remarks

Have a great time

A great gift for this year’s attendees

Did you get your ticket before November 25, 2020? Then you’ll get a 50% discount on your ticket for ML Prague 2022, to celebrate the return to the house of Machine Learning in CE, the Rudolfinum Music Hall!

Now or never

Tickets

Standard Ticket

Sold Out

Conference days € 120
Only workshops € 170
Conference + workshops € 270

Late Ticket

Sold Out

Conference days € 150
Only workshops € 195
Conference + workshops € 295

What You Get

Practical and advanced level talks led by top experts
Connect with ML pros from all around the world to share expertise
Access to actionable practical content

They’re among us

We are in The ML Revolution age

Machines can learn. Incredibly fast. Faster than you. They are getting smarter and smarter every single day, changing the world we’re living in, our business and our life. The artificial intelligence revolution is here. Come, learn and make this threat your biggest advantage.