Machine Learning in Python with Scikit-learn
This workshop will provide participants with the basics of machine learning in Python.
Date and time
Location
Netherlands eScience Center
Science Park 402, 1098 XH Amsterdam, Netherlands
About this event
The workshop will take place at Science Park 402, 1098 XH Amsterdam. Please note that lunch, as well as drinks at the end of the workshop, is included.
This hands-on workshop will provide you with the basics of machine learning using Python.
Machine learning is the field devoted to methods and algorithms that ‘learn’ from data. It can be applied to a vast range of different domains, from linguistics to physics and from medical imaging to history.
This workshop covers the basics of machine learning in a practical and hands-on manner so that upon completion, you will be able to train your first machine learning models and understand what next steps to take to improve them.
We start with data exploration and prepare the data so that it is suitable for machine learning. Then we learn how to train a model on the data using scikit-learn. We learn how to select the best model from the trained models and how to use different machine learning models (like linear regression, logistic regression, and decision tree models). Finally, we discuss some of the best practices when starting your own machine learning project.
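The steps described above can be sketched in a few lines of scikit-learn. This is only an illustrative sketch and not the workshop's own material: the diabetes dataset bundled with scikit-learn, the two candidate models, and names such as `candidates` and `cv_scores` are assumptions made for this example.

```python
from sklearn.datasets import load_diabetes
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score, train_test_split
from sklearn.tree import DecisionTreeRegressor

# Load a small numerical dataset that ships with scikit-learn (assumed here for illustration).
X, y = load_diabetes(return_X_y=True)

# Hold out a test set that is only used once, for the final evaluation.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Two candidate models of the kinds mentioned above.
candidates = {
    "linear regression": LinearRegression(),
    "decision tree": DecisionTreeRegressor(max_depth=3, random_state=0),
}

# Compare the candidates with 5-fold cross-validation on the training data only.
cv_scores = {
    name: cross_val_score(model, X_train, y_train, cv=5).mean()
    for name, model in candidates.items()
}

# Select the model with the highest mean cross-validated R^2 and refit it.
best_name = max(cv_scores, key=cv_scores.get)
best_model = candidates[best_name].fit(X_train, y_train)

# Report the score on the held-out test set.
test_score = best_model.score(X_test, y_test)
print(f"Selected model: {best_name}, test R^2: {test_score:.2f}")
```

Selecting the model on cross-validation scores and touching the test set only once keeps the final score an honest estimate of how the model performs on unseen data.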
The workshop is based on the teaching style of the Carpentries, and learners will follow along while the instructors write the code on screen. More information can be found on the workshop website (will be activated once registration is live, usually 3 weeks before the start of the workshop).
Cancellation and No-Show Policy
Please be advised that by signing up, you agree to our Cancellation and No-Show Policy, which states that cancellations made less than 2 working days prior to the event will incur a no-show fee. Please read the full policy here for more details.
If you won’t be able to attend, please cancel your registration (by following the instructions here) so that we can offer your seat to someone on the waiting list.
Audience
The workshop is aimed at PhD candidates, postdocs, (assistant) professors and other researchers, and is also open to research support staff and research software engineers. We offer tickets for researchers who are affiliated with Dutch research institutions. We also offer industry tickets for attendees who are not affiliated with Dutch research institutions. We do not accept registrations from Master's students.
Ticket prices
Ticket prices are as follows:
- For participants affiliated with Dutch research institutions: €225
- For participants from industry: €675
Early Bird Discount
Register within the first week of open registration and receive a 25% discount on your participation fee. Don’t miss out on this opportunity to secure your spot at a reduced price!
Group Discount
Bring your team along and benefit from our group discount. Groups of five or more participants are eligible for a 25% discount. For more details and to register your group, please email us at training@esciencecenter.nl.
Prerequisites
The course aims to be accessible without a strong technical background.
This course is for you if:
- You have basic knowledge of Python programming: defining variables, writing functions, importing modules. Some prior experience with the NumPy, pandas and Matplotlib libraries is recommended but not required.
- You want to learn how to set up a full machine learning pipeline in Python for various machine learning tasks.
- You want to build an intuition for basic machine learning concepts, such as train-test data splits, model training and evaluation, different machine learning algorithms, overfitting/underfitting, and the bias-variance trade-off.
This course is not for you if:
- You already have experience with machine learning or its concepts. This is really an introduction for people who have never done machine learning, or who have only just started and need more guidance.
- You want to get a solid mathematical understanding of machine learning theory. This course aims to quickly get participants comfortable applying machine learning in practice, so we only cover the basics of the theoretical concepts without going into depth.
- You want to learn about deep learning.
- You want to learn about more advanced data preprocessing, such as data cleaning or handling missing values. We only cover the basics of data preprocessing that are needed to set up a machine learning pipeline.
Also have a look at the syllabus to see what topics we will cover.
If you are uncertain whether this course is for you, please send us an email.
Syllabus
Machine learning concepts
- What is machine learning?
- Different types of machine learning
- Big picture of machine learning models
- General pipeline
The predictive modeling pipeline
- Tabular data exploration
- Fitting a scikit-learn model on numerical data
- Handling categorical data
Selecting the best model
- Overfitting and underfitting
- Validation and learning curves
- Bias versus variance trade-off
Machine learning algorithms
- Intuitions on linear models
- Intuitions on tree-based models
Machine learning best practices
- Data hygiene
- Correct evaluation
- How to keep your machine learning project organised