Tuesday, April 16, 2024

Data Preprocessing and Transformation in Machine Learning | Kelly Hoang, Gilead Sciences


In the realm of machine learning, the quality of data directly influences the performance and accuracy of models. The process of data preprocessing and transformation plays a pivotal role in shaping raw data into a format suitable for effective machine learning algorithms. Using real world data set from the WiDS Datathon 2024 challenge, this workshop aims to delve into the fundamental concepts and demonstrates different techniques of data preprocessing and transformation for machine learning tasks. Participants will be introduced to an overview of data preprocessing, including data cleaning, handling missing values, feature scaling, and feature engineering. Through hands-on exercises and practical examples, attendees will gain knowledge in utilizing popular Python libraries such as Pandas, NumPy, and Scikit-learn to preprocess and transform real-world data effectively. Link to slides for this workshop: https://drive.google.com/file/d/1Z2GXGTnqsOZHVRGWXo4lprxNVwypHWaU/view?usp=sharing Google Colab Notebook for this workshop: https://colab.research.google.com/drive/1eOfF-lmK7l-sJAnKHmWyBdC3LloCizBE?usp=sharing 0:00 Introduction 1:27 Kelly's Workshop 46:14 Q&A 1:02:05 Closing Learn more about WiDS Workshops: https://www.widsworldwide.org/learn/upskill-workshops/ #GileadSciences #WiDSDatathon2024 #Datathon #DataWrangling #handson #ai #machinelearning #Python #SkillSets # NLP #tutorial #howto #wids #womenindata #widsworkshops #womenindatascience #datascience #widsworldwide

No comments:

Post a Comment