Allan Rosenfield Building
722 W. 168 St., New York, NY 10032




Jun 06 - 07 2024


9:00 am - 5:00 pm



Python Data Wrangling Boot Camp

The Python Data Wrangling Boot Camp is a two-day intensive course that combines concept-focused seminars with hands-on exercises pairing Python fundamentals with practical data wrangling and analysis.

June 6-7, 2024 | In-person training

This two-day course will provide an introduction to the Python programming language and demonstrate how it can be used to do essential data wrangling, manipulation and cleaning tasks using real-world biomedical data. Bringing together scalable methods and popular libraries for data manipulation, basic statistical analysis and visualization, this boot camp will provide participants with the tools and background needed to get started with Python for data work. Through hosted notebooks, participants will leave the workshop with functioning code that they can then apply to their own data sets. Participants will receive orienting videos before the real-time sessions so they can familiarize themselves with the Jupyter Notebook/Google Colab environment; all code samples will be available in this format for participant use.

By the end of the workshop, participants will be able to:

  • Load and explore data sets in Python
  • Join, reconcile and otherwise clean up messy data sets
  • Do basic statistical analyses, including linear and logistic regression
  • Render exploratory visualizations
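
As a rough illustration of the first three outcomes above, the sketch below loads two small tables, joins them on a shared key, drops a row with a missing value, and fits a simple linear regression. It is not taken from the course materials: the sample data, column names (`patient_id`, `age`, `systolic_bp`) and helper functions are all hypothetical, and it uses only the Python standard library, whereas the course itself may rely on other libraries. Visualization is left out to keep the example short.

```python
import csv
import io
from statistics import mean

# Hypothetical sample data: two small tables sharing a patient_id key.
PATIENTS_CSV = """patient_id,age
p1,34
p2,51
p3,
p4,47
"""

LABS_CSV = """patient_id,systolic_bp
p1,118
p2,135
p3,122
p4,130
p5,141
"""


def load_rows(text):
    """Parse CSV text into a list of dicts, one per row."""
    return list(csv.DictReader(io.StringIO(text)))


def join_and_clean(patients, labs):
    """Inner-join the two tables on patient_id and drop rows with a missing age."""
    labs_by_id = {row["patient_id"]: row for row in labs}
    merged = []
    for row in patients:
        lab = labs_by_id.get(row["patient_id"])
        if lab is None or not row["age"].strip():
            continue  # skip unmatched rows and rows with a missing age
        merged.append({
            "patient_id": row["patient_id"],
            "age": float(row["age"]),
            "systolic_bp": float(lab["systolic_bp"]),
        })
    return merged


def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


rows = join_and_clean(load_rows(PATIENTS_CSV), load_rows(LABS_CSV))
slope, intercept = fit_line(
    [r["age"] for r in rows],
    [r["systolic_bp"] for r in rows],
)
print(f"{len(rows)} rows kept; slope={slope:.2f}, intercept={intercept:.2f}")
```

In a notebook environment, the same steps are usually written with a data-frame library rather than hand-rolled helpers; the standard-library version here is only meant to show the shape of the workflow.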

Audience and Requirements

Investigators from any institution and from all career stages are welcome to attend, and we particularly encourage trainees and early-stage investigators to participate.

No prior programming experience is required to participate in this workshop. However, participants must have (or create) an unrestricted Google account for working with sample notebooks (via Google Colab) and data sets. In addition, participants will be expected to complete a brief survey and watch up to 3 hours of pre-recorded introductory material before the real-time workshop activities begin.


Training Director: Susan McGregor, Associate Research Scholar, Columbia University Data Science Institute (DSI).

Additional Information

Capacity is limited. Paid registration is required to attend.

Event Contact Information:
Python Data Wrangling Boot Camp