Prepare to become an education researcher capable of leveraging large-scale data using data science methods, improving education quality with digital learning platforms. 

Over the last 15 years, the educational data mining and learning analytics communities have developed a range of algorithms tailored to the data and R&D goals of digital learning platforms. The Data Science Methods for Digital Learning Platforms certificate provides learners with a breadth and depth of skills that expand beyond existing courses of its length in data science or education more generally. 

What Sets Us Apart

14 Learn from 14 leading researchers with expertise in different areas of learning analytics.
  Earn your certificate for free, thanks to support from the Institute of Education Sciences.
 A focused 16-week certificate that teaches career-useful skills.

About the Program

Working with the data from digital learning platforms requires a range of data science methods. Do you have the right tools to pursue your education research questions?  

Through the Data Science Methods for Digital Learning Platforms certificate program, you will learn to use both algorithms designed specifically for digital learning platforms and how to effectively apply algorithms developed for more general purposes to digital learning platform data. 

This program is open to US citizens or permanent residents only.

Duration of program
16 weeks; 6–8 hours per week asynchronously required

Certificate offered: Penn GSE certificate in Data Science Methods for Digital Learning Platforms 

Ideal candidates

  • Researchers (or aspiring researchers) with a clear intent to pursue education research questions
  • Researchers (or aspiring researchers) with a diverse range of academic backgrounds, including education, psychology, sociology, economics, and computer science
  • Researchers (or aspiring researchers) with professional experience in academia, industry, non-profits, school districts, and government


  • Introductory-level knowledge of statistics and/or quantitative research
  • Beginner knowledge of Python OR sufficient knowledge of R 
  • Must be a U.S. citizen or permanent resident
  • Online

In this 16-week program, participants will learn to conduct analysis on real-world data, working directly with authentic student interaction data that has not been cleaned. You’ll compare statistical and psychometric approaches to machine learning and data mining methods. You will also learn how to move beyond the well-structured use cases often utilized in introductory data science and statistics courses, which are often not representative of the data that comes from digital learning platforms. 

The program is online and asynchronous, with one optional synchronous and virtual “ask me anything” session with the instructors. Each module includes discussion-based interactions with peers and instructors and a project-based assignment for which fellows will be able to apply the skills they learn using authentic tools and datasets. The examples and assignments corresponding with each module will align with real challenges and scenarios common to digital learning platforms. Emphasis is given to identifying the development of relevant research questions and understanding the limitations and affordances that different types of digital learning platform data may provide in addressing these questions. 

This program is designed for individuals with a clear intent to pursue education research, and who have some degree of prior quantitative analysis background and either an intermediate-level understanding of statistics or psychometrics or a background in computer science. Twenty participants will be selected for the first cohort based on a competitive application process. 

There is no cost to participate in this program. 

Program Schedule

August 19 - 23, 2024Introduction, Challenges, and Framework
August 26 - 30, 2024Data and Measurement Validity
September 2 - 4, 2024Prediction Modeling and Metrics
September 9 - 13, 2024Feature Extraction and Feature Engineering
September 16 -20, 2024Data Visualization
September 23- 27, 2024Ethics, Equity, and Algorithmic Bias
September 30 - October 4, 2024Data Management and Database Access
October 7 - 11, 2024Knowledge Graphs
October 14 - 18, 2024Knowledge Tracing
October 21 -25, 2024Cluster Analysis
October 28 - November 1, 2024Network Analysis
November 4 - 8, 2024Sequential Pattern Mining and Temporal Analysis
November 11 - 15, 2024Causal Reasoning
November 18 - 22, 2024Neural Networks and Deep Learning
November 25 - 29, 2024Natural Language Processing
December 2 - 5, 2024Transformer and Foundation Models

Partners in Collaboration

Partners in Collaboration


This program and certificate are made possible through a partnership between Penn GSE, the University of Florida, and Digital Promise. 

Penn GSE Learning Analytics Logo
University of Florida Logo
Digital Promise Logo


This program is led by Principal Investigator (PI) Ryan Baker, Penn GSE; along with 5 co-PIs representing the collaborating institutions: Anthony Botelho, University of FloridaBodong Chen, Penn GSE;  Elizabeth Cloude, Tampere UniversityStefani Pautz Stephenson, Digital Promise; and Jeremy Roschelle, Digital Promise.


IES logo


This project is supported by the Institute of Education Sciences, U.S. Department of Education, through Grant R305B230007 to the University of Pennsylvania. The opinions expressed are those of the authors and do not represent views of the Institute or the U.S. Department of Education.

By IES requirements, only US citizens or permanent residents are eligible for this program. If you are ineligible for this program, you can find other training opportunities at

Our Faculty

Penn GSE Faculty Seth Akonor Adjei
Assistant Professor, Northern Kentucky University
Ph.D., Worcester Polytechnic Institute
Penn GSE Faculty Michael Ashenafi
Project Scientist, Penn Center for Learning Analytics
Ph.D., University of Trento
Penn GSE Faculty Ryan S. Baker
Ph.D., Carnegie Mellon University
Penn GSE Faculty Anthony Botelho
Assistant Professor, University of Florida
Ph.D., Worcester Polytechnic Institute
Penn GSE Faculty Alex J. Bowers
Professor of Education Leadership, Teachers College, Columbia University
Ph.D., Michigan State University
Penn GSE Faculty Wendy Chan
Assistant Professor
Ph.D., Northwestern University
Penn GSE Faculty Bodong Chen
Associate Professor
Ph.D., University of Toronto
Penn GSE Faculty Scott Crossley
Professor, Vanderbilt University
Ph.D., University of Memphis
Penn GSE Faculty Shamya Karumbaiah
Assistant Professor, University of Wisconsin–Madison
Ph.D., University of Pennsylvania
Penn GSE Faculty Walter Leite
Professor, University of Florida
Ph.D., University of Texas at Austin
Penn GSE Faculty Haiying Li
Program Manager
Ph.D., University of Memphis
Penn GSE Faculty Jaclyn Ocumpaugh
Associate Director, Penn Center for Learning Analytics
Ph.D., Michigan State University
Penn GSE Faculty Jinnie Shin
Assistant Professor, University of Florida
Ph.D., University of Alberta

Program Leadership

Ryan Baker  
University of Pennsylvania

Anthony Botelho
University of Florida

Bodong Chen
University of Pennsylvania

Elizabeth Cloude
Tampere University

Jeremy Roschelle
Digital Promise

Stefani Pautz Stephenson
Digital Promise