Overview
Over the last 15 years, the educational data mining and learning analytics communities have developed a range of algorithms tailored to the data and R&D goals of digital learning platforms. The Data Science Methods for Digital Learning Platforms certificate provides learners with a breadth and depth of skills that expand beyond existing courses of its length in data science or education more generally.
Through the Data Science Methods for Digital Learning Platforms certificate program, you will learn to use both algorithms designed specifically for digital learning platforms and how to effectively apply algorithms developed for more general purposes to digital learning platform data.
| This program is open to US citizens or permanent residents only. |
Program Details
-
November 17 Priority Deadline
-
Rolling Admissions After the Priority Deadline
-
January 12 Decisions Released
About the Program
In this 16-week program, participants will learn to conduct analysis on real-world data, working directly with authentic student interaction data that has not been cleaned. You’ll compare statistical and psychometric approaches to machine learning and data mining approaches, and learn how to move beyond the well-structured use cases often utilized in introductory data science and statistics courses that are often not representative of the data that comes from digital learning platforms.
The program is online and asynchronous, with one optional synchronous and virtual “Ask me anything (AMA)” session with the instructors. Each module includes discussion-based interactions with peers and instructors and a project-based assignment for which fellows will be able to apply the skills they learn using authentic tools and datasets. The examples and assignments corresponding with each module will align with real challenges and scenarios common to digital learning platforms, and emphasis is given to identifying the development of relevant research questions and understanding the limitations and affordances that different types of digital learning platform data may provide in addressing these questions.
This program is designed for individuals with a clear intent to pursue education research, and who have some degree of prior quantitative analysis background and either an intermediate-level understanding of statistics or psychometrics or a background in computer science. Fifty participants will be selected based on a competitive application process.
This is the fifth and final cohort of this program. While we encourage applications to be submitted by the priority application deadline, admissions will be reviewed on a rolling basis until the cohort is filled. Admission decisions are expected to be announced by January 12, 2026. There is no cost to participate in this program.
This program is best suited for researchers or aspiring researchers who are committed to exploring education-focused research questions and bring professional experience from academia, industry, non-profits, school districts, or government. Applicants should have a foundational understanding of statistics or quantitative research methods and beginner-level proficiency in Python or sufficient knowledge of R. A wide range of academic backgrounds is welcomed, including education, psychology, sociology, economics, and computer science.
U.S. citizenship or permanent residency is required.
| Dates | Topics |
|---|---|
| February 16 - 20, 2026 | Introduction, Challenges, and Framework |
| February 23 - 27, 2026 | Data and Measurement Validity |
| March 2 - 6, 2026 | Feature Extraction and Feature Engineering |
| Spring Break | |
| March 16 - 20, 2026 | Prediction Modeling and Metrics |
| March 23 - 27, 2026 | Causal Reasoning |
| March 30 - April 3, 2026 | Ethics, Equity, and Algorithmic Bias |
| April 6 - 10, 2026 | Cluster Analysis |
| April 13 - 17, 2026 | Network Analysis |
| April 20 - 24, 2026 | Sequential Pattern Mining and Temporal Analysis |
| April 27 - May 1, 2026 | Neural Networks and Deep Learning |
| May 4 - 8, 2026 | Natural Language Processing |
| May 11 - 15, 2026 | Transformer and Foundation Models |
| May 18 - 22, 2026 | Knowledge Tracing |
| May 25 - 29, 2026 | Knowledge Graphs |
| June 1 - 5, 2026 | Data Visualization |
| June 8 - 12, 2026 | Data Management and Database Access |
Our Partners
This program and certificate are made possible through a partnership between Penn GSE, the University of Florida, and Digital Promise.
This program is led by Principal Investigator (PI) Ryan Baker, University of Pennsylvania, along with 4 co-PIs representing the collaborating institutions: Anthony Botelho, University of Florida; Bodong Chen, Penn GSE; Elizabeth Cloude, Tampere University; and Stefani Pautz Stephenson, Digital Promise.
Jeremy Roschelle, Digital Promise, provides additional leadership and advising.
This project is supported by the Institute of Education Sciences, U.S. Department of Education, through Grant R305B230007 to the University of Pennsylvania. The opinions expressed are those of the authors and do not represent views of the Institute or the U.S. Department of Education.
By IES requirements, only US citizens or permanent residents are eligible for this program. If you are ineligible for this program, you can find other training opportunities in this list.
Take the Next Step
Simply click the Apply Now button to create an account and submit an application.
Instructors
Questions?
Do you have questions about this program? We're here to help.