Fall 2023 - 61344 - PA 397C - Data Management and Research Life Cycle

Data Management and Research Life Cycle



Course Description

  • Instructor: Prof. Ji Ma
  • Time: 2023 Fall
  • Location: TBD (in-person only).
  • Office hour: TBD.

This class equips thoughtful thinkers with powerful data science skills. You will learn how to manage and work with complex and big datasets in social science research, particularly in policy and nonprofit studies. You are expected to learn the following skills and respond to “big questions” that have social importance: 1) Understand the structure of data and how to work with big and complex datasets; 2) Understand the workflows of acquiring and managing data; 3) Able to conduct data-intensive and replicable social science research.

Programming is not a prerequisite of this class, and you will have a chance to develop your own programming skill set. I primarily uses Python for data work and Stata for statistical analysis, but you are welcome to use any programming language or software as long as you can complete the assignments.

Course roadmap

Course roadmap

As a student of this course, you have free access to DataCamp.


See a list of Assignments

  • A >= 98%, A- >= 90
  • B+ >= 87%, B >= 83%, B- >= 80%
  • C+ >= 77%, C >= 73%, C- >= 70%
  • D+ >= 67%, D >= 63%, D- >= 60%


This syllabus and all course content created by the instructor, TA, and students are licensed under the Creative Commons Attribution-NonCommercial 4.0 International License.

Instruction Mode