社会数据分析,Python应用入门
课程教师
Blake Miller
教师简介
Blake Miller is an Assistant Professor of Computational Social Science in the Methodology Department at the London School of Economics and Political Science. He received his PhD in Political Science and Scientific Computing from the University of Michigan in 2018 where he was also a graduate research affiliate in the Lieberthal-Rogel Center for Chinese Studies. Before coming to LSE, he was a Post-Doctoral Fellow at the Dartmouth College Program in Quantitative Social Science. Blake has also spent several years in Silicon Valley as an executive for tech start-up companies. For more information, please visit www.blakeapm.com.
课程内容
The massive amount of data available online contin- ues to increase the bounds of social scientific inquiry. Researchers in both academia and the private sector can gain a greater understanding of human behavior by analyzing the abundant social data stored online. To make use of these data, one must first master technical skills necessary to gather and process these data, which can be quite challenging to do properly.
The main goal of this course is to provide students with the necessary tools for the construction, processing, and cleaning of data found online. After taking this course, students will have mastered the requisite tools needed to construct datasets out of unstructured, semi-structured, and structured online data.
预期目标
to introduce students to important concepts and methodologies related to the management, collection, processing, and cleaning of data for social science and public policy research
to teach students practical concerns and best practices for data man- agement and data collection
to build foundational skills necessary to construct useful datasets for their research from unstructured, semi-structured, and secondary data
to build a roadmap for continued learning through promoting awareness of more advanced and specialized tools and where to look for problem- solving/reference.
课程安排
|
|
|
|
|
|
|
|
|
|
Textbooks and Course Materials
Readings for each session are detailed below.
Textbooks: Think Python: How to Think Like a Computer Scientist, 2nd Edition (English, Chinese)
Other resources:
Required Software
This course is taught in Python, using Python 3. You will need to have Python 3 installed on your computer and bring it to class each session. If you have not yet installed Python 3, you will need to do so. Please use the following resources for installing Python 3 on your machine:
For Windows users:
Install Sublime Text from the official Sublime Text website.
Install Python 3 (click “Latest Python3 Release”)
Open Git Bash and run python3 --version to verify Python 3 is accessible. If that does not work, try python --version.
For Mac users:
Open Terminal on Mac (Applications → Utilities → Terminal).
Install Xcode Command Line Tools by running xcode-select --install in Terminal.
Install homebrew by pasting the installation command from the website into Terminal. Alternatively download the installer here.
Install Sublime Text from the official Sublime Text website and follow the installation instructions.
Use homebrew to install Python 3 by running brew install python in Terminal.
Run python3 --version in Terminal to verify Python 3 is accessible.
Grading
Quizzes (60%, in class): There will four in-class quizzes. The quizzes will test knowledge of the material covered in the previous classes and readings.
Final Problem Set (40%, due July 19): You will have one final problem set to apply all of the things we learned. For the problem set, you will analyze a dataset we will discuss in class in the final lectures.