Location | Leacock 917 |
Time | Winter 2025, Mon and Wed 8:35–9:55am |
Instructor |
Peter McMahan
(peter.mcmahan@mcgill.ca) |
TA | Chris Borst |
Office hours | Mondays 11:00–12:00 (Leacock 727 or online by appointment) |
Work sessions | Mondays 10:00–11:00am (Location TBD) |
Syllabus | https://soci620.netlify.app/ |
As the second of two courses in the quantitative methods sequence, this class will build insight into the underlying logic of standard multivariate regression models and introuce students to statistical techniques that extend and diverge from those models. To this end, the course will have two main goals. First, students will become familiar with a range of quantitative methods common in social science research. Methodological topics will include generalized linear models for predicting categorical, ordered, and count data, multilevel/hierarchical linear models, and strategies for analyzing time-series and panel data. Students will learn to critically interpret these methods as they are used in the literature, and to utilize the methods for their own research.
The second (and perhaps the more central) goal of the course will be to provide students with an overarching framework to understand not just the methods we discuss, but the models and techniques they may encounter elsewhere in the literature. Instruction will therefore focus on a probabilistic interpretation of statistical models. In addition to fostering a strong understanding of statistical dependence and parametric estimation, this approach will unify the methods we cover and enable students to build interpretable, theory-driven models of their own.
Students are expected to be familiar with the readings, engage during class (in-class and/or online), complete assignments, and prepare an independent research project. Students must have taken SOCI 514 or a similar class, as a basic understanding of multivariate regressions is assumed. While knowledge of the R statistical language is not assumed, familiarity with at least one advanced statistical or general programming language (e.g. R, Stata, Python, Matlab, …) will be very helpful.
The scheduled classes for the course will be hybrid lectures, discussions, and instructor-led lab sessions. It is vital that students attend class regularly, having completed the readings and prepared to engage with the topics covered.
Slides and code will be made available before class. Lectures will be live-streamed on Teams (when necessary), and recordings will be posted shortly after class.
In addition to the scheduled classes, we will have a weekly work session (location to be determined). These sessions will be led by the T.A. as a space to practice techniques, raise questions and concerns, and discuss course content with one another. Attendance at these sessions is optional but strongly encouraged.
We will be working with data and learning analysis and visualization in-class, so students must bring a laptop computer with them. Mobile devices such as tablets and phones, even with an external keyboard, will not be sufficient. If you do not have access to a laptop please talk to me as soon as possible so we can work out a way for you to participate.
We will be using the R statistical language and software for data processing, statistical estimation, and visualization in this class. It is recommended that students install the RStudio graphical interface, which I will be using for demonstration in class.
We will use the second edition of the textbook Statistical Rethinking by Richard McElreath for the course (McElreath 2020). The book is available as an ebook through the library website. If you are unable to access the textbook, please let me know as soon as possible.
There will be five worksheets due throughout the semester. These are intended to help you learn to use the methods we discuss in R and to give you practice in interpreting statistical models.
Each worksheet will be structured with a provided R Markdown document. R Markdown provides a way to mix code (in the R statistical language) and prose into a single document. Worksheets will be distributed ahead of Monday labs, and will be due Wednesday of the following week. Students can (and are encouraged to!) work together and consult one another on assignments, but each student will be responsible for completing their own worksheet by the due date.
Worksheets will be evaluated using peer assessment. After the deadline for a worksheet, each student will be responsible for evaluating an anonymized version of one of their classmate’s worksheets. The peer assessment is intended to expose students to different programming styles and interpretations of the data, and to encourage the production of readable R code.
Each student will finish an independent research project by the end of the semester. These projects will be empirical, scholarly analyses, including a source of data, a well formed research question, a motivated statistical analysis, and a thorough interpretation of the results. Ideally, the projects will be related to work students are doing outside of class. Projects that represent a piece of a student’s broader research agenda are encouraged. The projects will be graded on the basis of four required assessments:
Each student should arrange a brief meeting with me early in the term to discuss ideas for their research project and the appropriateness for the course.
The evaluation components and due dates for this course are strict. If outside circumstances will make it difficult to meet a requirement please raise the issue with me as soon as possible so we can find a solution. Regular absences will affect your ability to do well on assignments and the final project.
Note: In the event of extraordinary circumstances beyond the University’s control, the content and/or evaluation scheme in this course is subject to change
Worksheet 1 | Wed, Jan 22 | 6% of final grade |
WS1 peer assessment | Mon, Jan 27 | 3% of final grade |
Worksheet 2 | Wed, Feb 5 | 6% of final grade |
WS2 peer assessment | Mon, Feb 10 | 3% of final grade |
Worksheet 3 | Wed, Feb 19 | 6% of final grade |
WS3 peer assessment | Mon, Feb 24 | 3% of final grade |
Worksheet 4 | Wed, Mar 12 | 6% of final grade |
WS4 peer assessment | Wed, Mar 19 | 3% of final grade |
Worksheet 5 | Wed, Mar 26 | 6% of final grade |
WS5 peer assessment | Mon, Mar 31 | 3% of final grade |
Project précis | Thu, Mar 9 | 5% of final grade |
Project proposal | Thu, Mar 23 | 10% of final grade |
Project presentation | Thu, Apr 6 | 15% of final grade |
Project writeup | Fri, Apr 21 | 25% of final grade |
Students who need accommodation or who are having trouble accessing any aspect of the course may contact me directly. I will make every effort to accommodate individual situations, including religious, medical, or other personal circumstances.
Students with disabilities or otherwise in need of formal accommodation are encouraged to contact the Office for Student Accessibility & Achievement (formerly Office for Students with Disabilities: https://www.mcgill.ca/access-achieve/, phone 514-398-6009).
Les étudiants qui ont besoin d’un accommodation ou qui ont des difficultés à accéder à un aspect du cours peuvent me contacter directement. Je ferai tout mon possible pour tenir compte des circonstances individuelles, y compris des circonstances religieuses, médicales ou autres.
Les étudiants handicapés ou ayant besoin d’un aménagement formel sont encouragés à contacter le Service étudiant d’accessibilité et d’aide à la réussite (https://www.mcgill.ca/access-achieve/fr, téléphone 514-398-6009).
McGill University values academic integrity. Therefore, all students must understand the meaning and consequences of cheating, plagiarism and other academic offences under the Code of Student Conduct and Disciplinary Procedures (see http://www.mcgill.ca/students/srr/honest/ for more information).(approved by Senate on 29 January 2003)
L’université McGill attache une haute importance à l’honnêteté académique. Il incombe par conséquent à tous les étudiants de comprendre ce que l’on entend par tricherie, plagiat et autres infractions académiques, ainsi que les conséquences que peuvent avoir de telles actions, selon le Code de conduite de l’étudiant et des procédures disciplinaires (pour de plus amples renseignements, veuillez consulter le site http://www.mcgill.ca/students/srr/honest/).
In accord with McGill University’s Charter of Students’ Rights, students in this course have the right to submit in English or in French any written work that is to be graded. (approved by Senate on 21 January 2009)
Conformément à la Charte des droits de l’étudiant de l’Université McGill, chaque étudiant a le droit de soumettre en français ou en anglais tout travail écrit devant être noté (sauf dans le cas des cours dont l’un des objets est la maîtrise d’une langue).
The use of generative artificial intelligence tools or apps for assignments in this course, including tools like ChatGPT, Apple Intelligence, Gemini, Claude, Microsoft Copilot and other AI writing or coding assistants, is prohibited. While the use of grammar- and spell-checking software is permitted, products and services that rewrite, summarize, paraphrase, or otherwise substantially change input text, including Grammarly’s “rewrite” and “paraphrase” features and Apple’s “writing tools”, are prohibited.
Assignments that are submitted late (without prior approval for an extension) will be assessed with the following penalties: 1. 15 percentage points deducted from submissions up to 24 hours late 2. 10 percentage points for each additional 24 hours (or portion thereof) late
In addition to the above penalties, late work with a peer assessment component will be assessed solely by the instructor and teaching assistants. In these cases, students who submit late may also be unable to provide assessments to peers, which may further affect their grade.
Instructors and teaching assistants take the marking of assignments very seriously, and we work diligently to be fair, consistent, and accurate. Nonetheless, mistakes and oversights occasionally happen. If you believe that to be the case, you must adhere to the following rules:
Unless otherwise noted, this website and all co-hosted resources (e.g., linked pages, slides) are licensed under CC BY-NC-SA 4.0
All other instructor-generated course materials (e.g., handouts, notes, summaries, exam questions, and lecture recordings) are protected by law and may not be copied or distributed in any form or in any medium without explicit permission of the instructor. Note that copyright infringements can be subject to follow-up by the University under the Code of Student Conduct and Disciplinary Procedures.
Background: parametric probability models
(McElreath 2020, Ch. 1)
(McElreath 2020, Ch. 2)
(McElreath 2020, Ch. 3)
(McElreath 2020, secs. 4.1–4.3)
Linear models and model checking
(McElreath 2020, Sec 4.4-4.7)
(McElreath 2020, Ch. 5)
Creating indicators and transforming variables
(McElreath 2020, Ch. 6)
Prior and posterior predictive plots
Deviance and information criteria
(McElreath 2020, Ch. 7)
Generalized linear models
Intercept-only logistic regression
(McElreath 2020, Ch. 10 and Section 11.1)
Prior-predictive simulation
Poisson regression in R using glm
and
brm
(McElreath 2020, sec. 11.2)
Overdispersed and zero-inflated Poisson regressions in R
(McElreath 2020, secs. 12.1–12.2)
Multinomial regression in R
(McElreath 2020, secs. 11.3–11.5)
Ordered logistic regression in R
(McElreath 2020, secs. 12.2–12.5)
Ordinal regressions (Michael Betancourt 2019)
Complications in data and estimation
Common convergence issues with lme4
and
brms
(McElreath 2020, Ch. 9)
Imputing missing data
(McElreath 2020, Ch. 15)
Incorporating weights
Multilevel models
Partial pooling of averages
(McElreath 2020, sec. 13.1)
Random intercepts in R
(McElreath 2020, sec. 13.2)
Simple random slopes in R
(McElreath 2020, sec. 13.4)
Specifying LKJ priors in brms
(McElreath 2020, secs. 14.1–14.2)
Two-level model with lme4
and brms
(McElreath 2020, secs. 14.3–14.4)
GMLM in with lme4
and brms
Building more complex models
(McElreath 2020, sec. 16.4)
Presentations
Student presentations