Syllabus
Click here to download a PDF copy of the syllabus.
Course info
Day | Time | Location | |
---|---|---|---|
Lectures | Weekdays | 11:00 am - 12:15 pm | Old Chemistry 101 |
Labs | Mon and Wed | 2:00 pm - 3:15 pm | Old Chemistry 101 |
Yunran’s OH | Thursdays | 2:00 pm - 3:00 pm | Old Chemistry 025 |
Mondays | 7:00 pm - 8:00 pm | Zoom | |
Joseph’s OH | Tue and Thur | 8:00 pm - 9:00 pm | Zoom |
Learning objectives
By the end of the semester, you will be able to…
- analyze real-world data to answer questions about multivariable relationships.
- fit and evaluate linear and logistic regression models.
- assess whether a proposed model is appropriate and describe its limitations.
- use Quarto to write reproducible reports and GitHub for version control and collaboration.
- communicate results from statistical analyses to a general audience.
Community
Duke Community Standard
As a student in this course, you have agreed to uphold the Duke Community Standard as well as the practices specific to this course.
Inclusive community
It is my intent that students from all diverse backgrounds and perspectives be well-served by this course, that students’ learning needs be addressed both in and out of class, and that the diversity that the students bring to this class be viewed as a resource, strength, and benefit. It is my intent to present materials and activities that are respectful of diversity and in alignment with Duke’s Commitment to Diversity and Inclusion. Your suggestions are encouraged and appreciated. Please let me know ways to improve the effectiveness of the course for you personally, or for other students or student groups.
Furthermore, I would like to create a learning environment for my students that supports a diversity of thoughts, perspectives and experiences, and honors your identities. To help accomplish this:
- If you feel like your performance in the class is being impacted by your experiences outside of class, please don’t hesitate to come and talk with me. If you prefer to speak with someone outside of the course, your academic dean is an excellent resource.
- I (like many people) am still in the process of learning about diverse perspectives and identities. If something was said in class (by anyone) that made you feel uncomfortable, please let me or a member of the teaching team know.
Accessibility
If there is any portion of the course that is not accessible to you due to challenges with technology or the course format, please let me know so we can make appropriate accommodations.
The Student Disability Access Office (SDAO) is available to ensure that students are able to engage with their courses and related assignments. Students should be in touch with the Student Disability Access Office to request or update accommodations under these circumstances.
Where to get help
- If you have a question during lecture or lab, feel free to ask it! There are likely other students with the same question, so by asking you will create a learning opportunity for everyone.
- The teaching team is here to help you be successful in the course. You are encouraged to attend office hours to ask questions about the course content and assignments. Many questions are most effectively answered as you discuss them with others, so office hours are a valuable resource. Please use them!
- Outside of class and office hours, any general questions about course content or assignments should be posted on the course forum Conversations. There is a chance another student has already asked a similar question, so please check the other posts in Conversations before adding a new question. If you know the answer to a question posted in the discussion forum, I encourage you to respond!
- Emails should be reserved for questions not appropriate for the public forum. If you email me, please include “STA 210” in the subject line. Barring extenuating circumstances, I will respond to STA 210 emails within 48 hours Monday - Friday. Emails sent Friday evening - Sunday may be answered on Monday.
Check out the Support page for more resources.
Textbooks
While there is no official textbook for the course, we will be assigning readings from the following textbooks.
R for Data Science by Garret Grolemund and Hadley Wickham
Introduction to Modern Statistics by Mine Çetinkaya-Rundel and Johanna Hardin
Tidy modeling with R by Max Kuhn and Julia Silge
Beyond Multiple Linear Regression by Paul Roback and Julie Legler
Lectures and labs
The goal of both the lectures and the labs is for them to be as interactive as possible. My role as instructor is to introduce you new tools and techniques, but it is up to you to take them and make use of them. A lot of what you do in this course will involve writing code, and coding is a skill that is best learned by doing. Therefore, as much as possible, you will be working on a variety of tasks and activities throughout each lecture and lab. You are expected to attend all lecture and lab sessions and meaningfully contribute to in-class exercises and discussion. Additionally, some lectures will feature application exercises that will be graded. In addition to application exercises will be periodic activities help build a learning community. These will be short, fun activities that will help everyone in the class connect throughout the semester.
You are expected to bring a laptop to each class so that you can take part in the in-class exercises. Please make sure your laptop is fully charged before you come to class as the number of outlets in the classroom will not be sufficient to accommodate everyone. More information on loaner laptops can be found here.
Teams
You will be assigned to a team at the beginning of each week. You are encouraged to sit with your teammates in lecture and you will also work with them in the lab sessions. All team members are expected to contribute equally to the completion of the labs and project and you will be asked to evaluate your team members throughout the semester. Failure to adequately contribute to an assignment will result in a penalty to your mark relative to the team’s overall mark.
You are expected to make use of the provided GitHub repository as their central collaborative platform. Commits to this repository will be used as a metric (one of several) of each team member’s relative contribution for each project.
Assessment
Assessment for the course is comprised of six components: application exercises, homework assignments, labs, exams, projects, and teamwork.
Application exercises
Parts of some lectures will be dedicated to working on Application Exercises (AEs). These exercises which give you an opportunity to practice apply the statistical concepts and code introduced in the readings and lectures. These AEs are released on Monday, Wednesday and Friday. They are due within two days of the corresponding lecture period. Specifically, AEs from Monday and Wednesday lectures are due Wednesday and Friday by 11:59 pm ET respectively. AEs from Friday lectures are due Sunday by 11:59 pm ET.
Because these AEs are for practice, they will be graded based on completion, i.e., a good-faith effort has been made in attempting all parts. Successful on-time completion of at least 80% of AEs will result in full credit for AEs in the final course grade.
Labs
In labs, you will apply the concepts discussed in lecture to various data analysis scenarios, with a focus on the computation. Most lab assignments will be completed in teams, and all team members are expected to contribute equally to the completion of each assignment. You are expected to use the team’s GitHub repository on the course’s GitHub organization as the central platform for collaboration. Commits to this repository will be used as a metric of each team member’s relative contribution for each lab, and there will be periodic peer evaluation on the team collaboration. Lab assignments will be completed using Quarto, correspond to an appropriate GitHub repository, and submitted for grading in Gradescope.
Homework
In homework, you will apply what you’ve learned during lecture and lab to complete data analysis tasks. You may discuss homework assignments with other students; however, homework should be completed and submitted individually. Similar to lab assignments, homework must be typed up using Quarto and GitHub and submitted as a PDF in Gradescope.
One homework assignment will be dedicated to a statistics experience. The statistics experience is an opportunity to engage with statistics and data science outside of the classroom through podcasts, books, seminars, data analysis competitions, and other activities. As you complete these experiences, the goal is to consider how the material you’re learning in the course connects with society more broadly.
Exams
There will be three, take-home, open-note exams. Through these exams you have the opportunity to demonstrate what you’ve learned in the course thus far. The exams will focus on the conceptual understanding of the content, and they may also include small analysis and computational tasks. The content of the exam will be related to the content in the prepare, practice, and perform assignments. More details about the exams will be given during the semester.
Project
The purpose of the project is to apply what you’ve learned throughout the semester to analyze an interesting, data-driven research question. The project will be completed with your teams, and each team will present their work in class and in writing during the final exam period. More information about the project will be provided during the semester.
Grading
The final course grade will be calculated as follows:
Category | Percentage |
---|---|
Application exercises | 3% |
Homework | 35% (7% x 5) |
Project | 15% |
Lab | 14% (2.5% x 6) |
Exam 01 | 10% |
Exam 02 | 10% |
Exam 03 | 10% |
Teamwork | 2% |
The final letter grade will be determined based on the following thresholds:
Letter Grade | Final Course Grade |
---|---|
A | >= 93 |
A- | 90 - 92.99 |
B+ | 87 - 89.99 |
B | 83 - 86.99 |
B- | 80 - 82.99 |
C+ | 77 - 79.99 |
C | 73 - 76.99 |
C- | 70 - 72.99 |
D+ | 67 - 69.99 |
D | 63 - 66.99 |
D- | 60 - 62.99 |
F | < 60 |
Course policies
Academic integrity
TL;DR: Don’t cheat!
All students must adhere to the Duke Community Standard (DCS): Duke University is a community dedicated to scholarship, leadership, and service and to the principles of honesty, fairness, and accountability. Citizens of this community commit to reflect upon these principles in all academic and non-academic endeavors, and to protect and promote a culture of integrity.
To uphold the Duke Community Standard:
Students affirm their commitment to uphold the values of the Duke University community by signing a pledge that states:
- I will not lie, cheat, or steal in my academic endeavors;
- I will conduct myself honorably in all my endeavors;
- I will act if the Standard is compromised
Regardless of course delivery format, it is your responsibility to understand and follow Duke policies regarding academic integrity, including doing one’s own work, following proper citation of sources, and adhering to guidance around group work projects. Ignoring these requirements is a violation of the Duke Community Standard. If you have any questions about how to follow these requirements, please contact Jeanna McCullers (jeanna.mccullers@duke.edu), Director of the Office of Student Conduct.
Collaboration policy
Only work that is clearly assigned as team work should be completed collaboratively.
- The homework assignments must be completed individually and you are welcomed to discuss the assignment with classmates at a high level (e.g., discuss what’s the best way for approaching a problem, what functions are useful for accomplishing a particular task, etc.). However you may not directly share answers to homework questions (including any code) with anyone other than myself and the teaching assistants.
- For the projects, collaboration within teams is not only allowed, but expected. Communication between teams at a high level is also allowed however you may not share code or components of the project across teams.
Policy on sharing and reusing code
I am well aware that a huge volume of code is available on the web to solve any number of problems. Unless I explicitly tell you not to use something, the course’s policy is that you may make use of any online resources (e.g.RStudio Community, StackOverflow) but you must explicitly cite where you obtained any code you directly use (or use as inspiration). Any recycled code that is discovered and is not explicitly cited will be treated as plagiarism. On individual assignments you may not directly share code with another student in this class, and on team assignments you may not directly share code with another team in this class.
Late work policy
The due dates for assignments are there to help you keep up with the course material and to ensure the teaching team can provide feedback within a timely manner. We understand that things come up periodically that could make it difficult to submit an assignment by the deadline. Note that the lowest homework and lab assignment will be dropped to accommodate such circumstances.
Homework and labs may be submitted up to 1 day late. There will be a 5% deduction for each 6-hour period the assignment is late.
There is no late work accepted for application exercises, since these are designed to help you prepare for labs and homework.
The late work policy for exams will be provided with the exam instructions.
The late work policy for the project will be provided with the project instructions.
Regrade request policy
Regrade requests must be submitted on Gradescope within 24 hours of when an assignment is returned. Regrade requests will be considered if there was an error in the grade calculation or if you feel a correct answer was mistakenly marked as incorrect. Requests to dispute the number of points deducted for an incorrect response will not be considered. Note that by submitting a regrade request, the entire question will be graded which could potentially result in losing points.
No grades will be changed after the final project presentations.
Attendance policy
Responsibility for class attendance rests with individual students. Since regular and punctual class attendance is expected, students must accept the consequences of failure to attend. More details on Trinity attendance policies are available here.
However, there may be many reasons why you cannot be in class on a given day, particularly with possible extra personal and academic stress and health concerns. If you miss a lecture, make sure to review the material before the next class session. Lab time is dedicated to working on your lab assignments and collaborating with your teammates on your project. If you miss a lab session, make sure to communicate with your team about how you can make up your contribution. Given the technologies we use in the course, this is straightforward to do asynchronously. If you know you’re going to miss a lab session and you’re feeling well enough to do so, notify your teammates ahead of time. Overall these policies are put in place to ensure communication between team members, respect for each others’ time, and also to give you a safety net in the case of illness or other reasons that keep you away from attending class.
Inclement weather policy
In the event of inclement weather or other connectivity-related events that prohibit class attendance, I will notify you how we will make up missed course content and work. This might entail holding the class on Zoom synchronously or watching a recording of the class.
Important dates
- May 11: Classes begin (Monday meeting schedule).
- May 12: Regular class meeting schedule begins.
- May 13: Drop/add for term 1 ends.
- May 30: Memorial Day holiday. No classes are held.
- June 8: Last day to withdraw with W.
- June 15: Last lab.
- June 17: Classes end.
- June 20: Juneteenth holiday. No classes/office hour are held.
- June 21: Reading period.
- June 22: Final project presentation. Due date for final project report.
Click here for the full Duke academic calendar.