# Welcome to STA 210!

STA 210 - Summer 2022

# Welcome

## Meet the instructor

• Third-year Ph.D. student, Department of Statistical Science
• Find out more at my personal website
• Pecan, One-year-old male Bernese Mountain Dog

## Meet the TA

• Joseph Ekpenyong
• Hold office hour and grade labs + HWs

## Meet each other!

• Name, year, major, hometown
• Any pets or favorite movie star?
• What do you hope to get out of this course?
• Anything else you want to share/ask?
05:00

## Check out Conversations

• Go to Conversations💬
• Answer the discussion question: What do you love most for summer (in one word) ?

# Regression analysis

## What is regression analysis

“In statistical modeling, regression analysis is a set of statistical processes for estimating the relationships among variables. It includes many techniques for modeling and analyzing several variables, when the focus is on the relationship between a dependent variable and one or more independent variables (or predictors). More specifically, regression analysis helps one understand how the typical value of the dependent variable (or ‘criterion variable’) changes when any one of the independent variables is varied, while the other independent variables are held fixed.”

Source: Wikipedia

## Course FAQ

• What background is assumed for the course? Introductory statistics or probability course.
• Will we be doing computing? Yes. We will use R.
• Will we learn the mathematical theory of regression? Yes and No. The course is primarily focused on application; however, we will discuss some of the mathematics of simple linear regression. If you want to dive into more of the mathematics, I can introduce some mathematics during labs.
• What is expected course load? Super intense! 25 hr per week. Deadlines on Mon/Wed/Fri/Sun.

## Course learning objectives

• Fit and evaluate linear and logistic regression models.
• Assess whether a proposed model is appropriate and describe its limitations.
• Use Quarto to write reproducible reports and GitHub for version control and collaboration.
• Communicate results from statistical analyses to a general audience.

# Course overview

## Homepage

yunranchen.github.io/STA210Summer/

• All course materials
• Links to Sakai, GitHub, RStudio containers, etc.
• Let’s take a tour!

## Course toolkit

All linked from the course website:

Important

Reserve an RStudio Container (titled STA 210) before lab !

## Activities: Prepare, Participate, Practice, Perform

• Prepare: Introduce new content and prepare for lectures by completing the readings (and sometimes watching the videos)
• Participate: Attend and actively participate in lectures and labs, office hours, team meetings
• Practice: Practice applying statistical concepts and computing with application exercises during lecture, graded for completion
• Perform: Put together what you’ve learned to analyze real-world data
• Lab assignments x 7 (team-based)
• Homework assignments x 5 (individual)
• Three take-home exams (individual)
• Term project presented during the final exam period (team-based)

• Labs, HWs, and AEs: Due on Mon/Wed/Fri/Sun 11:59pm.
• Exams: Exam review Friday in class, exam posted Friday morning 9:00 am, due Monday 11:59pm.
• Project: Deadlines throughout the semester, with some lab and lecture time dedicated to working on them, and most work done in teams outside of class

## Teams

• Team assignments
• Assigned by me (Weekly vs Whole semester ? )
• Application exercises, labs, and project
• Peer evaluation during teamwork and after completion
• Expectations and roles
• Everyone is expected to contribute equal effort
• Everyone is expected to understand all code turned in
• Individual contribution evaluated by peer evaluation, commits, etc.

Category Percentage
Application exercises 3%
Homework 35% (7% x 5)
Project 15%
Lab 14% (2.5% x 6)
Exam 01 10%
Exam 02 10%
Exam 03 10%
Teamwork 2%

See course syllabus for how the final letter grade will be determined.

## Support

• Attend office hours
• Reserve email for questions on personal matters and/or grades
• Read the course support page

## Announcements

• Posted on Sakai (Announcements tool) and sent via email, be sure to check both regularly
• I’ll assume that you’ve read an announcement by the next “business” day
• Go to website to check what you need to do to prepare, practice, and perform

## Diversity + inclusion

It is my intent that students from all diverse backgrounds and perspectives be well-served by this course, that students’ learning needs be addressed both in and out of class, and that the diversity that the students bring to this class be viewed as a resource, strength and benefit.

• If you feel like your performance in the class is being impacted by your experiences outside of class, please don’t hesitate to come and talk with me.
• I come from a different cultural background, and am still in the process of learning about diverse perspectives and identities. If something was said in class (by anyone) that made you feel uncomfortable, please talk to me about it.

## Accessibility

• The Student Disability Access Office (SDAO) is available to ensure that students are able to engage with their courses and related assignments.

• I am committed to making all course materials accessible and I’m always learning how to do this better. If any course component is not accessible to you in any way, please don’t hesitate to let me know.

# Course policies

## COVID policies

• Wear a mask at all times!

## Late work, waivers, regrades policy

• We have policies!
• Read about them on the course syllabus and refer back to them when you need it

## Collaboration policy

• Only work that is clearly assigned as team work should be completed collaboratively.

• Homeworks must be completed individually. You may not directly share answers / code with others, however you are welcome to discuss the problems in general and ask for advice.

• Exams must be completed individually. You may not discuss any aspect of the exam with peers. If you have questions, post as private questions on the course forum, only the teaching team will see and answer.

## Sharing / reusing code policy

• We are aware that a huge volume of code is available on the web, and many tasks may have solutions posted

• Unless explicitly stated otherwise, this course’s policy is that you may make use of any online resources (e.g. RStudio Community, StackOverflow, etc.) but you must explicitly cite where you obtained any code you directly use or use as inspiration in your solution(s).

• Any recycled code that is discovered and is not explicitly cited will be treated as plagiarism, regardless of source

To uphold the Duke Community Standard:

• I will not lie, cheat, or steal in my academic endeavors;

• I will conduct myself honorably in all my endeavors; and

• I will act if the Standard is compromised.

## Most importantly!

Ask if you’re not sure if something violates a policy!

# Making STA 210 a success

## Five tips for success

1. Complete all the preparation work before class.
4. Do the homework and lab.
5. Don’t procrastinate and don’t let a week pass by with lingering questions.

## Learning during a pandemic

I want to make sure that you learn everything you were hoping to learn from this class. If this requires flexibility, please don’t hesitate to ask.

• You never owe me personal information about your health (mental or physical) but you’re always welcome to talk to me. If I can’t help, I likely know someone who can.
• I want you to learn lots of things from this class, but I primarily want you to stay healthy, balanced, and grounded during this crisis.