class: center, middle, inverse, title-slide # Welcome to STA 199! ### Prof. Maria Tackett --- class: center, middle # Welcome! --- ## What is data science? *"Data science is a concept to unify statistics, data analysis, machine learning and their related methods in order to understand and analyze actual phenomena with data. It employs techniques and theories drawn from many fields within the context of <font class="vocab">mathematics, statistics, information science, and computer science</font>."* .pull-right[ [-Wikipedia](https://en.wikipedia.org/wiki/Data_science) ] --- ## Course objectives -- - Learn to explore, visualize, and analyze data in a *reproducible* and *shareable* manner -- - Gain experience in data wrangling, exploratory data analysis, predictive modeling, and data visualization -- - Work on problems and case studies inspired by and based on *real-world questions* and data -- - Learn to effectively communicate results through written assignments and final project presentation --- ## Some of what you will learn .pull-left[ - Fundamentals of `R` - Data visualization and wrangling with `ggplot2` and `dplyr` from the `tidyverse` - Web scraping - Web based applications with `RShiny` - Spatial data visualization ] .pull-right[ - Data types and functions - Version control with `GitHub` - Reproducible reports with `R Markdown` - Regression and classification - Statistical inference ] --- ## Where to find information .vocab[Course Website]: [sta199-fa20-002.netlify.app/](https://sta199-fa20-002.netlify.app/) - Central hub for the course! .vocab[Sakai] - Gradebook - Class videos - Link to class meetings on Zoom .vocab[GitHub]: [https://github.com/sta199-fa20-002](https://github.com/sta199-fa20-002) - Assignment repos (we'll talk more about that later) --- class: middle, center ## Course strcture --- ## Class meetings -- .vocab[Lecture] - Focus on concepts behind data analysis - Has two components: - **Lecture content videos** to watch before we meet - **Live lecture session** to ask questions and apply concepts from videos -- .vocab[Lab] - Focus on computing using R `tidyverse` syntax - Apply concepts from lecture to case study scenarios - Work on labs individually or in teams of 3 - 4 --- ## Textbooks and readings - [OpenIntro Statistics](https://www.openintro.org/stat/textbook.php?stat_book=os) - [R for Data Science](http://r4ds.had.co.nz/) - Occasional articles and other readings posted on the course website --- ## Activities and assessments -- - .vocab[Homework]: Individual assignments combining conceptual and computational skills. -- - .vocab[Labs]: Individual or team assignments focusing on computational skills. -- - .vocab[Exams]: Two take-home exams. -- - .vocab[Final Project]: Team project presented during the final exam period. -- - .vocab[Application Exercises]: Exercises worked on during the live lecture session. -- - .vocab[Statistics Experiences]: Engage with statistics outside of the classroom and reflect on your experience. --- ## Where to find help in the course -- - Attend .vocab[Office hours] to meet with a member of the teaching team. -- - Use .vocab[Piazza] for general questions about course content and/or assignments, since other students may benefit from the response. -- - Use email for questions regarding personal matters and/or grades. --- ## Academic Resource Center The [Academic Resource Center (ARC)](https://arc.duke.edu/) offers free services to all students during their undergraduate careers at Duke. Services include - Learning Consultations - Peer Tutoring and Study Groups - ADHD/LD Coaching, Outreach Workshops - and more. Contact the ARC at [ARC@duke.edu](mailto:arc@duke.edu) or call 919-684-5917 to schedule an appointment. --- ## CAPS [Duke Counseling & Psychological Services (CAPS)](https://studentaffairs.duke.edu/caps) helps Duke Students enhance strengths and develop abilities to successfully live, grow and learn in their personal and academic lives. Services include - brief individual and group counseling - couples counseling - outreach to student groups - and more... Services provided via Telehealth. To initiate services, you can contact their front desk at 919-660-1000. --- class: middle, center [sta199-fa20-002.netlify.app](https://sta199-fa20-002.netlify.app)