Computation and Visualization for Analytics
IE6600 • Spring 2023 • Northeastern University • Seattle
IE6600 covers basic of the R, and R Shiny for data preprocessing, and visualization. It introduces students to static and interactive visualization, dashboard, and platform that reveal information, patterns, interactions, and comparisons by paying attention to details such as color encoding, a shape selection, spatial layout, and annotation. Based on these fundamentals of analytical and creative thinking, the course then focuses on data visualization techniques and the use of the most current popular software tools that support data exploration, analytics-based storytelling and knowledge discovery, and decision-making in engineering, healthcare operations, manufacturing, and related applications.
- Class: Tuesday, Friday 09:50 - 11:30 AM (PT)
- Office hour: Wed 9:00 AM - 10:00 AM (PT) on Zoom
- Location: Room 305, 225 Terry Ave
- Dates: 01/09/2023 – 04/14/2023
- Administration platform: All questions, discussion or notes will be only posted via Campuswire. See sign up link on Canvas.
- HW submission: Canvas
- Course notes: Notes
- Teaching style: There’s no speed limit.
-
Guidelines:
- Please see the post How to ask a good question before posting any questions or discussions.
- How to properly cite references in you HWs Purdue OWL Research and Citation Resources.
- Instructor Zhenyuan Lu
- Email:
- Office hours: Wed 9:00 AM - 10:00 AM (PT) on Zoom
- TA Sharayu Thosar
- Email:
- Office hours: Thur 1:00 PM - 2:00 PM (PT) on Zoom
- IA Wenwei Li
- Email:
- Office hours: During the class meeting
Table of contents
- Course goals
- Textbooks
- R-related Materials
- Schedule
- Administration
- Homework
- Final Projects
- Honor Code
- Title IX
- Accommodations for Students with Disabilities
- Take care of yourself
- Course Evaluation
- Hall of Fame
Course goals
This section of IE6600 follows the flipped classroom model, and delivers all course materials online. The schedule below shows the due dates for all modules.
Textbooks
The required textbook:
- R For Data Science (R4DS), Wickham, Hadley, and Garrett Grolemund
The required tutorials:
- Shiny tutorial, R Shiny
Additional textbooks:
- R For Everyone (R4E), Lander, Jared P.
- R Markdown (RMD), Xie, Yihui, et al.
R-related Materials
- R Graphics Cookbook (RGC), Chang, Winston.
- Advanced R (ADR), Wickham, Hadley.
- R Packages (RPK), Wickham, Hadley.
- Text Mining with R (TM), Silge, Julia, and David Robinson.
Schedule
(subject to change)
Date | Lecture | Content | Logistics | |
---|---|---|---|---|
Module 1: Introduction | ||||
1/10 |
Course introduction and expectations
|
|
||
1/13 |
Basic of R I
[slides] [slides2] |
|
||
1/17 |
Basic of R II
[slides] [slides2] |
|
||
1/20 |
R functions and the grammar of visualization I
[slides] |
|
HW0 out |
|
1/24 |
R functions and the grammar of visualization II
[slides] |
|
||
Module 2: Basic Visualization and Data Engineering | ||||
1/27 |
Data Visualization Concepts I
[slides] [github] |
HW0 due 1/27 @ 11:59pm PT |
||
1/30, Last day to drop a full-semester summer class without a W grade | ||||
1/31 |
Data Visualization Concepts II
[slides] [github] |
Group arrangement due 1/30 @ 11:59pm PT HW1 out |
||
2/3 |
Basic data visualization in R I
[slides] |
|
||
2/7 |
Basic data visualization in R II
[slides] |
|
HW1 due 2/6 @ 11:59pm PT |
|
2/10 |
Data transformation with dplyr I
[slides] |
|||
2/14 |
Data transformation with dplyr II
[slides] |
HW2 due 2/13 @ 11:59pm PT |
||
2/17 |
Data wrangling with tibbles, readr and tidyr I Optional: Data wrangling with stringr, forcats [slides] [slides2] |
|
||
2/21 |
Data wrangling with tibbles, readr and tidyr II Optional: Data wrangling with stringr, forcats [slides] [slides2] |
|
HW3 due 2/20 @ 11:59pm PT |
|
2/24 |
Visualizing relational data
[slides] |
|||
Module 3: Advanced Visualization | ||||
2/28 |
Introduction to Shiny interactive visualization web app I
[slides] |
|
HW4 due 2/27 @ 11:59pm PT HW5 out |
|
3/3 |
Introduction to Shiny interactive visualization web app II
[slides] |
|
Project proposal due 3/2 @ 11:59pm PT |
|
3/7 | No classes - Spring Break | |||
3/10 | No classes - Spring Break | |||
3/14 |
Introduction to Shiny interactive visualization web app II
[slides] |
|
||
3/17 |
Introduction to Shiny interactive visualization web app III
[slides] |
|
||
3/21 |
Exploratory data analysis and more data visualization
[slides] |
|||
3/24 |
Visualization by PCA I
[slides] |
|
||
3/28 |
Visualization by PCA I
[slides] |
|
||
3/31 |
Visualization by PCA II
[slides] |
|
HW5 due 3/30 @ 11:59pm PT |
|
4/4 |
Additional Workshop
[github] |
|
||
4/7 |
Additional Workshop
[github] |
|
||
Project and Slides due 4/10 @ 11:59pm PT | ||||
4/11 |
Project Presentation
|
|||
Peer Review and Intra-Group Evaluation due 4/15 @ 11:59pm PT |
Administration
Please post questions, or discussion only via Canvas. The visibility of questions and discussion are expected to set for public view (to the Entire class on Canvas). Please feel free to send instructor/TAs emails regarding any personal or other private issues/concerns.
Homework
There are 6 individual homework assignments. Due day will be posted with the homework. Late submission would not be accepted. Each student has one time 3-day extension per semester only applied on homework. This extension will be applied automatically. Please let me know 24 hours in advance before the due day if any emergencies or difficulties occur.
Requests for regrades in writing will only be accepted no less than 10 days after receiving grade. Please send the instructor your NUID, and name with title “Request for regrade: HW+number” via email. The new grade may be lower than the original one.
Please properly cite any references used in your homework. An overview and summary of how to reference sources can be found at Purdue OWL Research and Citation Resources.You may discuss homework with your peers, but you must complete all assignments on your own. You can also utilize chatGPT or some other AI systems as a partial reference in your assignment, but you should optimize and enhance your answers rather than simply copying and pasting.
Final Projects
Project Instructions and Github: github
Project Proposal Examples
Project Demos
- Video Game Sales Visualization by Qiu Yucheng, Yu Wei
- COVID-19 Analysis by I-Hsuan Huang, Yiming Wang, Yen-Fong Li, Wenzheng Liao
Honor Code
Plagiarism, cheating, and any form of unauthorized collaboration will not be tolerated, certain work will be marked as zero and will be handled in accordance with university policies described in the Student Handbook. Sharing of completed solutions will not be tolerated. Plagiarism will be considered, if solutions and project documentations with a very high degree of similarity with other student’s or materials online. Such academic dishonesty will be handled in accordance with university policies. We will report violations to the Office of Student Conduct and Conflict Resolution (OSCCR). For additional information on Northeastern University’s Academic Integrity Policy.
Title IX
Northeastern University strictly prohibits discrimination or harassment on the basis of race, color, religion, religious creed, genetic information, sex, gender identity, sexual orientation, age, national origin, ancestry, veteran, or disability status. Please review Northeastern’s Title IX policy, which protects individuals from sex or gender-based discrimination, including discrimination based on gender-identity. Faculty members are required to report all allegations of sex/gender-based discrimination to the Title IX coordinator.
Accommodations for Students with Disabilities
If you have a disability, I encourage you to contact Disability Resource Center to register and request the accommodations. Also please discuss your needs with me as early in the semester as possible.
Take care of yourself
Eating healthy food, having regular exercises, avoiding alcohol and drugs, getting adequate sleep and taking time to relax. This will help you achieve your goals and tame stress.
If you have difficulty to keep up with any materials or homework for personal reasons please let me know early. If you or your friends/classmates who appears to be struggling, or having trouble coping with stress. We strongly encourage you to seek support at the We Care program at NEU. At Northeastern, a student is never alone when struggling with a demanding situation.
Course Evaluation
- Homework 42%
- Each HW 7%
- Final Project 50%
- Proposal 10%
- Presentation, Project and Documentation 38%
- Peer Review 2%
- Class Participation 8%
- Intra-Group Evaluation 1%
- Q&A during the class and on Campuswire 7%
This course does not have any quizzes or exams.
Hall of Fame
Top Projects
: Best Project : Honorable Mention (awarded equally)
Spring2023
: (Co-Winner) General Wealth Analysis by Country Across the World
Seattle OG: Rishabh Chawla, Shucheng Zhang, Weihua Pan, Yufei Sun
: (Co-Winner) Drinking Water, Sanitation, and Hygiene (WASH) Analysis
Shabu Shabu: Jessica Tanumihardja, Jiayue Han, Tomoki Kyotani, Xinan Wang
: Excessive Alcohol Use
Team 2: Chenchen Jiang, Chih-Hsuan Su, Cristina Stone Pedraza, Yiwei Li
: Global Mortality
Marsala: Hui Du, Jessica Anna James, Jianjian Liu
Fall2022
: COVID-19 Analysis
Bananas: I-Hsuan Huang, Yiming Wang, Yen-Fong Li, Wenzheng Liao
: Academic success visualization and analysis
TaylorExpansion: Farnaz Mohseni, Maryam Kian, David Suero Cobos, Sashankh Addanki, Yu Swe Zin Aung
: Washington Electric Vehicle Data Visualization and Analysis
KohenKappa: Gen Li, Zhichun Li, Fante Meng, Lingyun Ding
Top Contributors
Spring2023
: Tomoki Kyotani
: Jessica Tanumihardja
: Weimeng Duan; Xin shen; Jessica James
Fall2022
: Yiming Wang
: Farnaz Mohseni
: Gen Li; Zhichun Li; Fante Meng