DATA 100: Wicked Problems
Course ID: DATA 100
Course Attribute: COLL 100
Title: Wicked Problems: The New Data Paradigm and Emergent Human Development Processes
Credit Hours: 4
Meeting Times: 10:00 to 10:50 MWF
Location: ISC3 1111
Date Range: Fall Semester 2021
Course Description
Global, near present-time, high-resolution data is openly accessible to construct scientific descriptions of human development. In this course you will use data describing populations, governments, and both the natural and built environments in order to construct a close-to-reality description of your selected region or country. You will then intersect a selected dimension of human development as an initial boundary for investigating a contextually relevant research question. While development is often thought of as a “missing ingredient” that is needed to improve the well-being of people or the output of firms, in this course human development is defined as the ability to enlarge people’s choices, capabilities and freedoms and is understood as an emergent property from our complex and adapting economic and social system. Data science methods are used to construct close-to-reality descriptions of human development processes in order to identify interactions, describe co-evolving agents, recognize the conditions prevalent for emergence, understand the significance of scale and ultimately begin to establish a framework for analyzing human development’s multitude of seemingly intractable, wicked problems. During this course you will also learn to use the flexible data science, programming language R, but no prior experience is needed. Pre-requisite(s): None
Courses Objectives
- To introduce students to human development as the ability to enlarge people’s choices, capabilities and freedoms and as an emergent property from society functioning as a complex adaptive social and economic system.
- To introduce students to data science through understanding previously computationally intractable, wicked problems.
- To demonstrate competent information literacy and data science skills by producing knowledge from data through the creation of plots, graphs, charts and maps.
- To demonstrate competent application of fundamental computer science and statistics methods within the context of geospatial human development processes.
- To challenge students to become apprentice scholars by supporting their answer to a formulated central research question with close-to-reality geospatial descriptions of human development processes.
Honor Code
Among our most significant traditions is the student-administered honor system. The Honor Code is an enduring tradition with a documented history that originates as far back as 1736. The essence of our honor system is individual responsibility. Today, students, such as yourself, administer the Honor pledge to each incoming student while also serving to educate faculty and administration on the relevance of the Code and its application to students’ lives.
The Pledge
“As a member of the William and Mary community, I pledge on my honor not to lie, cheat, or steal, either in my academic or personal life. I understand that such acts violate the Honor Code and undermine the community of trust, of which we are all stewards.”
Accessibility, Attendance & Universal Learning
William & Mary accommodates students with disabilities in accordance with federal laws and university policy. Any student who feels s/he may need accommodation based on the impact of a learning, psychiatric, physical, or chronic health diagnosis should contact Student Accessibility Services staff at 757-221-2509 or at sas@wm.edu to determine if accommodations are warranted and to obtain an official letter of accommodation. For more information, please see www.wm.edu/sas.
I am committed to the principle of universal learning. This means that our classroom, our virtual spaces, our practices, and our interactions be as inclusive as possible. Mutual respect, civility, and the ability to listen and observe others carefully are crucial to universal learning. Active, thoughtful, and respectful participation in all aspects of the course will make our time together as productive and engaging as possible.
Grade Categories
exceptional A = 100 ≥ 97.0 | excellent A = 96.9 ≥ 93.0 | superior A- = 92.9 ≥ 90.0 |
very good B+ = 89.9 ≥ 87.0 | good B = 86.9 ≥ 83.0 | above average B- = 82.9 ≥ 80.0 |
normal C+ = 79.9 ≥ 77.0 | average C = 76.9 ≥ 73.0 | sub par C- = 72.9 ≥ 70.0 |
below average D+ = 69.9 ≥ 67.0 | poor D = 66.9 ≥ 63.0 | very poor D- = 62.9 ≥ 60.0 |
failing F < 60.0 |
note .9 = .9 with bar notation
Grading Opportunities
Five Data Science reflections | 25% | Due every other Friday |
Five Individual Projects | 50% | Due every other Saturday |
Final Project | 25% | due by 5PM on the last day of finals |
Semester Schedule
Week 1 (...)
- Wednesday:
- Introductions
- Zoom, Slack, Blackboard & the Syllabus
- For next time:
- Introductions
- Friday:
- Using GitHub & Markdown
- For next time:
Week 2 (...)
- Monday:
- Using RStudio
- For next time:
- Wednesday:
- Using RStudio
- Friday: add/drop period ends
- Using RStudio
Week 3 (...)
- Monday:
- Wednesday:
- Friday: Open
Week 4 (...)
- Monday:
- Wednesday:
- Friday:
- Data Science Reflection #1
- Sunday: Project 1 due at midnight
- Project 1 deliverable:
- Project 1 extra credit:
Week 5 (...)
- Monday:
- Wednesday:
- Friday:
Week 6 (...)
- Monday:
- Wednesday:
- Friday: Open Session / Workshop (3/5)
- Sunday: Project 2 due at midnight
- Project 2 deliverable:
- Project 2 extra credit:
Week 7 (...)
- Monday:
- Wednesday:
- Friday:
Week 8 (...)
- Monday:
- Wednesday: Open
- Friday:
- Project 3 Modeling & Predicting Spatial Values
Week 9 (...)
- Monday:
- Stretch Goal
- Wednesday:
- Stretch Goal
- Friday:
- Data Science Reflection #3
- Sunday: Project 3 due at midnight
- Project 3 Deliverables:
- Project 3 Extra Credit:
Week 10 (...)
- Monday:
- Project 4 Modeling & Predicting Spatial Values
- Wednesday:
- Project 4 Modeling & Predicting Spatial Values
- Friday:
- Project 4 Modeling & Predicting Spatial Values
Week 11 (...)
- Monday:
- Project 4 Modeling & Predicting Spatial Values
- Wednesday: Open
- Friday:
- Project 4 Investigating and Comparing Results
Week 12 (...)
- Monday:
- Project 4 Investigating and Comparing Results
- Wednesday:
- Project 4 Investigating and Comparing Results
- Friday:
- Project 4 Investigating and Comparing Results
- Sunday: Project 4 due at midnight
- Project 4 Deliverables:
- Project 4 Extra Credit:
Week 13 (...)
- Monday:
- Project 5
- Wednesday:
- Project 5
- Friday:
- Data Science Reflection #4
Week 14 (...)
- Monday: Open
- Wednesday:
- Project 5
- Friday:
- Project 5
Week 15 (...)
- Monday:
- Final Project
- Wednesday:
- Final Project
- Friday: Last day of class
- Data Science Reflection #5
- Project 5 benchmark
Final
- Final Project is due on the last day of the finals period at 5PM.
- Combination of Accessibility 1, 2 & 3
Links to Responses
Resources
Peer Reviewed Publication Sources
- Worldpop - publications
- Flowminder - publications
- Facebook Data for Good
- Data-Intensive Development Lab - publications
- BayesPop Project - papers
- DHS Spatial Analysis Reports
Data
- Development Data Exchanges
- Remotely sensed environmental data
- Political boundaries and administrative subdivisions
- Population estimates
- WorldPop - Population
- Center for International Earth Science Information Network / Facebook Data for Good - High Resolution Settlement Layer
- Socioeconomic Data and Applications Center (SEDAC) - Gridded Population of the World (GPW), v4
- Oak Ridge National Laboratory (ORNL) - LandScan Global
- POPGRID Data Collaborative
- Demographic estimates
- Primary census and household survey data
- Built environment including settlements, facilities and infrastructures
- Population movement data
Topics
-
Data Science to understand Human Development as a complex system
- Don’t forget people in the use of big data for development (Joshua Blumenstock)
- The Best Stats You've Never Seen (Hans Rosling, Gapminder)
- Development as Freedom (Amartya Sen, 1999)
- Development and Complexity (Owen Barder, 2012)
- Big Data, New Epistemologies and Paradigm Shifts (Rob Kitchin)
- Scale (Geoff West, 2018)
-
Population and development data descriptions
- WorldPop, open data for spatial demography (Tatem)
- High resolution global gridded data for use in population studies (Lloyd, Sorichetta & Tatem)
- Spatially disaggregated population estimates in the absence of national population and housing census data (Wardrop et al.)
- Mapping road network communities for guiding disease surveillance and control strategies (Strano, Viana, Sorichetta & Tatem)
- A spatial database of health facilities managed by the public health sector in sub Saharan Africa (Maina et al.)
Methods: application
- Dasymmetric Allocation
- Random Forest Model
- Disaggregating Census Data for Population Mapping Using Random Forests with Remotely-Sensed and Ancillary Data (Stevens, Gaughan, Linard & Tatem)
- Examining the correlates and drivers of human population distributions across low- and middle-income countries (Nieves et al.)
- Gridded Population Maps Informed by Different Built Settlement Products (Reed et al.)
- Random Forest Model
- Land Use Classification
- Random Forest Model
- Spatial Interpolation
- Hierarchical Bayesian Model
- Guidance for Use of The DHS Program Modeled Map Surfaces
- Creating Spatial Interpolation Surfaces with DHS Data
- Fine resolution mapping of population age-structures for health and development applications (Alegana et al.)
- High Resolution Age-Structured Mapping of Childhood Vaccination Coverage in Low and Middle Income Countries (Utazi et al.)
- Mapping vaccination coverage to explore the effects of delivery mechanisms and inform vaccination strategies (Utazi et al.)
- Spline Interpolation
- Comparison of Spatial Interpolation Methods to Create High-Resolution Poverty Maps for Low- And Middle-Income Countries (Wong, Brady, Campbell & Benova)
- Artificial Neural Networks
- Hierarchical Bayesian Model
- Spatial Interaction Processes
- Gravity Type Models
- The Use of Census Migration Data to Approximate Human Movement Patterns across Temporal Scales (Wesolowski et al.)
- Modeling internal migration flows in sub-Saharan Africa using census microdata (Garcia, Pindolia, Lopiano & Tatem)
- Mapping internal connectivity through human migration in malaria endemic countries (Sorichetta et al.)
- Exploring the use of mobile phone data for national migration statistics (Lai et al.)
- Utilizing general human movement models to predict the spread of emerging infectious diseases in resource poor settings (Kraemer et al.)
- Impedance Model
- Gravity Type Models