Publications

Privacy guarantees for personal mobility data in humanitarian response

Publication date: 2024

Personal mobility data from mobile phones and other sensors are increasingly used to inform policymaking during pandemics, natural disasters, and other humanitarian crises. However, even aggregated mobility traces can reveal private information about individual movements to potentially malicious actors. This paper develops and tests an approach for releasing private mobility data, which…

Definitions of Fairness Differ Across Socioeconomic Groups & Shape Perceptions of Algorithmic Decisions

Publication date: 2024

Understanding how people perceive algorithmic decision-making is remains critical, as these systems are increasingly integrated into areas such as education, healthcare, and criminal justice. These perceptions can shape trust in, compliance with, and the perceived legitimacy of automated systems. Focusing on San Francisco’s decade-long policy of algorithmic school assignments, we draw on…

Small Worlds: Measuring the Mobility of Characters in English-Language Fiction

Publication date: 2024

The representation of mobility in literary narratives has important implications for the cultural understanding of human movement and migration. In this paper, we introduce novel methods for measuring the physical mobility of literary characters through narrative space and time. We capture mobility through geographically defined space, as well as through generic locations such as homes,…

Targeting Social Protection Programs with Machine Learning and Digital Data

Publication date: 2024

Social protection programs are essential to assisting the poor, but governments and humanitarian agencies are rarely resourced to provide aid to all those in need, so accurate targeting of benefits is critical. In developed economies, targeting decisions typically rely on administrative income data or broad survey-based social registries. In low-income countries, however, poverty…

Using AI/Machine Learning to Extract Data from Japanese American Confinement Records

Publication date: 2024

With funding from a 2019 National Park Service Japanese American Confinement Sites grant, The Bancroft Library digitized the complete set of Form WRA-26 “individual records” for more than 110,000 Japanese Americans incarcerated in War Relocation Authority camps during WWII. The library partnered with Doxie.AI to utilize AI/machine learning to automate text extraction from over 220,000 images;…

Conspiracy, misinformation, radicalisation: understanding the online pathway to indoctrination and opportunities for intervention

Publication date: 2024

In response to the rise of various fringe movements in recent years, from anti-vaxxers to QAnon, there has been increased public and scholarly attention to misinformation and conspiracy theories and the online communities that produce them. However, efforts at understanding the radicalisation process largely focus on those who go on to commit violent crimes. This article draws on three waves…

Short Essays on Entrepreneurship

Publication date: 2024

The 30-chapter book contains a compelling collection of short essays that serves as a beacon for aspiring founders and business leaders alike. In this thought-provoking anthology, the author shares insights, experiences, and valuable lessons learned first-hand in building and funding successful ventures. Each short essay offers a unique perspective on key facets of entrepreneurship, through…

Educational overview of the concept and application of computer vision in arthroplasty

Publication date: 2023

Image data has grown exponentially as systems have increased their ability to collect and store it. Unfortunately, there are limits to human resources both in time and knowledge to fully interpret and manage that data. Computer Vision (CV) has grown in popularity as a discipline for better understanding visual data. Computer Vision has become a powerful tool for imaging analytics in orthopedic…

Application of NLP in total joint arthroplasty: opportunities and challenges

Publication date: 2023

Total joint arthroplasty (TJA) is becoming one of the most common surgeries within the United States, creating an abundance of analyzable data to improve patient experience and outcomes. Unfortunately, a large majority of this data is concealed in electronic health records only accessible by manual extraction, which takes extensive time and resources. Natural language processing (NLP), a field…

An overview of machine learning in orthopedic surgery: an educational paper

Publication date: 2023

The growth of artificial intelligence (AI) combined with the collection and storage of large amounts of data in the electronic medical record collection (EMR) has created an opportunity for orthopaedic research and translation into the clinical environment. Machine learning (ML) is a type of AI tool well suited for processing the large amount of available data. Specific areas of ML frequently…

Embodying the Future: Modeling Visually Guided Planning as Prospective Mental Simulation

Publication date: 2023

What would it feel like to run outside, right now, and attempt a somersault on the first surface you find? Taking seriously an invitation like this to imagine a (perhaps unlikely) future, prompts the activation of evolutionary machinery in the mind and body that took millions of years to emerge. The ability to answer this question depends upon a surprisingly complex model of yourself, the…

Differential Privacy for Black-Box Statistical Analyses

Publication date: 2023

We formalize a notion of a privacy wrapper, defined as an algorithm that can take an arbitrary and untrusted script and produce an output with differential privacy guarantees. Our novel privacy wrapper, named TAHOE, incorporates two design ideas: a type of stability under subsetting, and randomization over subset size. We show that TAHOE imposes differential privacy for every possible script.…

Monkeypox Outbreak Analysis: An Extensive Study Using Machine Learning Models and Time Series Analysis

Publication date: 2023

The sudden unexpected rise in monkeypox cases worldwide has become an increasing concern. The zoonotic disease characterized by smallpox-like symptoms has already spread to nearly twenty countries and several continents and is labeled a potential pandemic by experts. monkeypox infections do not have specific treatments. However, since smallpox viruses are similar to monkeypox viruses…

A comparative analysis of human and AI performance in forensic estimation of physical attributes

Publication date: 2023

Human errors in criminal investigations have previously led to devastating miscarriages of justice. For example, flaws in forensic identification based on physical or photographic evidence are notoriously unreliable. The criminal justice system has, therefore, started to turn to artificial intelligence (AI) to improve the reliability and fairness of forensic identification. So as not to repeat…

Survivability of industrial internet of things using machine learning and smart contracts

Publication date: 2023

Due to data collection, there is a potential risk concerning security and privacy, so IoT reliability and survivability are of utmost concern. In this paper, we address the concern using two methods. The first method is device identification, which uses an extensive set of machine learning algorithms for identifying IoT devices. The algorithms include Logistic Regression, K- Nearest…

SDN and application layer DDoS attacks detection in IoT devices by attention‐based Bi‐LSTM‐CNN

Publication date: 2023

The Internet of Things (IoT) is connecting more devices every day. Security is critical to ensure that the devices operate in a trusted environment. The lack of proper IoT security encourages cybercriminals to target many smart devices across the network and gain sensitive information. Distributed Denial of Service (DDoS) attacks are common in the IoT infrastructure and involve hijacking IoT…

A transfer learning approach for detecting offensive and hate speech on social media platforms

Publication date: 2023

Over the last few decades, the expansion of technology and the internet has led to the number of users proliferating on social media, with a simultaneous increase in hate speech. A critical concern is, hate speech is not only responsible for igniting violence and spreading hatred, but its detection also requires a considerable amount of computing resources and content monitoring by human…

Situating Web Searching in Data Engineering: Admissions, Extensions, Repairs, and Ownership

Publication date: 2022

When does web search work? There is a significant amount of research showing where and how web search seems to fail. Researchers identify various contributing causes of web search breakdowns: the for-profit orientation of advertising driven companies, racial capitalism, the agonistic playing field with search engine optimizers and others trying to game the algorithm, or perhaps ‘user error’.…

Creating and Collecting Meaningful Musical Materials with Machine Learning

Publication date: 2022

This dissertation explores how machine learning and artificial intelligence can be applied within music composition and production. My approach in this research stems from an underlying perspective that these technologies are deeply intertwined with the people who use them or are affected by them: we can’t hope to understand one side of the picture without looking at the other. From this…

Search quality complaints and imaginary repair: Control in articulations of Google Search

Publication date: 2022

In early 2017, a journalist and search engine expert wrote about “Google’s biggest ever search quality crisis.” Months later, Google hired him as the first Google “Search Liaison” (GSL). By October 2021, when someone posted to Twitter a screenshot of misleading Google Search results for “had a seizure now what,” users tagged the Twitter account of the GSL in reply. The GSL frequently publicly…