TARIFAS PARA EL TRANSPORTE DE GAS NATURAL Capítulo Primero

Public Interest Feedback Loops

Introduction

Kinsa is a San Francisco based startup that designed a smart thermometer and app service to create a distributed real-time data generating network with public good uses. This information infrastructure was designed to be used by public health officials, researchers, and policymakers to generate new insights— insights that could in turn feed back into algorithmic development and better serve Kinsa users, as well as the public good. This unique business model and innovation strategy—informed by a founder’s past experiences working on the ground in a data-poor health policy environment—to work with distributed

research stakeholders represents a unique alignment of interests evolving around sharing user-generated biodata. Unlike other Silicon Valley innovation models in Chapter 2 that are usually unidirectional with knowledge flows feeding into the firm, Kinsa represents a firm that has mutually beneficial and porous potential from the data infrastructure built into their smart device and platform. However, associated data sharing agreements and research collaborations are not always easy to broker, and this case illustrates the need for established models that facilitate these partnerships and information flows. This public interest

orientation additionally opens up other market opportunities for users as they select between smart products and have the additional selections to choose a platform that contributes aggregate data to the greater public good and research- derived policymaking.

Public Health Information Primer

health of individuals and their community.331_{Public health operates at a}

population level, while focusing on regions (geographical and political) and individual communities in order to target responses or prevention efforts. Public health efforts include a range of actions including:332_{1) identifying, tracking, and}

remediating communicable diseases or pathogen outbreaks; 2) researching new prevention methods and treatments; 3) improving access to medical care within communities; and 4) addressing long-term and chronic conditions including social and behavioral actions that create population health risks.

Public health has been around for over a century, and has a long-standing relationship to the collection of social or community data to inform centralized decisions. For instance, in 1854 a large cholera outbreak was spreading through London and there was no centralized and accepted theory for how cholera spread from patient to patient. British Physician John Snow began talking to residents and charting the cases of cholera on a map, which revealed that most of the cases centered around one particular water well. Microbes were too small to be

visualized with equipment at the time, so examination of the water was futile.333

This founding instance of epidemiology research, however, coupled with one of the most canonical examples of data visualization helped to curb the deadly outbreak. The efforts Snow took to gather data from individuals and synthesize the data for study are similar to public health efforts still in effect in

contemporary society. For instance, in a large-scale foodbourne illness outbreak, samples from patients are sent to CDC approved labs and bacterial genomes are sampled to track common infectious agents. If the source is unclear

epidemiologists are sent to patients to talk about what they ate and find common sources (e.g., contaminated food from a restaurant or bought from a larger

national distributor). These information sharing systems are set up so that they trigger automatically when possible, and help identify cases that may be

geographically dispersed and asynchronous in timing.

Within the United States, data that are collected and distributed by the Centers for Disease Control and Prevention (CDC) are from a diverse network of public health agencies and private actors, and do not all originate from data

331_{"What Is Public Health?", 2019, accessed February, 2019, https://www.cdcfoundation.org/what-}

public-health.

332_{Loosely based on the CDC website’s categorization of activities. "About." Centers for Disease}

Control and Prevention,, 2019, accessed January, 2019, https://www.cdc.gov/about/organization/ mission.htm.

Note: The link had already expired by the time archiving took place and it could not be retrieved on the WayBack Machine. It appears it was similar to https://www.cdc.gov/about/organization/cio.htm (April 2019)

333_{Interestingly, this investigation even without microbial proof showed that a local brewery—whose}

water came from the contaminated well—did not have any infections because the water had been boiled before consumption.

"1854 Broad Street Cholera Outbreak." Wikipedia, 2019, 2019, https://en.wikipedia.org/wiki/ 1854_Broad_Street_cholera_outbreak.

collected by the CDC itself. A variety of data are held by the CDC including disease surveillance data, population health statistics and demographics, behavioral survey data, and merged datasets from diverse sources. These data may be entirely collected by the CDC (e.g., survey data), other federal agencies (e.g., NIH, VA, Centers for Medicare & Medicaid Services), international partners (e.g., WHO, other national health departments), state/local health departments (e.g., mandatory disease reporting or state level data such as morbidity data), non-profits or foundations (e.g., Kaiser Family Foundation), and private entities such as health insurance companies.334_{Each of these broad categories breaks}

down into practitioners and healthcare providers who are among first reporting entities. This network of private and public institutions is vast and distributed, and most often relies on voluntary reporting that has been fostered over time through trust and mutual partnerships. As much data as possible are made publicly accessible through reports, memos, and open data so that researchers, healthcare providers, pharmaceutical companies, non profits, community organizers, and private citizens can access data that will help inform decisions and generate prevention/remediation efforts that improve public health. An extensive discussion of this sharing ecosystem and how privacy concerns and governance decisions are made is included in the paper “Public Health as a Model for Cybersecurity Information Sharing.”335

Big Data Cataclysm: Google Flu Trends

Flu tracking and Silicon Valley have a somewhat complicated recent history. In 2008 and in the foreshadow of the swelling “Swine Flu” epidemic, Google researchers published in Nature a method of detecting influenza outbreaks using Google search queries to find a highly correlated trend to physician visits and reported influenza-like symptoms.336_{Google boasted a 1 day reporting lag,}

334_{See for example data sources for:}

“eWorld Appendix." Center for Disease Control and Prevention, 2019, accessed January, 2019, https://wwwn.cdc.gov/eworld/Appendix/SourcesOfData.

"Cancer Prevention and Control." 2019, accessed January, 2019, https://www.cdc.gov/cancer/dcpc/ data/other.htm.

"HIV Statistics." Center for Disease Control and Prevention,, 2019, accessed January, 2019, https:// www.cdc.gov/hiv/pdf/statistics_Lab_reporting_DCL_final.pdf.

"Datasets CDC Wonder Systems." Center for Disease Control and Prevention, 2019, accessed January, 2019, https://wonder.cdc.gov/datasets.html.

335_{Sedenberg, Elaine M, and Deirdre K Mulligan. Public Health as a Model for Cybersecurity}

Information Sharing. University of California Berkeley (Berkeley Technology Law Journal: 2015). http:// btlj.org/data/articles2015/vol30/30_3/1687-1740%20Sedenberg.pdf.

336_{The paper made a correction to the reference list, so the updated February 2009 is often cited as}

whereas the CDC took 1-2 weeks in order publish data on regions hit by the flu.337

The time lag for federal data is caused by the logistics of collecting data from distributed sources, but also because the CDC and most federally-released dataset require an extensive amount of quality assurance and cleaning to make sure data are accurate and representative. This decreased reporting time could improve response and prevention actions by public health officials, and help with

allocating resources—particularly in a severe influenza outbreak like the Swine Flu. Big data had delivered on its promise to usurp conventional information sharing mechanisms and offer convenient insights from our perfunctory routines into matters of life an death. However by 2013, Google Flu Trends had missed the peak of flu season by 140% and researchers documented these failures in

Science.338_{The takeaway was not to discredit attempts to utilize big data}

generated by the private sector to supplement or replace traditional models—in fact their calls state the opposite and underscore the responsibility to use these data in the public interest—but rather intended to call out the “big data hubris” that clouds the potential uses of these data. In the case of Google Flu Trends, the methods and data were kept opaque so that policymakers would not have been able to interrogate these data if real high-stakes decisions were being made

around it. The algorithm tailor to track these outbreaks was also left vulnerable to overfitting to other terms that seasonally occurred at the same time as outbreaks over time. Google also introduced suggested search features and health based add-ons that helped users find information relevant to their illness, but had an unintended side effect of driving more clicks and searches relating to the flu.339

Google stopped producing the trends around 2014, including their efforts to track Dengue Fever.340

Despite the failures and cautionary “big data” parable, Google Flu Trends represented an important shift in thinking about how public good oriented

research—including public health—could leverage real-time data from the private sector to enable new analyses and research. Further, this seeded a larger theme: that data held by the private sector was beginning to offer granularity and

insights unmet by the traditional and canonical government data sources and methods. This belief was not cured by the failure of this one instance of execution and data source.

337_{Ginsberg, J., M. H. Mohebbi, R. S. Patel, L. Brammer, M. S. Smolinski, and L. Brilliant. "Detecting}

Influenza Epidemics Using Search Engine Query Data." Nature 457, no. 7232 (Feb 19 2009): 1012-4. https://doi.org/10.1038/nature07634. https://www.ncbi.nlm.nih.gov/pubmed/19020500.

338_{Lazer, David, Ryan Kennedy, Gary King, and Alessandro Vespignani. "The Parable of Google Flu:}

Traps in Big Data Analysis." Science 343, no. 6176 (2014): 1203-05. http://science.sciencemag.org/ content/343/6176/1203.full.

339_{Lazer, David, and Ryan Kennedy. "What We Can Learn from the Epic Failure of Google Flu}

Trends." Wired, 2015. https://www.wired.com/2015/10/can-learn-epic-failure-google-flu-trends/.

Flu Data

The CDC tracks multiple elements of influenza throughout the flu season, and works with other countries (e.g., Australia in the Southern Hemisphere which indicates the severity and type of influenza to hit during Northern

Hemisphere winter) to understand the trends and make decisions. Data on the influenza virus and severity inform the vaccine formulation for each year.341_The

CDC collects data on influenza from many different partners including public health and clinical laboratories located in all states and territories as part of the WHO Collaborating Laboratory system and the National Respiratory and Enteric Virus Surveillance System (NREVSS). These laboratories test for the type and strain of influenza, and share viral specimens with partners and the CDC for testing so that antiviral resistance and the antigenic or genetic characterization can be documented and tracked. The CDC also collects data from over 3,500 healthcare providers on all “influenza-like illness” through the US Outpatient Influenza-like Illness Surveillance Network (ILINet) on age profiles and number of cases seen. These data are also reported automatically when enabled by

electronic health records. Further, each state health department reports an estimated level of influenza cases each week and documents it through the State and territorial Epidemiologists Reports which serve as a summary of influenza activity and does not relay the severity of an outbreak. To track more severe illnesses, the CDC collects data from the Influenza Hospitalization Surveillance Network (FluSurv-Net) that reports laboratory confirmed cases (as opposed to NREVSS which reports all cases with flu-like symptoms). This network includes 70 countries, but does not include all states (only ten participate)—it also

contains data gaps in cases where testing is not performed or symptoms are attributed to other similar illnesses. Finally, mortality surveillance is conducted through two larger mortality surveillance systems that monitor cause of death in adults and minors and report “Pneumonia and Influenza (P&I)” as a cause of death code.

These systems are distributed and not perfect, but designed to have representation among geographic regions, socio-economic statuses, ages, and levels of healthcare. There is also redundancy and known gaps in the information systems that are meant to be taken together in order to inform public health decisions and preparation for future influenza seasons. These data include

information from private healthcare providers, but do not leverage other types of sensors or “big data’ surveillance.

341_{Center for Disease Control and Prevention. Overview of Influenza Surveillance in the United}

States. (Centers for Disease Control and Prevention, National Center for Immunization and Respiratory Diseases (NCIRD): 2018). https://www.cdc.gov/flu/weekly/overview.htm.

Kinsa Motivations

Kinsa was founded in 2012 by Inder Singh, who was a former Executive Vice President of the Clinton Foundation’s Health Access Initiative (CHAI) which worked with over 70 governments and 22 pharmaceutical companies to decrease the cost of diagnostics and drugs for HIV/AIDS, malaria, and TB.342_{According to}

one employee, Inder became frustrated by the lack of current data about outbreaks and incidence levels in countries where the disease spread rapidly.

“He was incredibly frustrated by the lack of quality information and data around where people needed those medicines and even just how to allocate billions of dollars of funding. If you asked one team how many people get malaria every year and you ask another team, we'll get orders of magnitude different answers and that was just crazy.”

Believing that modern sensors could provide more frequent and granular data, Inder founded Kinsa as a company that could distribute a product and service while collecting population—or at least community—level data. Kinsa has a stated mission to create “the world’s first real-time map of human health.”343

This knowledge and analysis could then be leveraged on an individual level to algorithmic predictions and feedback to users.

Thermometers have been around as a biosensing modality since 1625,344_but

the acceptance of use and normalized body temperature was not made until around 1868. The first electronic thermometer based on a Carboloy thermistor was invented in 1954.345_{Since then, digital thermometers of improving accuracy}

as well as decreasing size and cost have become a part of at home and

professional medical care. Unlike other biosensing companies where the sensor,

342_{"Inder Singh, Kinsa Founder & Ceo." CrunchBase, 2019, accessed January, 2019, https://}

www.crunchbase.com/person/inder-singh#section-overview.

343_{"Kinsa Organization Overview." Crunch Base, 2019, accessed January, 2019, https://}

www.crunchbase.com/organization/kinsa-inc#section-overview.

344_{Pearce, J. M. "A Brief History of the Clinical Thermometer." [In eng]. QJM 95, no. 4 (Apr 2002):}

251-2. https://www.ncbi.nlm.nih.gov/pubmed/11937653.

Impressively, Carl Wunderlich published over 1 million readings from 25,000 patients in order to establish the normal temperature range of a healthy person. One can imagine how much easier these data collections could have been in the modern digital world.

345_{"Fast Clinical Thermometer Takes Temperature in Seconds." Popular Mechanics, 1954, 123.}

https://books.google.com/books?

id=sdwDAAAAMBAJ&pg=PA123&dq=1954+Popular+Mechanics+January&hl=en&sa=X&ei=Pv4YT5SlG 5PoggekxISEDA&ved=0CDgQ6AEwAg#v=onepage&q&f=true.

or set of sensors, has not been used in orchestration for that particular purpose before (e.g., Lioness) or where algorithmic development and choices on sensor design are still being developed (e.g., Basis), the digital thermometer has an established history. What makes the newest generation of “smart” digital thermometers unique are the platforms to help organize data for tracking or healthcare appointments, as well as the recommendations and personalized suggestions based on other demographic health information (i.e., personalizing responses based on if the individual measured is an infant which follows different guidelines).

Kinsa has a platform that organizes user data and information, and offers recommendations on treatment and when to see a healthcare professional. The device is available for purchase at stores like Target and Walgreens, and connects to a mobile app. There are also enterprise versions of the device—service

packages designed to connect communities like workplaces and schools—that have been used to track illness at a smaller scale and offer guidance to promote proper care and limit spread.346_{The enterprise modeling of this smart device is a}

standout example of an innovative biosensing business model that focuses on not only larger population health trends, but trends within smaller communities.

“Kinsa was founded and the initial phase of the company was building the first smart thermometer, first FDA cleared smart thermometer, and proving that consumers see value in this device and building out the sensor network to have sufficient scale. And now that we have over one point 5 million users across the United States and three years of history of data that we can validate the end of the value of that aggregated information.”

The founders of Kinsa made a deliberate choice to not make the company a nonprofit so that they were not relying on small grants and gifts in order to sustain data operations, all of which can be unreliable and under pressure from political trends over time. The company also wanted to prove that they could collect these real time health data and establish private public health information infrastructure, all while providing a product that consumers wanted to buy. As of June 2018 when this interview was conducted, Kinsa had over 1.5 million users.

Kinsa worked with academics to establish how their data could be used by external entities for study and as a way to obtain external validation:

“As a part of our development process, we knew that for most customers they need to have some sort of external validation that this data is accurate, that is accurately

representing their illness, some sort of that real world behavior. So we started reaching out to several academic groups that have done a lot of work in the flu forecasting space, knowing that they would likely be really interested in working with us because no one's ever access to this kind of data before.”

The startup reached out to researchers who had done flu surveillance work before to propose the partnership. By providing the clean data, many academic partners felt there was only minimal work left to provide a literature review and run the correlation analysis to see how well the data tracked to flu trends. In one case, a researcher had moved away from that line of influenza tracking research but was able to give the data to a graduate student for their own research. The structuring of the hypothesis and execution of the research was conducted wholly on the academic side. These data would not have been possible to collect and study without the partnership of a private firm like Kinsa—both in its scale, overall feasibility from a university research perspective, and on the return on investment for creating the sensor infrastructure at scale for these fundamental questions. Yet this exchange provided mutual benefit: enabling academic inquiry by supplying fundamental data, and by helping the startup validate their models and approach.347

When asking about barriers and friction to these academic partnerships, an

In document Aprueban el Reglamento de Transporte de Hidrocarburos por Ductos DECRETO SUPREMO N EM (página 30-36)