The replication data comprise three parts: Centers for Disease Control and Prevention (CDC) data, Google Trends (GT) data, and Electronic Health Record (EHR) data. All data were obtained and frozen as of July 9, 2016.
The CDC publishes the weekly unweighted influenza-like illness (ILI) activity level (gis.cdc.gov/grasp/fluview/fluportaldashboard.html). The initial report is subject to revision in later weeks as more data are gathered and processed from participating clinics around the country. We have consolidated the CDC's weekly unweighted ILI activity level data, together with the later revisions, into a single CSV file.
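As an illustration of what consolidating initial reports with later revisions can look like, here is a minimal Python sketch. The record layout (week, report date, ILI value), the function name `consolidate_revisions`, and the output field names are all hypothetical; they are not taken from the dataset's CSV file, and the numbers are made up.

```python
from datetime import date

def consolidate_revisions(reports):
    """Collapse (week, report_date, ili) records into one row per week.

    For each week, keep the value from the earliest report (the initial
    release) and the value from the latest report (the most recent revision).
    """
    by_week = {}
    for week, report_date, ili in reports:
        by_week.setdefault(week, []).append((report_date, ili))

    rows = []
    for week in sorted(by_week):
        history = sorted(by_week[week])  # order each week's reports by report date
        rows.append({
            "week": week,
            "initial_ili": history[0][1],
            "latest_ili": history[-1][1],
            "n_reports": len(history),
        })
    return rows

# Made-up values, for illustration only.
example_reports = [
    (date(2016, 1, 9), date(2016, 1, 15), 2.1),   # first report for the week
    (date(2016, 1, 9), date(2016, 1, 22), 2.3),   # revision one week later
    (date(2016, 1, 16), date(2016, 1, 22), 2.6),  # first report for the next week
]
consolidated = consolidate_revisions(example_reports)
```

Keeping both the initial and the latest value per week preserves the information that a forecaster working in real time would only have seen the initial figure.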
Google Trends (www.google.com/trends) data are publicly available. The query terms we used were identified with Google Correlate (www.google.com/trends/correlate), which yielded 129 flu-related Google search terms in total. The Google Trends data were then manually downloaded and consolidated into a single CSV file.
The EHR data are from athenahealth, a provider of cloud-based services and mobile applications for medical groups and health systems (www.athenahealth.com). The data we used are historical values of three nationally aggregated weekly fractions of total patient visit counts: (a) flu visit counts, (b) ILI visit counts, and (c) unspecified viral or ILI visit counts. That is, each reported value is the rounded fraction of one type of visit count relative to the total patient visit count. The EHR data are available in real time starting from July 2009.
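The three EHR fractions described above can be sketched in Python as follows. This is only an illustration: the function name `visit_fractions`, the dictionary keys, and the rounding precision (`ndigits=4`) are assumptions, since the description does not state how many digits the reported values are rounded to, and the counts are made up.

```python
def visit_fractions(total_visits, flu, ili, viral_or_ili, ndigits=4):
    """Return each visit-count type as a rounded fraction of total visits.

    ndigits is an assumed rounding precision, not the one used by the data provider.
    """
    if total_visits <= 0:
        raise ValueError("total_visits must be positive")
    return {
        "flu_fraction": round(flu / total_visits, ndigits),
        "ili_fraction": round(ili / total_visits, ndigits),
        "viral_or_ili_fraction": round(viral_or_ili / total_visits, ndigits),
    }

# Made-up weekly counts, for illustration only.
week_fractions = visit_fractions(total_visits=1000, flu=12, ili=25, viral_or_ili=40)
```

Because only the rounded fractions are reported, the underlying visit counts cannot be recovered from the dataset.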
CC0 1.0