- Return to SPARCS
NYU School of Medicine - Health Care By The Numbers Curriculum / Student Project Guide
The SPARCS Student Project takes place within the NYU School of Medicine Practice of Medicine course. SPARCS faculty include: Ruth M. Crowe, MD, PhD;
Joseph Nicholson, MLIS, MPH;
Martin Pusic, MD, PhD;
Mark Schwartz, MD; and Marc Triola, MD.
For academic year 17-18, the activity will consist of:
||Dr. Triola will provide an Introduction to SPARCS talk to discuss the database and provide an overview of the exercise.
||Joey Nicholson will talk with you about formulating SPARCS Literature Research Questions.
||DUE DATE for submitting your team's clinical question for SPARCS via Brightspace. Please designate one student within your pair to submit your proposal (do not submit twice). The groups are posted to the Brightspace calendar.
||Dr. Triola will present the analyses of three of those submissions in his 'SPARCS Follow-up' talk.
||Joey Nicholson will meet with you to debrief about the research questions submitted and strategies for strengthening and improvement.
How to approach creating your clinical question:
Make sure your question is testable by the data present in SPARCS.
Some examples from previous NYU students that are answerable with SPARCS:
- Read the SPARCS Data Dictionary to better understand the data elements.
- Is there enough variation in the dataset?
- Is there a sufficient number of cases?
- Would the result be interesting?
- Could the health care system, or individual providers, act on the result?
If you want to look at the data, use the NYU SPARCS tools to filter your data by a given diagnosis/procedure and download it.
There are two data download options you will see on our SPARCS site:
- Does day of admission correlate with length-of-stay for CHF?
- Does severity of illness score correlate with length-of-stay for patients with Drug and Alcohol dependence?
- How does hospital level case-load relate to length-of-stay for those undergoing hip replacement?
- Does a patient’s race impact the rate of cardiac catheterization among patients admitted admitted with acute MI?
- The All Hospitals button will download the data from all of the NY hospitals that treat that condition. This may be a huge dataset and could be too big for Excel.
- The '8 Hospitals Only' button only downloads the data from 8 NY hospitals that we chose. These data are more manageable, will work in Excel, and are likely a better choice for an initial student project.
- For the purposes of the student exercise, these data are best evaluated in a spreadsheet or basic statistical program.
Any questions? Email us.
Frequently Asked Questions
What permission do we need to use these data in a presentation or publication?
The SPARCS data are under the open government public use license and made available through the Open NY initiative which poses no limitations “over its end use" though they do require attribution to the NYS DOH. The full license is here. Relevant portion:
"Unless otherwise noted on an individual document, file, web page or other item, the Department of Health grants users permission to reproduce materials published by the Department on this Website so long as the Department of Health is noted as the source, and the data the web page was accessed, along with the date of publication of the material cited, is noted. "
The Hospital compare data are under a similar open license. Relevant portion: "Works of the U.S. Government are in the public domain and permission is not required to reuse them."
How do the data on this site differ from the raw SPARCS data available on health.data.ny.gov?
We have made some changes to these data to make them easier to use for medical students. Remember that you can always get the original raw data from the DOH site. Our changes include:
Looking for more open clinical data sets for your projects?
- Updating the hospital names across all three years of data so they are consistent.
- Where possible, matching the payer names to be consistent. For example, NY State changed 'Insurance Company' to 'Private Health Insurance' in 2014. We changed the 2012 and 13 data to reflect that.
- We do not include the provider license number in the CSV files exported from this system. You can still get those from the public data file on the DOH site.
- We only include the first listed source of payment (payer) type.
We list more here.
What are the zip codes used in SPARCS?
SPARCS only includes the first three digits of the patient's zip code. Its left blank if the population size for that area is less than 20,000. “OOS” are Out of State zip codes.
What are the diagnosis and procedure codes used in SPARCS?
SPARCS includes two types of diagnostic codes: DRGs and CCS. For the sake of simplicity, we only include the DRG when viewing the data via this website - however the underlying raw data and exports from this site also include the full CCS data.
Wikipedia article on APR-DRG Classification Codes.
More information on the AHRQ Clinical Classification Software (CCS) Code.