Below is the entity-relationship diagram (ERD) that provides an overview of the relational CLIF database structure. This version of CLIF is
Relational CLIF tables are organized into clinically relevant column categories - demographics, objective measures, respiratory support, orders, and inputs-outputs. Below are sample templates for each table in R-CLIF. Here you can find detailed descriptions of each table and their fields.
You can use our custom GPT- CLIF Assistant to learn more about CLIF and develop analysis scripts using Clifford (Synthetic CLIF).
Patient_encounters
patient_id |
VARCHAR |
ID variable for each patient. Every patient assigned a unique identifier is presumed to be a distinct individual |
encounter_id |
VARCHAR |
ID variable for each patient encounter (a given patient can have multiple encounters). Each encounter_id represents a unique hospitalization for a patient, capturing the entire duration of the hospital stay. This is the primary key for most other tables |
Example:
Patient_demographics
encounter_id |
VARCHAR |
ID variable for each patient encounter |
|
race |
VARCHAR |
Description of patient’s race. Each site could have different strings in source data |
Black , White , American Indian or Alaska Native , Asian , Native Hawaiian or Other Pacific Islander , Unknown , Other |
ethnicity |
VARCHAR |
Description of patient’s ethnicity |
Hispanic , Non-Hispanic , Unknown |
sex |
VARCHAR |
Patient’s biological sex |
Male , Female , Unknown |
Example:
Limited_identifiers
encounter_id |
VARCHAR |
ID variable for each patient encounter. Each encounter_id represents a unique hospitalization for a patient, capturing the entire duration of the hospital stay |
admission_dttm |
DATETIME |
Date and time the patient is admitted (in the format %Y-%m-%d %H:%M:%S). Use this date to determine the start date and time of the patient hospitalization |
discharge_dttm |
DATETIME |
Date and time the patient is discharged (in the format %Y-%m-%d %H:%M:%S). Use this date to determine the distacharge date of the patient hospitalization |
birth_date |
DATETIME |
Patient date of birth. This variable is used to calculate age at admission for analysis |
zipcode_9digit |
VARCHAR |
Patient zipcode. This variable is used to link the database with other indices like ADI, SVI etc |
Example:
Encounter_demographics_disposition
encounter_id |
VARCHAR |
ID variable for each patient encounter. |
|
age_at_admission |
INT |
Age of the patient at the time of admission. Calculated using the admission_dttm and birth_date from the limited identifiers table. |
|
disposition_name |
VARCHAR |
Original disposition name string recorded in the raw data. This field allows for the storing of the dispostion value as it appears in the source data. This field is not used for analysis. |
|
disposition_category |
VARCHAR |
Description of disposition when discharged. Map source values stored in disposition_name to the mCIDE categories. |
Home , Hospice , Discharged to another facility , Dead , Admitted , Other |
Example:
ADT
encounter_id |
VARCHAR |
ID variable for each patient encounter |
|
hospital_id |
VARCHAR |
Assign an ID to each hospital in the hospital system |
|
in_dttm |
DATETIME |
Start date and time at a particular location |
Datetime format should be %Y-%m-%d %H:%M:%S |
out_dttm |
DATETIME |
End date and time at a particular location |
Datetime format should be %Y-%m-%d %H:%M:%S |
location_name |
VARCHAR |
Location of the patient inside the hospital. This field is used to store the patient location from the source data. This field is not used for analysis. |
No restriction |
location_category |
VARCHAR |
Map location_name from the source data to categories identified under CLIF. |
ER , OR , ICU , Ward , Other |
Example:
Vitals
encounter_id |
VARCHAR |
ID variable for each patient encounter. |
|
recorded_dttm |
DATETIME |
Date and time when the vital is recorded. |
Datetime format should be %Y-%m-%d %H:%M:%S |
vital_name |
VARCHAR |
This field is used to store the description of the flowsheet measure from the source data. This field is not used for analysis. |
No restriction |
vital_category |
VARCHAR |
Map flowsheet measures stored in vital_name to the to categories identified under CLIF. |
temp_c , pulse , sbp , dbp , spo2 , respiratory_rate , map , height_inches , weight_kg |
vital_value |
DOUBLE |
Recorded value of the vital. Ensure that the measurement unit is aligned with the permissible units of measurements. |
temp_c = Celsius , height_inches = Inch , weight_kg = Kg , map = mm/Hg , spo2 = % , No unit for pulse , sbp , dbp , and respiratory_rate |
meas_site_name |
VARCHAR |
Site where vital is recorded |
No restrictions. Record the site name from the source data. |
Example:
Labs
encounter_id |
VARCHAR |
ID variable for each patient encounter. |
|
lab_order_dttm |
DATETIME |
Date and time when the lab is ordered. |
Datetime format should be %Y-%m-%d %H:%M:%S |
lab_collect_dttm |
DATETIME |
Date and time when the specimen is collected. |
Datetime format should be %Y-%m-%d %H:%M:%S |
lab_result_dttm |
DATETIME |
Date and time when the lab results are available. |
Datetime format should be %Y-%m-%d %H:%M:%S |
lab_name |
VARCHAR |
Original lab name string recorded in the raw data. This field is not used for analysis. |
|
lab_category |
VARCHAR |
43 labs identified by the CLIF consortium. |
List of lab categories in CLIF |
lab_group |
VARCHAR |
Lab categories roll up to form lab groups. |
ABG , BMP , CBC , Coags , LFT , Lactic Acid , Misc , VBG |
lab_value |
DOUBLE |
Recorded value corresponding to a lab. |
|
reference_unit |
VARCHAR |
Unit of measurement for that lab . |
Permissible reference values for each lab_category listed here |
lab_type_name |
VARCHAR |
Type of lab. |
arterial , venous , standard , poc |
Example:
Note: The lab_value
field often has non-numeric entries that are useful to make project-specific decisions. A site may choose to keep the lab_value
field as a character and create a new field lab_value_numeric
that only parses the character field to extract the numeric part of the string.
Microbiology
encounter_id |
VARCHAR |
ID variable for each patient encounter. |
|
test_id |
VARCHAR |
An ID for a specific component, such as a gram culture smear, taken from a fluid sample with a unique order and collection time, if two different pathogens are identified, the result will be recorded as two separate rows, each sharing the same test_id. |
|
order_dttm |
DATETIME |
Date and time when the test is ordered. |
Datetime format should be %Y-%m-%d %H:%M:%S |
collect_dttm |
DATETIME |
Date and time when the specimen is collected. |
Datetime format should be %Y-%m-%d %H:%M:%S |
result_dttm |
DATETIME |
Date and time when the results are available. |
Datetime format should be %Y-%m-%d %H:%M:%S |
fluid_name |
VARCHAR |
Cleaned fluid name string from the raw data. This field is not used for analysis. |
No restriction. Check this file for examples: clif_vocab_microbiology_fluid_ucmc.csv |
fluid_category |
VARCHAR |
Fluid categories defined according to the NIH common data elements. |
CDE NIH Infection Site |
component_name |
VARCHAR |
Original componenet names from the source data. |
No restriction |
component_category |
VARCHAR |
Map component names to the categories identified under CLIF. |
culture , gram stain , smear |
organism_name |
VARCHAR |
Cleaned oragnism name string from the raw data. This field is not used for analysis. |
No restriction. Check this file for examples: clif_vocab_microbiology_organism_ucmc.csv |
organism_category |
VARCHAR |
Organism categories defined according to the NIH common data elements. |
CDE NIH Organism |
Example:
Respiratory_support
encounter_id |
VARCHAR |
ID variable for each patient encounter |
|
recorded_dttm |
DATETIME |
Date and time when the device started |
Datetime format should be %Y-%m-%d %H:%M:%S |
device_name |
VARCHAR |
Includes raw string of the devices. Not used for analysis |
Example mapping for device name to device category |
device_category |
VARCHAR |
Includes a limited number of devices identified by the CLIF consortium |
Vent , NIPPV , CPAP , High Flow NC , Face Mask , Trach Collar , Nasal Cannula , Room Air , Other |
mode_name |
VARCHAR |
Includes raw string of the modes. Not used for analysis |
Example mapping for mode name to mode category |
mode_category |
VARCHAR |
Limited number of modes identified by the CLIF consortium |
Assist Control-Volume Control , Pressure Support/CPAP , Pressure Control , Pressure-Regulated Volume Control , Other , SIMV , Blow by |
tracheostomy |
BOOLEAN |
Indicates if tracheostomy is performed |
0 = No, 1 = Yes |
fio2_set |
DOUBLE |
Fraction of inspired oxygen set |
|
lpm_set |
DOUBLE |
Liters per minute set |
|
tidal_volume_set |
DOUBLE |
Tidal volume set (in mL) |
|
resp_rate_set |
DOUBLE |
Respiratory rate set (in bpm) |
|
pressure_control_set |
DOUBLE |
Pressure control set (in cmH2O) |
|
pressure_support_set |
DOUBLE |
Pressure support set (in cmH2O) |
|
flow_rate_set |
DOUBLE |
Flow rate set |
|
peak_inspiratory_pressure_set |
DOUBLE |
Peak inspiratory pressure set (in cmH2O) |
|
inspiratory_time_set |
DOUBLE |
Inspiratory time set (in seconds) |
|
peep_set |
DOUBLE |
Positive-end-expiratory pressure set (in cmH2O) |
|
tidal_volume_obs |
DOUBLE |
Observed tidal volume (in mL) |
|
resp_rate_obs |
DOUBLE |
Observed respiratory rate (in bpm) |
|
plateau_pressure_obs |
DOUBLE |
Observed plateau pressure (in cmH2O) |
|
peak_inspiratory_pressure_obs |
DOUBLE |
Observed peak inspiratory pressure (in cmH2O) |
|
peep_obs |
DOUBLE |
Observed positive-end-expiratory pressure (in cmH2O) |
|
minute_vent_obs |
DOUBLE |
Observed minute ventilation (in liters) |
|
|
|
|
|
Example:
Medication_admin_continuous
encounter_id |
VARCHAR |
ID variable for each patient encounter |
|
med_order_id |
VARCHAR |
Medication order id. Foreign key to link this table to other medication tables |
|
admin_dttm |
DATETIME |
Date and time when the medicine was administered |
Datetime format should be %Y-%m-%d %H:%M:%S |
med_name |
VARCHAR |
Original med name string recorded in the raw data for a limited number of labs identified by the CLIF consortium |
Example mapping of med_name to med_category |
med_category |
VARCHAR |
Limited number of medication categories identified by the CLIF consortium |
List of continuous medication categories in CLIF |
med_route |
VARCHAR |
eod of medicine delivery |
|
med_dose |
VARCHAR |
quantity taken in dose |
|
med_dose_unit |
VARCHAR |
unit of dose |
|
Example:
Note: The medication_admin_intermittent
table has exactly the same schema. The consortium decided to separate the medications that are administered intermittenly from the continuously administered medications.
Dialysis
encounter_id |
VARCHAR |
ID variable for each patient encounter |
|
start_dttm |
DATETIME |
Start date and time of dialysis session |
Datetime format should be %Y-%m-%d %H:%M:%S |
stop_dttm |
DATETIME |
Stop date and time of dialysis session |
Datetime format should be %Y-%m-%d %H:%M:%S |
dialysis_type |
VARCHAR |
Type of dialysis performed |
intermittent , peritoneal , crrt |
dialysate_flow_amount |
DOUBLE |
Amount of dialysate flow |
|
ultrafiltration_amount |
DOUBLE |
Amount of ultrafiltration |
|
Example:
Position
encounter_id |
VARCHAR |
ID variable for each patient encounter. This table only includes those encounters that have proning documented ever. |
|
recorded_dttm |
DATETIME |
Date and time when the vital is recorded. |
Datetime format should be %Y-%m-%d %H:%M:%S |
position_name |
VARCHAR |
This field is used to store the description of the position from the source data. This field is not used for analysis. |
No restriction |
position_category |
VARCHAR |
Map position_name to the to categories identified under CLIF. |
prone , not_prone |
Example: