= i2b2 Data Import PATS Mapping v1 Tags: [[i2b2 Summary]] The PATS mappings are split into text and numeric fields. == Text Fields Text fields are mapped to the i2b2 Observation Fact table like this ||i2b2 Field||Source|| ||Encounter_Num||i2b2 Patient ID|| ||Patient_Num||i2b2 PatientID|| ||Concept_Cd||Enum lookup value (see below)|| ||Provider_Id||"@"|| ||Start_Date||The source table observation date field (see below)|| ||Modifier_Cd||"@"|| ||ValType_Cd||"T" (text)|| ||Units_Cd||"@"|| ||Location_Cd||"@"|| ||Update_Date||The current date using the GETDATE() Function|| ||SourceSystem_Cd||"BRICCS"|| ||Upload_Id||"1"|| '''''SOMETIME (RB): Convert Update_Date to use a date parameter. It's just a little quicker.''''' '''''SOMETIME (RB): Should the SourceSystem_Cd not be "PATS"? Or maybe "MINAP" or whatever.''''' '''''SOMETIME (RB): The Upload_Id is presumably always one because everything is reloaded each time. Might be nice to change this to something more meaningful.''''' === Enum Lookup Value The Concept Code to use is found in the table DATA_DICTIONARY_ENUM_VALUES in the DWBRICCS database on UHLDWH server. This table contains records for concept code keyed on source table name, column name and column value. NULL column values may be mapped to the field '(empty)'. '''''ACTION (RB): Check if all possible values of column value contain a mapping in the ''DATA_DICTIONARY_ENUM_VALUES'' table.''''' === Source Table Observation Date Field For each source table in the there is a single field that is used as the start date for all columns from that field. The observation date field to use for each table is stored in the ''pats_tables'' table in the DWBRICS on the UHLDWH server. == Numeric Fields Numeric fields are mapped to the i2b2 Observation Fact table like this ||i2b2 Field||Source|| ||Encounter_Num||i2b2 Patient ID|| ||Patient_Num||i2b2 PatientID|| ||Concept_Cd||"PTS:{TABLE_NAME}_{COLUMN_ID}"|| ||Provider_Id||"@"|| ||Start_Date||The source table observation date field (see above)|| ||Modifier_Cd||"@"|| ||ValType_Cd||"N" (numeric)|| ||TVal_Char||"E" (see below)|| ||Units_Cd||Value calculated by ufn_get_units function (see below)|| ||Location_Cd||"@"|| ||Update_Date||The current date using the GETDATE() Function|| ||SourceSystem_Cd||"BRICCS"|| ||Upload_Id||"1"|| === TVal_Char The ''TVal_Char'' describes the meaning of the value contained in the ''NVal_Num'' field. For all values currently the value of ''TVal_Char'' is set to "E", which means equals. That is, the value in the ''NVal_Num'' field is the result of the test. All possible values of ''TVal_Char'' are: ||Value||Meaning|| ||E||Equals|| ||NE||Not Equal|| ||L||Less Than|| ||LE||Less Than or Equals|| ||G||Greater Than|| ||GE||Greater Than or Equals|| From a casual look at the data, I think that a value of less than X, or greater than Y is sometimes used for Pathology data. === ufn_get_units Function The function ufn_get_units in the DWBRICCS database is called with parameters of ''table_name'' and ''field_name'' to determine the units. '''''~~ACTION (RB): Work out what this function actually does.~~''''' The function just selects the units column from the Data_Dictionary table in the DWBRICCS database. If the result is an empty string, it returns 'units' instead. Don't know why it would do that. '''''ACTION (RB): Rewrite numeric data extract to do the same, but with a join. Or it could be moved into the query that copies all the numeric fields into the recordset for looping with the for each. Either really.''''' == Tables The PATS i2b2 data integration load maps source table types to either Text or Numeric. All other types are ignored. The mapping is as follows. ||Source Type||i2b2 Type|| ||VARCHAR||Text|| ||INT or FLOAT||Numeric|| ||All other types||NOT LOADED|| In addition, all fields with certain names are also ignored. '''Ignored Field Names''' * !PatientId * !EntryId * !PreviousEntry * !NextEntry Comments about data: '''''COMMENT (RB): Which fields should be removed?''''' '''''~~COMMENT (RB): All date fields are ignored, but could they be converted into numeric fields. For example, length of stay, time between stays, etc. This may already be being done.~~''''' Nicks says that generally this information is not important for cohorting. '''''COMMENT (RB): There are quite a few denormalised fields. For example: drug1, drug2 - could these be normalised into one concept code. Or: readmittedDate1, readmittedDate2 - could these be different encounters? Are these already being recorded?''''' '''''COMMENT (RB): Some dates are recorded as varchars and so presumably are being imported''''' '''''~~COMMENT (RB): Are some text fields also being ignored because they don't have entries in the enum data dictionary?~~ Yes they are ignored.''''' '''''ACTION (RB): Rewrite extracting of varchar fields to check that there are any entries in the ENUM Data Dictionary for that field. This should reduce the number of queries that need to be run in the for each and speed things up quite a bit.''''' == Registries === Myocardial Ischaemia National Audit Project (MINAP) Minap data is contained within the following mapped tables in the data warehouse * [[i2b2 - UHL clinical informatics system integration PATS GGHMINAP_GGHMINAP|GGHMINAP_GGHMINAP]] * [[i2b2 - UHL clinical informatics system integration PATS GGHMINAPNEW|GGHMINAPNEW]] * [[i2b2 - UHL clinical informatics system integration PATS GGHMINAPNEW_GGHMINAPINIVW|GGHMINAPNEW_GGHMINAPINIVW]] * [[i2b2 - UHL clinical informatics system integration PATS LRIMINAPNEW|LRIMINAPNEW]] * [[i2b2 - UHL clinical informatics system integration PATS LRIMINAPNEW_LRIMINAPNEWINITIALVW|LRIMINAPNEW_LRIMINAPNEWINITIALVW]] And the following unmapped tables * [[i2b2 - UHL clinical informatics system integration PATS MINAPGGH_MINAPGGHINIVW |MINAPGGH_MINAPGGHINIVW ]] * [[i2b2 - UHL clinical informatics system integration PATS MINAPLRI_MINAPLRIINIVW |MINAPLRI_MINAPLRIINIVW ]] (This table only contains records for 2 patients. The details of both have been copied into the table MINAPGGH_MINAPGGHINIVW.) These tables should be combined into one mapping: [[i2b2 - UHL clinical informatics system integration PATS Combined Minap|MINAP Combined Minap]] === Cardiac Surgery? (CS) This information is recorded at 4 times: pre-procedure, post-procedure, sub-procedure and follow up. There are several candidate tables for each of these. ==== Pre-Procedure * [[i2b2 - UHL clinical informatics system integration PATS CS2003F_PreProcedure|CS2003F_PreProcedure]] - currently mapped * [[i2b2 - UHL clinical informatics system integration PATS CS2010F_PreProcedure|CS2010F_PreProcedure]] - currently mapped * [[i2b2 - UHL clinical informatics system integration PATS CS_PreProcedure|CS_PreProcedure]] - not currently mapped These tables should be combined into one mapping: [[i2b2 - UHL clinical informatics system integration PATS CS PreProcedure Combined|CS Pre-Procedure Combined Mapping]] ==== Sub-Procedure * [[i2b2 - UHL clinical informatics system integration PATS CS2003F_SubProcedure|CS2003F_SubProcedure]] - not currently mapped * [[i2b2 - UHL clinical informatics system integration PATS CS2010F_SubProcedure|CS2010F_SubProcedure]] - not currently mapped * [[i2b2 - UHL clinical informatics system integration PATS CS_SubProcedure|CS_SubProcedure]] - not currently mapped These tables do not have a candidate for a start date. However, they do have a ParentEntryID, which presumably relates to a pre- or post- procedure record. Were these tables to be mapped they should be combined into one mapping: [[i2b2 - UHL clinical informatics system integration PATS CS SubProcedure Combined|CS Sub-Procedure Combined Mapping]] ==== Post-Procedure * [[i2b2 - UHL clinical informatics system integration PATS CS2003F_PostProcedure|CS2003F_PostProcedure]] - currently mapped * [[i2b2 - UHL clinical informatics system integration PATS CS2010F_PostProcedure|CS2010F_PostProcedure]] - currently mapped * [[i2b2 - UHL clinical informatics system integration PATS CS_PostProcedure|CS_PostProcedure]] - not currently mapped These tables should be combined into one mapping: [[i2b2 - UHL clinical informatics system integration PATS CS PostProcedure Combined|CS Post-Procedure Combined Mapping]] ==== Follow Up * [[i2b2 - UHL clinical informatics system integration PATS CS2003F_FollowUp|CS2003F_FollowUp]] - not currently mapped * [[i2b2 - UHL clinical informatics system integration PATS CS_FollowUp|CS_FollowUp]] - not currently mapped There is nothing in CS_FollowUp. Not sure that there is much of use in CS2003F_FollowUp. === Catheter * [[i2b2 - UHL clinical informatics system integration PATS CATH_CATHINITIALVW|CATH_CATHINITIALVW]] * [[i2b2 - UHL clinical informatics system integration PATS CATH_CATH|CATH_CATH]] - not currently mapped These tables should be combined into one mapping: [[i2b2 - UHL clinical informatics system integration PATS CATH Combined|CATH Combined]] === EPS * [[i2b2 - UHL clinical informatics system integration PATS EPS_EPSINIVW|EPS_EPSINIVW]] * [[i2b2 - UHL clinical informatics system integration PATS EPS_EPS|EPS_EPS]] - Not currently mapped These tables should be combined into one mapping: [[i2b2 - UHL clinical informatics system integration PATS EPS Combined|EPS Combined]] === Pacemakers * [[i2b2 - UHL clinical informatics system integration PATS GGHPACEICD_GGHPACEICDINIALVW|GGHPACEICD_GGHPACEICDINIALVW]] * [[i2b2 - UHL clinical informatics system integration PATS PACE_PACE|PACE_PACE]] - not currently mapped * [[i2b2 - UHL clinical informatics system integration PATS GGHPACEICD_GGHPACEICD|GGHPACEICD_GGHPACEICD]] - not currently mapped These tables should be combined into one mapping: [[i2b2 - UHL clinical informatics system integration PATS PACE Combined|PACE Combined]] === Percutaneous Coronary Intervention (PCI) ==== Procedure * [[i2b2 - UHL clinical informatics system integration PATS GGHPCI_INITIALGGHPCIVIEW|GGHPCI_INITIALGGHPCIVIEW]] * [[i2b2 - UHL clinical informatics system integration PATS GGHPCI_GGHPCI|GGHPCI_GGHPCI]] - Not currently mapped These tables should be combined into one mapping: [[i2b2 - UHL clinical informatics system integration PATS PCI Combined|PCI Combined]] ==== Follow Up Although this table is mapped, it may not contain any useful information * [[i2b2 - UHL clinical informatics system integration PATS GGHPCI_FLWGGHPCIVIEW|GGHPCI_FLWGGHPCIVIEW]] === Cardioversion * [[i2b2 - UHL clinical informatics system integration PATS VERTED_VERTEDINIALVW|VERTED_VERTEDINIALVW]] * [[i2b2 - UHL clinical informatics system integration PATS VERTED_VERTED|VERTED_VERTED]] - Not currently mapped These tables should be combined into one mapping: [[i2b2 - UHL clinical informatics system integration PATS Cardioversion Combined|Cardioversion Combined]] === Rehab * [[i2b2 - UHL clinical informatics system integration PATS UHLREHAB_UHLREHABFLW|UHLREHAB_UHLREHABFLW]] - Not Currently Mapped * [[i2b2 - UHL clinical informatics system integration PATS UHLREHAB_UHLREHABVWpt1|UHLREHAB_UHLREHABVWpt1]] - Not currently mapped * [[i2b2 - UHL clinical informatics system integration PATS UHLREHAB_UHLREHABVWpt2|UHLREHAB_UHLREHABVWpt2]] - Not currently mapped These tables should be combined into one mapping: [[i2b2 - UHL clinical informatics system integration PATS Rehab Combined|Rehab Combined]] [[BackLinks]]