Changes between Version 2 and Version 3 of i2b2 Data Import Pathology Procedure v1


Ignore:
Timestamp:
03/12/13 10:31:55 (11 years ago)
Author:
Richard Bramley
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • i2b2 Data Import Pathology Procedure v1

    v2 v3  
    2626
    2727Runs the stored procedure USP_DWH_INSERT_PATHOLOGY_I2B2 in the i2b2ClinDataIntegration database, which loads data into the destination i2b2 database from the view UVW_BRICCS_PATHOLOGY_RESULTS in the DWBRICCS database on the UHL data warehouse as a linked server.
     28
     29== Duplicate Processing
     30
     31Version one of the data load identifies some records as being duplicates because they have the same patient, sample collection datetime and concept code.  When a duplicate is identified it discards the most recent record.  This is probably not correct for several reasons:
     32
     331. If there are more that two duplicates, it only discards one record and so there will still be a duplicate.
     341. Common sense and reason 1 suggest that it should be keeping the most recent record.
     351. There may be a better way to identify which record is correct.  For example, if the result has been suppressed (result suppression will not solely solve the problem).
     361. Both records may be valid.
     37
     38Paul Smalley has agreed to look at the duplicate records to find out the reasons for the duplication.