Context Navigation

Changes between Version 2 and Version 3 of i2b2 Data Import Pathology Procedure v1

-              v2
+              v3
 Runs the stored procedure USP_DWH_INSERT_PATHOLOGY_I2B2 in the i2b2ClinDataIntegration database, which loads data into the destination i2b2 database from the view UVW_BRICCS_PATHOLOGY_RESULTS in the DWBRICCS database on the UHL data warehouse as a linked server.
+== Duplicate Processing
+Version one of the data load identifies some records as being duplicates because they have the same patient, sample collection datetime and concept code.  When a duplicate is identified it discards the most recent record.  This is probably not correct for several reasons:
+. If there are more that two duplicates, it only discards one record and so there will still be a duplicate.
+. Common sense and reason 1 suggest that it should be keeping the most recent record.
+. There may be a better way to identify which record is correct.  For example, if the result has been suppressed (result suppression will not solely solve the problem).
+. Both records may be valid.
+Paul Smalley has agreed to look at the duplicate records to find out the reasons for the duplication.