Changes between Version 1 and Version 2 of i2b2 - UHL clinical informatics system integration PATS v1


Ignore:
Timestamp:
03/12/13 11:05:48 (11 years ago)
Author:
Richard Bramley
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • i2b2 - UHL clinical informatics system integration PATS v1

    v1 v2  
    3333The field DateOfAdmission is used as the start date for the EPS_EPSINIVW table.  Alternative fields for start date on this table are date_timeOfProcedure and procedureStartTime.  Both of these, however, also have records where the date is NULL.  All three fields could be coalesced to reduce the risk of an error.
    3434
    35 == Duplicate Processing
     35== Duplicate Records
    3636
    37 Version one of the data load identifies some records as being duplicates because they have the same patient, sample collection datetime and concept code.  When a duplicate is identified it discards the most recent record.  This is probably not correct for several reasons:
     37Records are duplicate if they have duplicate values for all of the fields Encounter Number, Concept Code, Provider ID, Start Date and Modifier ID, of which Provider ID and Modifier ID are not used.  The key field is therefore start date, which has the following source for the records on which a duplicate was found:
    3838
    39 1. If there are more that two duplicates, it only discards one record and so there will still be a duplicate.
    40 1. Common sense and reason 1 suggest that it should be keeping the most recent record.
    41 1. There may be a better way to identify which record is correct.  For example, if the result has been suppressed (result suppression will not solely solve the problem).
    42 1. Both records may be valid.
     39||Table||No. Duplicates||Start Date Field||
     40||CATH_CATHINITIALVW||5||DateOfCatheter||
     41||CS2003F_PostProcedure||1||DateOfDischargeOrDeath||
     42||CS2010F_PostProcedure||1||DateOfDischargeOfDeath||
     43||EPS_EPSINIVW||1||DateOfAdmission||
     44||GGHPCI_FLWGGHPCIVIEW||169||DateOfFollowup||
     45||GGHPCI_INITIALGGHPCIVIEW||2||DateOfProcedureCCAD||
    4346
    44 Paul Smalley has agreed to look at the duplicate records to find out the reasons for the duplication.