Changes between Version 6 and Version 7 of LEGACY - ProcessingPidSet
Timestamp: 12/30/11 17:37:01
=== Cardinality, with some examples. ===

==== __Example One__ ====
First of all, this is wrong. Every pid must have a patient_id.
{{{
…
}}}

==== __Example Two__ ====
This is acceptable:
{{{
…
}}}

==== __Example Three__ ====
The following example is also acceptable, but implies we know the participant is either:
 * already within the CRC (and that we know the internal identifier = 2) or:
 * this is a new participant and we are ourselves assigning a new i2b2 internal identifier (= 2) for them.
Both situations are ones we can avoid by adopting the approach of Example Two above, and omitting the HIVE as a source.
{{{
<pid>
…
}}}

==== __Comment__ ====
As far as I can tell, a row in the temporary table covers a patient_id / patient_map_id combination. So:
 * Example Two would give rise to one row.
 * Example Three would give rise to two rows.

=== First Stage: Eliminate Duplicates. ===
Any "duplicates" are eliminated from the temporary table.
A duplicate is one where another row matches on:
 1. patient_id
 1. patient_id source
 1. patient_map_id
 1. patient_map_id source

=== Second Stage: Process HIVE as a Source. ===
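Before any HIVE-specific handling, the duplicate rule from the First Stage can be sketched as follows. This is only an illustration, not the actual CRC loader code: the `PidRow` type, the `eliminate_duplicates` function, and the sample values (`SITE_A`, `100`, `200`) are all hypothetical, and the temporary table is modelled as a plain Python list. The only thing taken from the description above is that two rows count as duplicates when all four fields match.

```python
from typing import List, NamedTuple


class PidRow(NamedTuple):
    """Hypothetical model of one temporary-table row (names are assumptions)."""
    patient_id: str
    patient_id_source: str
    patient_map_id: str
    patient_map_id_source: str


def eliminate_duplicates(rows: List[PidRow]) -> List[PidRow]:
    """Keep the first occurrence of each row; drop later rows that match
    on all four fields (patient_id, its source, patient_map_id, its source)."""
    seen = set()
    unique = []
    for row in rows:
        # NamedTuple equality and hashing compare all four fields at once.
        if row not in seen:
            seen.add(row)
            unique.append(row)
    return unique


# Illustrative values only; they are not taken from the wiki examples.
row_a = PidRow("100", "SITE_A", "100", "SITE_A")
row_b = PidRow("100", "SITE_A", "2", "HIVE")
deduplicated = eliminate_duplicates([row_a, row_b, row_a])
```

Here `row_a` appears twice, so one copy is dropped, while `row_b` survives because it differs in patient_map_id and patient_map_id source.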