Version 3 (modified by 11 years ago) ( diff ) | ,
---|
SQL fields in REDCap
A useful, but specialised, REDCap field type is the 'sql' type. It allows for a database query to populate a dropdown selection element.
Full details on the REDCap wiki at https://iwg.devguard.com/trac/redcap/wiki/SQLFieldType
Implementing the sql field requires that the table housing the data is stored in the same database as the REDCap tables.
Examples from the documentation include the following:
select value from redcap_data where project_id = 390 and field_name='institution' order by value
- Returns just one value per result, which will form both the label and the value for the drop-down selection element.
select record, value from redcap_data where project_id = 390 and field_name = 'institution' and record in (select distinct record from redcap_data where project_id = 390 and field_name = 'consortium_status' and value != '4') order by value
- Returns two values per result, where the 'record' will become the value, and the 'value' will become the label of each option in the drop-down selection element.
There are essentially two types of sql field instances: (a) as an extensible, scalable version of the 'dropdown' data type, where the data to be read by the select query is essentially static, and (b) where the select query addresses data held in the REDCap_data table and generated as a result of REDCap data capture processes.
The former type allows for psuedo-dropdown elements in REDCap questionnaires, where the options can be added to without re-editing the data collection instrument. Remember that changes to the source SQL table would not be applied retrospectively to data collected through the REDCap questionnaire, so editing existing options once data collection is underway is NOT RECOMMENDED.
The second type allows for complex relationships between different studies in REDCap, where entries in one study can be dynamically linkes to entries in another study.
SQL fields in LCBRU REDCap inplementations
For those SQL fields created in LCBRU REDCap implementations which are not populated from other REDCap-derived data (i.e. not referencing the redcap_data table), the question arises of where to store the source data.
To avoid proliferation of tables within the LCBRU implementations of REDCap, it is proposed that a single additional table be created for all stored source data for sql fields. The table created will need to have at a minimum four columns: an object ID, a key to indicate which data collection form and which question the object relates to, the value and the label. Only the last two columns can be returned to the REDCap sql query.
Might be better to have separate columns for study, form and question (field, in REDCap parlance).
DRAFT database table definition:
CREATE TABLE sql_fields ( id MEDIUMINT UNSIGNED AUTO_INCREMENT PRIMARY KEY, project VARCHAR(100), form VARCHAR(100), field VARCHAR(100), value VARCHAR(100), label VARCHAR(100), INDEX 'reference' ('project','form','field'), KEY 'value_key' ('project','form','field','value'), KEY 'label_key' ('project','form','field','label'), FOREIGN KEY ('project') REFERENCES redcap_projects (project_name) ON UPDATE CASCADE, );
Entries into the table should therefore use the project_name from the REDCap redcap_projects database table (usually an all-lower-case rendition of the project title with underscores for spaces) for 'study', form_name and field_name from the redcap_metadata table for 'form' and 'field' respectively.
project_name is easy, because you can search for it in the redcap_projects database where project_id matches (look at the URL for the project to see the "pid=XX" attribute. field_name is kind of easy, because you can search for it in the redcap_metadata table, where project_id matches and the human-readable string is in element_label.
To find the form_name, you need to look for entries in the redcap_metadata where the project_id matches, the form_menu_description is not null (it will have the name of the forms in human readable format). Basically the first element in each form has a value in this column, and all other fields do not.
By using foreign keys we could force adherence to the REDCap object model and maintain the references even if the source data changes (although the SQL query itself would need to be updated in this situation, unless you built a very complex query).
But I think you can only use a foreign key if the corresponding column holds unique values, and that isn't true for the form name or field name columns in redcap_metadata - fields and forms only have to be unique within a particular project, not across the whole database. So maybe we can't have foreign keys for all three.
Need to check with the REDCap data model. I've asked for more information. In the meantime the above would work, but in a limited way, with no updating for field or form names.
The other approach is to create a project, with a data collection form to collect values and labels, and then query the redcap_data table for those entries. This is popular with the REDCap community but I (NH) really dislike it, as it puts metadata in the data table, which seems to be contrary to the database model of REDCap.
To begin with, we will use the table definition above, and do manual lookups for the project, form, and field names.
Re-use of option list
Of course, sometimes you want to use the same list more than once. In this case, the form and field names in the database should relate to the question which makes FIRST use of the list.