Differences between structured databases and unstructured information in rheumatoid arthritis (RA) research

| Characteristic | Structured databases | Unstructured information |
| --- | --- | --- |
| Definition | Uses diagnostic codes and predefined formats | Found in free text or images |
| Data source | Claims; prescriptions and administrative databases | Clinical notes; imaging data |
| Data collection | International Classification of Diseases, 9th edition (ICD-9), ICD-10 codes | Natural language processing (NLP) for text; convolutional neural network (CNN) for imaging |
| Examples of RA research | Detailed study of comorbidities; treatment safety | Identification of RA patients; extraction of outcome measures |
| Limitations | Limited by predefined formats; requires systematic coding; possible missing variables and biases | Analytical challenges; require precision in data detection; design challenges in algorithms |
| Benefits | Systematic and standardized data; detection of long-term trends; prevalence in broad populations | Enhances collection of specific features; contributes to multimodal research |
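
To make the contrast in the "Data collection" row concrete, the following minimal Python sketch identifies RA patients by two routes: matching diagnostic codes in structured records, and scanning free-text notes. It is illustrative only and is not taken from the source. The function names (`has_ra_code`, `note_mentions_ra`) are hypothetical, the code prefixes (ICD-9-CM 714.0; ICD-10 M05 and M06) are an assumed case definition, and the keyword-plus-negation rule is a deliberately naive stand-in for a real NLP pipeline.

```python
import re

# Assumed case definition: ICD-9-CM 714.0 and ICD-10 M05/M06 (dotted format).
RA_ICD9_PREFIXES = ("714.0",)
RA_ICD10_PREFIXES = ("M05", "M06")


def has_ra_code(codes):
    """Structured route: True if any diagnostic code falls under an RA prefix."""
    prefixes = RA_ICD9_PREFIXES + RA_ICD10_PREFIXES
    for code in codes:
        normalized = code.strip().upper()
        if normalized.startswith(prefixes):
            return True
    return False


# Hypothetical, naive text rules; a real NLP system would be far more robust.
RA_MENTION = re.compile(r"\brheumatoid arthritis\b", re.IGNORECASE)
NEGATION = re.compile(
    r"\b(no|denies|without|ruled out)\b[^.]*\brheumatoid arthritis\b",
    re.IGNORECASE,
)


def note_mentions_ra(note):
    """Unstructured route: True if some sentence mentions RA without a negation cue."""
    for sentence in note.split("."):
        if RA_MENTION.search(sentence) and not NEGATION.search(sentence):
            return True
    return False
```

The structured route is a simple lookup but depends entirely on what was coded, whereas even this toy text rule already needs negation handling, which reflects the "Limitations" row: predefined formats on one side, precision and algorithm design challenges on the other.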