Minimizing data linkage error in an ETL pipeline using R: an intersection of MIMIC III and ODK database
What can you learn from this article?
Understand the concepts of data linkage, especially deterministic linakge. Address linkage error in the conjunction of MIMIC III (served in a postgreSQL database) and ODK database. Employ R to design the Extract, Transform, and Load (ETL) pipeline. Use Quarto document to generate a report in PDF format. Concepts of Data Linkage In a data scientist’s typical day, the merge/join function is an inevitable task.
2024-01-06