geneSync: Gene Symbol Harmonization for Large-scale RNA-seq Data Integration (opens in new tab)
Cross-cohort integration of transcriptomic data is a routine strategy for boosting statistical power and enhancing generalizability. However, gene nomenclature inconsistencies across datasets-arising from annotation version updates, historical renaming, and synonym reassignment-introduce silent mismatches during feature alignment, causing genes to be falsely classified as absent or split into duplicate features. Here, we present geneSync, an R package that performs gene symbol harmonization a...
Read the original article