This page summarizes the findings from a comprehensive pre-QC investigation of the raw PLINK files
(ConvSK_raw: 654,027 variants × 1,247 samples, Illumina GSA-24v3-0_A2 array).
The investigation examined chip-level quality, sample identity (IBD), d/t replicate integrity,
heterozygosity, sex check, and contamination indicators before any QC filtering was applied.
Full interactive dashboard:
sample_investigation_v2.html
(17 sections with Plotly charts, sortable tables, and per-sample verdicts).
Data source: data/investigation_data_v2.json.
This page has two objectives:
- Cross-check Steps 1–15 — verify that all QC decisions are consistent with investigation findings
- GWAS sample filtering — definitive reference for which samples to keep, remove, and flag for future phenotype-association analyses