Skip to main content

Table 1 Recurrence of quality makers in selected datasets. Thirteen datasets with low-quality imbalance were used to derive quality marker genes. Genes whose expression correlated positively or negatively with low sample quality in at least 2 datasets were defined as low-quality or high-quality markers, respectively

From: Overlooked poor-quality patient samples in sequencing data impair reproducibility of published clinically relevant datasets

Recurring datasets

Low-quality markers

High-quality markers

2 (15%)

7708

5243

3 (23%)

3443

2405

4 (31%)

1597

951

5 (38%)

724

287

6 (46%)

320

76

7 (54%)

136

15

8 (62%)

51

0

9 (69%)

7

0

10 (77%)

1

0