gravatar for Sebastian Hesse

2 hours ago by

Germany / Munich / Dr. von Hauner Children's Hospital

There are several questions about this topic but none of them received a clear answer, so here is my try to find a solution:

I am working with a proteome dataset that clearly has a batch effect from the date the MS measurement was done.
The dataset is also not normalised yet.
Now, which is the correct order to press the data?

1) Normalise the data first and remove the batch effect then (I use limmas "removebatcheffect")

OR

2) Remove batch effect from the data first and normalise then?

Please give straight up answers. If you feel the need to write: "well, depends on what you want to do...", please explain what you actually mean by that. I do not plan to normalise parts of the data in different ways, just straight up: normalise them all together and batch effect correct them all together (for date_processed only). Batch effect correction will only be used for visualisations, clustering and ML classifications, not for diff ex (there it will be included as a covariate).

Thanks a lot for your advice!
Sebastian

(I will try all, do more research and post results here to hopefully finally resolve this issue).



Source link