-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Workflow validation #16
Comments
Sorry about the silence on this. Thank you for pointing out this problem. I might realistically be able to have a look at this in more detail in May. It looks like you are doing everything correctly. The final line could be simpler: sum(pv$fdr < 0.05). It could help if you provided some output for the CpG site associations you were expecting to see, e.g. a boxplot showing the methylation levels by group of those CpG site(s) in the meffil and minfi normalized datasets, a scatterplot of the methylation levels for those site(s) meffil vs minfi, and the summary statistics for those site(s) from meffil and minfi. I might be able to determine the reason for the discrepancy by looking at those. |
Hi, Thanks for looking into this. I have a small correction to the script above. I changed Here is a plot of few top significant probes using minfi workflow. The top probes are selected from a testing between act_naive-act_rTreg. The values used in the plot is quantile beta values to make it comparable with the meffil plot. The actual data used in minfi for DM is M values. Here is the same probes using meffil's beta values. It seems like the groups are distinct, but they don't show up as significant in meffil results. Here are the meffil FDR values for these probes.
The full script used for minfi workflow, meffil workflow and plots is attached here. |
Hi, I am testing the meffil workflow with a publicly available dataset to see if I get the same results. But, I am unable to find any DMPs. I was able to reproduce the results using minfi but not using meffil.
So I am running this minfi workflow and the data is in this R package which needs to be installed from source.
Here is the code I use. Perhaps, I am doing something wrong or there is some explanation. One sample is discarded due to high detection p-value, cell type info is not used, norm is quantile. Also, I am unsure how to model contrasts with variables that have more than two levels.
The text was updated successfully, but these errors were encountered: