Mendelian Randomisation

type

status

date

slug

summary

The Tutorial of Mendelian Randomisation

1. download the software

download link：https://disk.yandex.com/d/ixz9-frc8VyEKg

2. click the subMR.exe

I shared the "workDir.zip" with Quark network disk, you can open the link through your browser to download and extract the zip to the working path.

Link: https://pan.quark.cn/s/c12d96c81de6

Take the working path D:/appTest/mr/workDir as an example:

1kg.v3 and plinkbinr folders should be placed under the working path!

3. Fill the setting

For convenience, I use the rows and columns as the coordinates of the buttons, such as (1,1) represents “WorkDir” which is a show-lineEdit.

Working environment

WorkDir: All results are placed under the work path with subfolder as WorkDir/mrProject/Project Name/

you can click (1,2) “WorkDir” to choose your working folder of this project.
Of course, you can also edit (1,1) “WorkDir” manually.

R Path or Server Address:

you can click (1,4) “R Path” to choose your R environment on your PC like I:/App/R/R-4.3.3.

Of course, you can also edit (1,3) “R Path or Server Address” manually by using a R path.
If you have a linux R server environment, you can edit (1,3) “R Path or Server Address” manually by using a host:port like 127.0.0.1:6311

Project Name:

you can edit (1,5) “Project Name” manually.

Batch Mendelian randomization analysis software

The folder path of Expose or clumped data:

if your data are in vcf format:

you can click (1,2) “The folder path of vcf data” to choose your folder path of vcf data.
Of course, you can also edit (1,1) “The folder path of vcf format gwas data” manually.
then you can Convert data format by click (1,3) “0-Trans vcf to csv”, the results will in the vcf2csv folder under the work path.
The you can choose the vcf2csv folder as “The folder path of Expose or clumped data”
Of course, you can trans by the code like:

if your data are not in vcf format,like csv,txt,tsv or in .gz compression format:

you can click (2,2) “Choose Exp Dir” to choose your “The folder path of Expose or clumped data” folder of this project.
Of course, you can also edit (2,1) “The folder path of Expose or clumped data” manually.
if the “The folder path of Expose or clumped data” is the folder without clumping, you need to check (4,2) “Whether clump” to make clumping; otherwise, if the “The folder path of Expose or clumped data” is the folder after clumping, you need to uncheck (4,2) “Whether clump”.Because you only need to clump once with specific parameters, at the same time, we will save the clumping data under the WorkDir.

The folder path of Outcome:

if your data are in vcf format:Please refer to above “The folder path of Expose or clumped data”
if your data are not in vcf format,like csv,txt,tsv or in .gz compression format:

you can click (2,4) “The folder path of Outcome” to choose your working folder of this project.
Of course, you can also edit (2,4) “The folder path of Outcome” manually.

Number of parallel cores:

you can edit (4,1) “Number of parallel cores” manually. If the value =0, which means single-core operations, if the value=4,which means parallel operations with 4 cores.

Expose name index / Outcome name index:

you can edit (5,1) “Expose name index” manually. If the value =0, which means Expose name is the full file name, if the value=4, which means Expose name is 4th str by splitting the full file name with “_”.For example,The file name is a_b_c_d_e.csv, 0 means a_b_c_d_e,4 means d.
you can edit (6,1) “Outcome name index” manually. If the value =0, which means Outcome name is the full file name, if the value=4, which means Outcome name is 4th str by splitting the full file name with “_”.For example,The file name is a_b_c_d_e.csv, 0 means a_b_c_d_e,4 means d.

Generate subgraphs by exposure or outcome:

you can choose (5,2) “Out” manually. If the value =Out, which means Generate subgraphs by outcome, if the value=Exp, which means Generate subgraphs by exposure.
Let's assume the number of exposure file is m,the number of outcome file is n, if m≥n, you can choose “Out”,otherwise,you can choose “Exp”.

Exposure and Outcome SampleSize Ncase path:

if your gwas dataframe do not contain the column of SampleSize and Ncase, you can click (7,2) “SampleSize Ncase” to choose your Exposure and Outcome SampleSize Ncase csv file path like SampleSizeNcaseFile.csv. The software will add them by match the file name.
Of course, you can also edit (7,1) “SampleSize Ncase” manually.

if you just have one exposure file and one outcome file, you can fill them in there.

Parameter explanation

Expose p-value:
Outcome p-value:
R-squared value:
F value:
P value of forest
Volcano chart title
Items number in donut diagram
Volcano plot height
Volcano plot width
Donut Width

Run MR:

if the “The folder path of Expose or clumped data” is the folder without clumping, you need to check (4,2) “Whether clump” to make clumping.

First, you can click (4,3) “1-clump analysis” to clump.
then, you can click (4,4) “2-MR analysis”.
then, you can click (4,5) “3- Merge result”.
then, you can click (5,5) “4- Mapping the Forest Volcano”.
then, you can click (6,5) “5-Draw a ring heat map”.

if the “The folder path of Expose or clumped data” is the folder after clumping, you need to uncheck (4,2) “Whether clump”.

First, you can click (4,4) “2-MR analysis”.
then, you can click (4,5) “3- Merge result”.
then, you can click (5,5) “4- Mapping the Forest Volcano”.
then, you can click (6,5) “5-Draw a ring heat map”.

Colocalization analysis software

The folder path of Expose:

you can click (1,2) “Choose Exp Dir” to choose your working folder of this project.
Of course, you can also edit (1,1) “The folder path of Expose” manually.

The folder path of Outcome:

you can click (2,2) “Choose Out Dir” to choose your working folder of this project.
Of course, you can also edit (1,1) “The folder path of Outcome” manually.

Genome format file to be converted:

you can click (3,2) “Path of the folder to be converted” to choose your working folder of this project.
Of course, you can also edit (3,1) “Genome format file to be converted” manually.

Parameter explanation

Up and down range: Upstream and downstream range
variable type: Continuous or Classification
PPH4:PPH4 threshold filtering
genome type: 37 or 38 ⇒ grch37/38

run Colocalization

you can click (2,6) “1- Colocalization analysis and plot”.
Colocalization do not need to run “0-Chromosome format conversion”, but you need to choose right genome type of the data file!

run Chromosome format conversion

you can click (3,4) “0-Chromosome format conversion”,then it will add chr and poc of new Chromosome format by match snp name!