Options
The options for Reference population genotypes, GWAS summary statistics, and Global misc act on all the analyses. For clarity, we have categorized the other parameters by GATES and EHE, DESE, EMIC and EHE, although this has resulted in duplications.
In the “Default” columns of the following tables, “null” denotes that the flag works with an argument but there is no default value; “n/a” denotes that the flag works without any argument.
Reference population genotypes
These options work on the VCF file of reference population genotypes. Only SNPs that pass the filters will be used for subsequent analyses.
Flag |
Description |
Default |
|---|---|---|
|
Specifies a VCF file of genotypes sampled from the target population. These genotypes are used to estimate LD correlation coefficients between any pair of SNPs. For VCF files of separated chromosomes, use wildcards and quotes like |
null |
|
Specifies a directory to keep the parsed VCF files in KGGSEE object format. |
null |
|
Specifies the directory of the genotypes of the reference population in KGGSEE object format, which was saved by |
null |
|
Filter SNPs with a minor allele frequency lower than the setting. |
|
|
Filter SNPs with a p-value of rejecting Hardy-Weinberg equilibrium lower than the setting. |
|
|
Specify chromosome labels. e.g., |
1,…,22,X,Y,M |
|
Set the max tolerable LD coefficient between SNPs from two LD blocks. KGGSEE divides SNPs within a genomic region into LD blocks to improve computational efficiency. Any pairwise LD coefficient between SNPs from two LD blocks are always less than the number specified by |
|
|
Set the window size in base-pair for searching LD blocks on chromosomes |
|
GWAS summary statistics
Flag |
Description |
Default |
|---|---|---|
|
Specifies a whitespace delimitated file of GWAS summary statistics. |
null |
|
Specifies the column header of chromosomes. |
|
|
Specifies the column header of coordinates. |
|
|
Specifies the column header of p-values. |
|
|
Specifies the column header of the effect allele. |
|
|
Specifies the column header of the other allele. |
|
|
Specifies the column header of the frequencies of the allele specified by |
|
|
Specifies the column header of effect sizes. |
null |
|
Specifies the type of effect sizes: |
null |
|
Specifies the column header of standard errors of effect sizes. Note: even if the effect size is provided as an odds ratio, this is still the standard error of the logarithm (base e) of the odds ratio. |
|
|
Specifies the column header of sample sizes for a quantitative trait. |
null |
|
Specifies the column header of case sample sizes for a qualitative trait. |
null |
|
Specifies the column header of control sample sizes for a dichotomous trait. |
null |
|
Ask KGGSEE to adjust the p-values and chi-square statistics using the genomic control factors from the input GWAS data before all follow-up analyses. |
1 |
Global misc
Flag |
Description |
Default |
|---|---|---|
|
Specifies the number of threads. |
|
|
Download |
n/a |
|
Specifies the reference genome version of the coordinates. The supported versions are |
|
|
Specifies the database of gene annotations. |
|
|
Output results in Excel format. |
n/a |
|
Only genes with an HGNC-approved gene symbol are considered in the analysis. |
n/a |
|
Specifies the output prefix of results. |
|
|
Specify a BED file to define customized gene coordinates instead of the annotation from RefSeqGene or GENCODE. The first three columns of the BED file define gene coordinates and are mandatory; the fourth column defines gene names and is optional. When the fourth column is absent, a gene name of the format like |
null |
|
Specifies genomic regions to be excluded in the analysis, e.g., |
null |
|
Specifies the path KGGSEE running resource data. |
|
GATES and ECS
Flag |
Description |
Default |
|---|---|---|
|
Triggers gene-based association tests. |
n/a |
|
One number sets the basepair to extend at both sides of a gene, when considering SNPs belonging to the gene, e.g., |
|
|
Specifies a fasta-styled file of eQTL summary statistics. If this flag is used, |
null |
|
Specifies the threshold of eQTL p-values. Only eQTLs with a p-value lower than the threshold will be used. The default is |
|
DESE
Flag |
Description |
Default |
|---|---|---|
|
Trigers the DESE, eDESE or SelDP. |
n/a |
|
Specifies a gene expression file that contains means and standard errors of gene expressions in multiple tissues. |
null |
|
Specifies the method for multiple testing correction. |
|
|
Specifies the threshold of the adjusted p-value for fine-mapping. Only genes with an adjusted p-value lower than the threshold will be retained for fine-mapping. |
0.05 |
|
Specifies the maximum number of genes with the smallest p-values that will be retained for fine-mapping. |
null |
|
Specifies MSigDB gene sets for enrichment analysis:
|
null |
|
Specifies a user-defined file of gene sets for enrichment analysis. |
null |
|
One number sets the basepair to extend at both sides of a gene when considering SNPs belonging to the gene, e.g., |
|
|
Specifies a fasta-styled file of eQTL summary statistics. If this flag is used, |
null |
|
Specifies the threshold of eQTL p-values. Only eQTLs with a p-value lower than the threshold will be used. The default is |
|
|
The number of permutations for an adjustment of selection bias and multiple testing |
null |
EMIC
Flag |
Description |
Default |
|---|---|---|
|
Triggers the EMIC. |
n/a |
|
Specifies a fasta-styled file of eQTL summary statistics. |
null |
|
Specifies the threshold of eQTL p-values. Only eQTLs with a p-value lower than the threshold will be used. The default is |
|
|
Specifies the threshold of LD coefficients when pruning variants. For each gene or transcript, eQTLs with LD coefficients higher than the threshold will be pruned. |
0.5 |
|
Specifies the p-value threshold to further perform an EMIC pleiotropy fine-mapping (EMIC-PFM) analysis. If the EMIC p-value of a gene is lower than the threshold, an EMIC-PFM will be performed to control the false-positive caused by pleiotropy. |
|
|
Specifies the p-value threshold for plotting a scatter plot. Genes with an EMIC p-value lower than the threshold will be plotted. |
|
EHE
Flag |
Description |
Default |
|---|---|---|
|
Triggers gene-based association tests and estimation of gene heritability. The flags of |
n/a |
|
When |
null |
|
When |
n/a |
|
Specifies the proportion of cases in the population when estimating the heritability of a dichotomous trait. |
0.01 |