These are the scripts used to run the GLIMPSE2 (https://www.nature.com/articles/s41588-020-00756-0) method to impute genotypes from low-coverage sequencing on the Wayne State High Performance Computing Grid. We previously used Gencove for both data generation and imputation, and we wanted to reprocess the multiple batches with the same pipeline. The analysis steps are adapted from the GLIMPSE2 tutorial (https://odelaneau.github.io/GLIMPSE/). The starting point is a set of FASTQ files (or BAM files converted with bam2fastq), and for each one we obtain a VCF file produced with the same reference panel and software version.
- Clone the GitHub repo into a new folder where the analysis will be performed:

```bash
git clone git@github.com:piquelab/locoglimpse2.git
```
- Make links to all the reference files needed for alignment and imputation, pointing at already existing locations on the cluster. Alternatively, you may want to create new references from scratch (see below). The paths in the following script are based on hg38 and the latest release of the 1KG panel remapped to hg38:

```bash
bash makeRefLinks.sh
```
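As an illustration only, the linking step amounts to something like the sketch below; the shared paths are placeholders, not the actual cluster locations (`makeRefLinks.sh` is the authoritative version):

```bash
# Hypothetical paths for illustration; see makeRefLinks.sh for the real ones.
mkdir -p refs
ln -s /shared/references/hg38/GRCh38.fa refs/GRCh38.fa         # genome FASTA
ln -s /shared/references/hg38/GRCh38.fa.fai refs/GRCh38.fa.fai # FASTA index
ln -s /shared/references/1KG_hg38 refs/1KG_hg38                # 1KG panel dir
```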
- Put all the FASTQ files in the `./fastq/` folder, or convert them from BAM to FASTQ first. See and adapt `makefastq.sh`; a sketch of the conversion is shown below.
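For reference, converting a BAM back to paired FASTQ can be done with samtools roughly as follows; the sample names and paths are made up, and `makefastq.sh` is the script actually used:

```bash
# Group mates together, then write paired, gzipped FASTQs.
samtools collate -u -O input/sample1.bam | \
  samtools fastq -n \
    -1 fastq/sample1_R1.fastq.gz \
    -2 fastq/sample1_R2.fastq.gz \
    -0 /dev/null -s /dev/null -
```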
- Align all the FASTQ reads to the reference with BWA-MEM2, using Slurm jobs:

```bash
make -f bamMake.mk slurm
```
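Per sample, the alignment step is essentially a pipe of the following shape (a sketch, not the exact recipe in `bamMake.mk`; read-group fields, thread counts, and file names are assumptions):

```bash
# Align, coordinate-sort, and index one sample (illustrative names).
bwa-mem2 mem -t 8 -R "@RG\tID:sample1\tSM:sample1\tPL:ILLUMINA" \
    refs/GRCh38.fa fastq/sample1_R1.fastq.gz fastq/sample1_R2.fastq.gz | \
  samtools sort -@ 4 -o bam/sample1.bam -
samtools index bam/sample1.bam
```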
- Imputation against the 1KG panel, using Slurm jobs:

```bash
make -f vcfImputeMake.mk slurm
```
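Per sample and per chunk, the imputation boils down to GLIMPSE2_phase followed by GLIMPSE2_ligate, along the lines of the GLIMPSE2 tutorial; the file names below are illustrative, not what `vcfImputeMake.mk` actually produces:

```bash
# Impute one sample against one binary reference chunk (illustrative names).
GLIMPSE2_phase \
  --bam-file bam/sample1.bam \
  --reference refs/split/1000GP.chr22_chunk.bin \
  --output impute/sample1_chr22_chunk.bcf

# Index the chunk BCFs if needed, then ligate them into one file per chromosome.
for BCF in impute/sample1_chr22_*.bcf; do bcftools index -f "$BCF"; done
ls -1v impute/sample1_chr22_*.bcf > impute/sample1_chr22.list
GLIMPSE2_ligate --input impute/sample1_chr22.list \
  --output impute/sample1_chr22.bcf
```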
- Merge all BCF files into one vcf.gz file. See or adapt `final_merge.sh`.
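The merge itself can be done with bcftools; a minimal sketch, assuming one ligated BCF per sample (`final_merge.sh` is the version actually used):

```bash
# bcftools merge requires indexed inputs.
for BCF in impute/*.bcf; do bcftools index -f "$BCF"; done
ls impute/*.bcf > merge.list
bcftools merge -l merge.list -Oz -o merged.vcf.gz
bcftools index -t merged.vcf.gz
```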
To create new references from scratch instead of linking to existing ones:

- Build the hg38 reference genome and its indexes:

```bash
bash make_hg38ref.sh
```
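A minimal sketch of this step, assuming the 1000 Genomes GRCh38 analysis-set FASTA is used (the URL and file name are assumptions; `make_hg38ref.sh` is authoritative):

```bash
# Download the GRCh38 analysis-set FASTA (URL is an assumption) and build
# the indexes needed by samtools and BWA-MEM2.
wget http://ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/reference/GRCh38_reference_genome/GRCh38_full_analysis_set_plus_decoy_hla.fa
samtools faidx GRCh38_full_analysis_set_plus_decoy_hla.fa
bwa-mem2 index GRCh38_full_analysis_set_plus_decoy_hla.fa
```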
- Download the 1000G VCF files: `download_1kgFiles.sh`
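For orientation, the per-chromosome download loop looks roughly like this; the FTP path is the one used in the GLIMPSE tutorial and is an assumption here (`download_1kgFiles.sh` has the actual URLs):

```bash
# Fetch the phased 1KG high-coverage panel per chromosome, plus indexes.
URL=http://ftp.1000genomes.ebi.ac.uk/vol1/ftp/data_collections/1000G_2504_high_coverage/working/20201028_3202_phased
for CHR in {1..22}; do
  wget ${URL}/CCDG_14151_B01_GRM_WGS_2020-08-05_chr${CHR}.filtered.shapeit2-duplicateremoval.phased.vcf.gz
  wget ${URL}/CCDG_14151_B01_GRM_WGS_2020-08-05_chr${CHR}.filtered.shapeit2-duplicateremoval.phased.vcf.gz.tbi
done
```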
- Process the reference VCF files: `step2_process_ref_panel.sh`
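Following the GLIMPSE2 tutorial, processing the panel means normalizing it and keeping biallelic SNPs, roughly as below (input and output names are illustrative):

```bash
CHR=22  # run for each chromosome
# Split multiallelic records, keep biallelic SNPs, write BCF, and index it.
bcftools norm -m -any 1KG_hg38/raw.chr${CHR}.vcf.gz -Ou --threads 4 | \
  bcftools view -m 2 -M 2 -v snps --threads 4 -Ob \
    -o refs/1000GP.chr${CHR}.bcf
bcftools index -f refs/1000GP.chr${CHR}.bcf
```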
- Make the chunks: `step3_script_chunk.sh`
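The chunking uses GLIMPSE2_chunk, along these lines (the genetic map and output paths are assumptions):

```bash
CHR=22  # run for each chromosome
# Divide the chromosome into imputation chunks with buffers.
GLIMPSE2_chunk --input refs/1000GP.chr${CHR}.bcf --region chr${CHR} \
  --map maps/chr${CHR}.b38.gmap.gz --sequential \
  --output chunks/chunks.chr${CHR}.txt
```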
- Split the reference panel into chunks: `step4_script_split_reference.sh`
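Each line of the chunk file then drives GLIMPSE2_split_reference, which writes the binary reference chunks consumed by GLIMPSE2_phase; a sketch following the tutorial's chunk-file format (paths are illustrative):

```bash
CHR=22  # run for each chromosome
while IFS="" read -r LINE || [ -n "$LINE" ]; do
  IRG=$(echo $LINE | cut -d" " -f3)  # input region, including buffer
  ORG=$(echo $LINE | cut -d" " -f4)  # output region
  GLIMPSE2_split_reference --reference refs/1000GP.chr${CHR}.bcf \
    --map maps/chr${CHR}.b38.gmap.gz \
    --input-region ${IRG} --output-region ${ORG} \
    --output refs/split/1000GP.chr${CHR}
done < chunks/chunks.chr${CHR}.txt
```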