fcGENE: a versatile tool for processing and transforming SNP datasets

Abstract:

BACKGROUND Modern analysis of high-dimensional SNP data requires a number of biometrical and statistical methods such as pre-processing, analysis of population structure, association analysis and genotype imputation. Software used for these purposes often rely on specific and incompatible input and output data formats. Therefore extensive data management including multiple format conversions is necessary during analyses. METHODS In order to support fast and efficient management and bio-statistical quality control of high-dimensional SNP data, we developed the publically available software fcGENE using C++ object-oriented programming language. This software simplifies and automates the use of different existing analysis packages, especially during the workflow of genotype imputations and corresponding analyses. RESULTS fcGENE transforms SNP data and imputation results into different formats required for a large variety of analysis packages such as PLINK, SNPTEST, HAPLOVIEW, EIGENSOFT, GenABEL and tools used for genotype imputation such as MaCH, IMPUTE, BEAGLE and others. Data Management tasks like merging, splitting, extracting SNP and pedigree information can be performed. fcGENE also supports a number of bio-statistical quality control processes and quality based filtering processes at SNP- and sample-wise level. The tool also generates templates of commands required to run specific software packages, especially those required for genotype imputation. We demonstrate the functionality of fcGENE by example workflows of SNP data analyses and provide a comprehensive manual of commands, options and applications. CONCLUSIONS We have developed a user-friendly open-source software fcGENE, which comprehensively supports SNP data management, quality control and analysis workflows. Download statistics and corresponding feedbacks indicate that software is highly recognised and extensively applied by the scientific community.

DOI: 10.1371/journal.pone.0097589

Projects: Genetical Statistics and Systems Biology

Publication type: Journal article

Journal: PloS one

Human Diseases: No Human Disease specified

Citation: PLoS ONE 9(7):e97589

Date Published: 22nd Jul 2014

Registered Mode: imported from a bibtex file

Authors: Nab Raj Roshyara, Markus Scholz

Help
help Submitter
Citation
Roshyara, N. R., & Scholz, M. (2014). fcGENE: A Versatile Tool for Processing and Transforming SNP Datasets. In M. M. Abad-Grau (Ed.), PLoS ONE (Vol. 9, Issue 7, p. e97589). Public Library of Science (PLoS). https://doi.org/10.1371/journal.pone.0097589
Activity

Views: 682

Created: 14th Sep 2020 at 13:35

Last updated: 7th Dec 2021 at 17:58

help Tags

This item has not yet been tagged.

help Attributions

None

Related items

Powered by
(v.1.13.0-master)
Copyright © 2008 - 2021 The University of Manchester and HITS gGmbH
Institute for Medical Informatics, Statistics and Epidemiology, University of Leipzig

By continuing to use this site you agree to the use of cookies