Examples

Illustration of haplotyping with SMAP haplotype-sites

Below, we present good quality data sets analysed with SMAP haplotype-sites to illustrate typically expected haplotyping results. For each data set, the command to run SMAP haplotype-sites with suggested optimal settings, together with graphical results are displayed for comparison to your own data.

Please note that while we suggest ‘optimal’ parameter settings in the command to run SMAP haplotype-sites, the default of SMAP haplotype-sites is to perform as little filtering as possible and to report all loci. The user is adviced to run SMAP haplotype-sites first with the mandatory and default settings (no data filtered out), and then decide on the most optimal parameter settings for your own data. Parameter settings should be iteratively adjusted for each novel data set, after manual inspection of the graphical output. The example data shown below are merely meant to illustrate the expected outcome of data sets processed with parameters adjusted to the specific type of library, reads and samples. If your data does not look like these examples, please check out the section Recommendations and Troubleshooting for examples (and suggested solutions) of incorrectly generated, preprocessed, or mapped reads, or sample sets analyzed with inappropriate parameter settings for SMAP haplotype-sites. There, some guidelines for troubleshooting are provided, which may help to optimize library preparation, sequencing, read preprocessing or SMAP haplotype-sites parameter settings.

HiPlex

Individuals

Tab command shows a typical command to run SMAP haplotype-sites for HiPlex data in diploid individuals.
Tabs further show typical graphical output such as haplotype frequency distribution at the individual sample level, as well as summary graphics that show locus and sample completeness and correctness and haplotype diversity, together with explanation about step-specific parameters.
smap haplotype-sites /path/to/BAM/ -mapping_orientation stranded  -p 8 --plot all --plot_type png --name 2n_ind_GBS-SE -f 50 -g 200 --min_stack_depth 3 --max_stack_depth 500 --min_cluster_depth 10 --max_cluster_depth 1500 --max_stack_number 2 --min_stack_depth_fraction 10 --completeness 1 --max_smap_number 10

Pools

Tab command shows a typical command to run SMAP haplotype-sites for HiPlex data in pools of diploid individuals.
Tabs further show typical graphical output such as haplotype frequency distribution at the individual sample level, as well as summary graphics that show locus and sample completeness and correctness and haplotype diversity, together with explanation about step-specific parameters.
smap haplotype-sites /path/to/BAM/ -mapping_orientation stranded  -p 8 --plot all --plot_type png --name 2n_ind_GBS-SE -f 50 -g 200 --min_stack_depth 3 --max_stack_depth 500 --min_cluster_depth 10 --max_cluster_depth 1500 --max_stack_number 2 --min_stack_depth_fraction 10 --completeness 1 --max_smap_number 10

Shotgun

Sliding frames

Tab command shows a typical command to run SMAP haplotype-sites for Shotgun PE-86 separately mapped reads of diploid individuals.
Tabs further show typical graphical output such as haplotype frequency distribution at the individual sample level, as well as summary graphics that show locus and sample completeness and correctness and haplotype diversity, together with explanation about step-specific parameters.
smap haplotype-sites /path/to/BAM/ -mapping_orientation stranded  -p 8 --plot all --plot_type png --name 2n_ind_GBS-SE -f 50 -g 200 --min_stack_depth 3 --max_stack_depth 500 --min_cluster_depth 10 --max_cluster_depth 1500 --max_stack_number 2 --min_stack_depth_fraction 10 --completeness 1 --max_smap_number 10

Structural Variants

Tab command shows a typical command to run SMAP haplotype-sites for single-enzyme GBS, merged reads in tetraploid individuals.
Tabs further show typical graphical output such as haplotype frequency distribution at the individual sample level, as well as summary graphics that show locus and sample completeness and correctness and haplotype diversity, together with explanation about step-specific parameters.
smap haplotype-sites /path/to/BAM/ -mapping_orientation ignore -p 8 --plot all --plot_type png --name 2n_ind_GBS-SE -f 50 -g 200 --min_stack_depth 3 --max_stack_depth 500 --min_cluster_depth 10 --max_cluster_depth 1500 --max_stack_number 2 --min_stack_depth_fraction 10 --completeness 1 --max_smap_number 10

Oxford Nanopore long reads

Tab command shows a typical command to run SMAP haplotype-sites for single-enzyme GBS, merged reads in pools.
Tabs further show typical graphical output such as haplotype frequency distribution at the individual sample level, as well as summary graphics that show locus and sample completeness and correctness and haplotype diversity, together with explanation about step-specific parameters.
smap haplotype-sites /path/to/BAM/ -mapping_orientation ignore -p 8 --plot all --plot_type png --name 2n_ind_GBS-SE -f 50 -g 200 --min_stack_depth 3 --max_stack_depth 500 --min_cluster_depth 10 --max_cluster_depth 1500 --max_stack_number 2 --min_stack_depth_fraction 10 --completeness 1 --max_smap_number 10

PacBio long reads

Tab command shows a typical command to run SMAP haplotype-sites for single-enzyme GBS, merged reads in pools.
Tabs further show typical graphical output such as haplotype frequency distribution at the individual sample level, as well as summary graphics that show locus and sample completeness and correctness and haplotype diversity, together with explanation about step-specific parameters.
smap haplotype-sites /path/to/BAM/ -mapping_orientation ignore -p 8 --plot all --plot_type png --name 2n_ind_GBS-SE -f 50 -g 200 --min_stack_depth 3 --max_stack_depth 500 --min_cluster_depth 10 --max_cluster_depth 1500 --max_stack_number 2 --min_stack_depth_fraction 10 --completeness 1 --max_smap_number 10

GBS

Individuals

Tab command shows a typical command to run SMAP haplotype-sites for single-enzyme GBS, merged reads in diploid individuals.
Tabs further show typical graphical output such as haplotype frequency distribution at the individual sample level, as well as summary graphics that show locus and sample completeness and correctness and haplotype diversity, together with explanation about step-specific parameters.
smap haplotype-sites /path/to/BAM/ -mapping_orientation ignore -p 8 --plot all --plot_type png --name 2n_ind_GBS-SE -f 50 -g 200 --min_stack_depth 3 --max_stack_depth 500 --min_cluster_depth 10 --max_cluster_depth 1500 --max_stack_number 2 --min_stack_depth_fraction 10 --completeness 1 --max_smap_number 10

Pools

Tab command shows a typical command to run SMAP haplotype-sites for single-enzyme GBS, merged reads in pools.
Tabs further show typical graphical output such as haplotype frequency distribution at the individual sample level, as well as summary graphics that show locus and sample completeness and correctness and haplotype diversity, together with explanation about step-specific parameters.
smap haplotype-sites /path/to/BAM/ -mapping_orientation ignore -p 8 --plot all --plot_type png --name 2n_ind_GBS-SE -f 50 -g 200 --min_stack_depth 3 --max_stack_depth 500 --min_cluster_depth 10 --max_cluster_depth 1500 --max_stack_number 2 --min_stack_depth_fraction 10 --completeness 1 --max_smap_number 10