Gap closing and genome finishing tools in Align to Reference and Assembler.

Automated algorithms can only take you so far with genome assembly. The final steps involved in finishing a genome always need manual intervention. MacVector’s various assembly editors have many tools for helping finish genome sequencing projects. For example, closing gaps, extending reference sequences and even automatically circularizing contigs. If you select reads, then right click (or use CTRL-left click) you will see a context sensitive menu with the following tools:

Unknown

  • Export Consensus with/without Gaps
  • Align Selected Reads
  • Delete Selected Reads
  • Reset (unalign) Selected Reads
  • Export Selected Reads as FASTA/FASTQ
  • Select Matching Pairs – if you have aligned a set of paired-end reads, you can select individual read(s) and use this function to select the corresponding mate(s). This is particularly useful if you want to find pairs that will extend a contig and export them for further analysis/assembly.
  • Extend Reference with Selected Read – This is active if you have selected a single read that hangs over either end of a Reference sequence. This will extend the Reference in the appropriate direction using the sequence of the read.
  • Circularize Consensus – This is enabled if it detects direct repeats at the ends of a contig, and even tells you the length of the repeat it found. It will circularize the consensus and create a new circular sequence window with the repeat appropriately deleted.
  • Select Overlapping Reads Containing Selected Sequence – This is enabled if you select a short region in a read. All overlapping reads that contain that selected sequence will be selected. For paired reads you can then use Select Matching Pairs to select their mate, then Export Selected Reads as FASTQ/FASTA to export them to a file.

Not all tools are applicable or available in all editors. Plus some tools are only enabled when using paired end reads. Here’s what’s available in each editor.

Align to Reference editor

  • Export Consensus with/without Gaps
  • Align Selected Reads
  • Delete Selected Reads
  • Reset (unalign) Selected Reads
  • Export Selected Reads as FASTA/FASTQ
  • Select Matching Pairs
  • Extend Reference with Selected Read.
  • Select Overlapping Reads Containing Selected Sequence.

Reference Contig editor

  • Export Consensus with/without Gaps
  • Export Selected Reads as FASTA/FASTQ
  • Select Matching Pairs
  • Select Overlapping Reads Containing Selected Sequence.

De novo contig editor

  • Export Consensus with/without Gaps
  • Export Selected Reads as FASTA/FASTQ
  • Select Matching Pairs
  • Circularize Consensus

Read more about the various assembly tools in MacVector.

This entry was posted in Tips and tagged , . Bookmark the permalink. Trackbacks are closed, but you can post a comment.

Post a Comment

Your email is never published nor shared. Required fields are marked *

You may use these HTML tags and attributes <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>

*
*

This site uses Akismet to reduce spam. Learn how your comment data is processed.