<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>MacVector talk &#187; Alignment</title>
	<atom:link href="http://macvector.com/blog/tag/alignment/feed/" rel="self" type="application/rss+xml" />
	<link>http://macvector.com/blog</link>
	<description>General musings from the MacVector team about sequence analysis, molecular biology, the Mac in general and of course your favorite sequence analysis app for the Mac!</description>
	<lastBuildDate>Mon, 14 May 2012 14:40:56 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	
		<item>
		<title>Clustering an alignment</title>
		<link>http://macvector.com/blog/2010/10/clustering-an-alignment/</link>
		<comments>http://macvector.com/blog/2010/10/clustering-an-alignment/#comments</comments>
		<pubDate>Mon, 25 Oct 2010 09:44:50 +0000</pubDate>
		<dc:creator>Chris</dc:creator>
				<category><![CDATA[Techniques]]></category>
		<category><![CDATA[Tips]]></category>
		<category><![CDATA[Alignment]]></category>

		<guid isPermaLink="false">http://macvector.com/blog/?p=390</guid>
		<description><![CDATA[Sometimes it is useful to sort, or cluster, an alignment according to the similarity/identity of its sequences. With such a sorted alignment you are able to more easily visualise closely related sequences as they will be together in an alignment with more distantly related sequences being much further apart. If you want to quickly cluster [...]]]></description>
			<content:encoded><![CDATA[<p>Sometimes it is useful to sort, or cluster, an alignment according to the similarity/identity of its sequences. With such a sorted alignment you are able to more easily visualise closely related sequences as they will be together in an alignment with more distantly related sequences being much further apart.</p>
<p>If you want to quickly cluster a sequence alignment in MacVector then you can use a phylogenetic tree to do this.  </p>
<p>- Take your sequence alignment and run a phylogenetic reconstruction. In the MSA editor click the TREE button or run the menu command:</p>
<pre>ANALYZE > CONSTRUCT TREE
</pre>
<p>- From the resulting phylogenetic window, you can rearrange sequences if necessary by selecting nodes and &#8220;rotating&#8221; them with the toolbar buttons. </p>
<p>- Then, with the focus on the Phylogeny window, sort the alignment to match the tree using the Sort MSA toolbar button or using the menu command:</p>
<pre>ANALYZE > PHYLOGENETIC ANALYSES > SORT MSA TO MATCH TREE</pre>
<p>This will rearrange the sequences in the editor to match the phylogeny branches.</p>
<p><img style="display:block; margin-left:auto; margin-right:auto;" src="http://macvector.com/blog/wp-content/uploads/2010/10/MSA_Tree.png" alt="MSA_Tree.png" title="MSA_Tree.png" border="0" width="600" height="382" /> </p>
<p><!-- Technorati Tags Start --></p>
<p>Technorati Tags:<br />
<a href="http://technorati.com/tag/MacVector" rel="tag">MacVector</a>
</p>
<p><!-- Technorati Tags End --></p>
<div id="tweetbutton390" class="tw_button" style=""><a href="http://twitter.com/share?url=http%3A%2F%2Fmacvector.com%2Fblog%2F2010%2F10%2Fclustering-an-alignment%2F&amp;via=macvector&amp;text=Clustering%20an%20alignment&amp;related=macvector&amp;lang=en&amp;count=horizontal&amp;counturl=http%3A%2F%2Fmacvector.com%2Fblog%2F2010%2F10%2Fclustering-an-alignment%2F" class="twitter-share-button"  style="width:55px;height:22px;background:transparent url('http://macvector.com/blog/wp-content/plugins/wp-tweet-button/tweetn.png') no-repeat  0 0;text-align:left;text-indent:-9999px;display:block;">Tweet</a></div>]]></content:encoded>
			<wfw:commentRss>http://macvector.com/blog/2010/10/clustering-an-alignment/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Alignments in MacVector</title>
		<link>http://macvector.com/blog/2009/06/alignments-in-macvector/</link>
		<comments>http://macvector.com/blog/2009/06/alignments-in-macvector/#comments</comments>
		<pubDate>Fri, 12 Jun 2009 01:00:57 +0000</pubDate>
		<dc:creator>Kevin</dc:creator>
				<category><![CDATA[Algorithms]]></category>
		<category><![CDATA[Tutorials]]></category>
		<category><![CDATA[Alignment]]></category>
		<category><![CDATA[assembler]]></category>
		<category><![CDATA[BLAST]]></category>
		<category><![CDATA[Dot Plot]]></category>

		<guid isPermaLink="false">http://macvector.com/blog/?p=64</guid>
		<description><![CDATA[We get a lot of comments and questions from users on the various alignment functions in MacVector. They say there&#8217;s more than one way to skin a cat (not that I&#8217;ve done that &#8211; I have skinned a catfish, but I only know one way), and thats certainly true for alignments in MacVector. Each function [...]]]></description>
			<content:encoded><![CDATA[<p>We get a lot of comments and questions from users on the various alignment functions in MacVector. They say there&#8217;s more than one way to skin a cat (not that I&#8217;ve done that &#8211; I have skinned a catfish, but I only know one way), and thats certainly true for alignments in MacVector. Each function is designed for a different purpose. First, lets just list the functions;</p>
<ul>
<li>ClustalW &#8211; we also call this the &#8220;standard&#8221; Multiple Sequence Alignment (MSA)</li>
<li>Align to Reference</li>
<li>Pustell Matrix (also known as a Dot Plot)</li>
<li>Internet BLAST</li>
<li>Align to Folder</li>
<li>Contig Assembly</li>
</ul>
<p><strong>ClustalW/Multiple Sequence Alignment (MSA)</strong></p>
<p><img class="alignleft size-medium wp-image-87" title="msa2" src="http://macvector.com/blog/wp-content/uploads/2009/06/msa2-300x234.png" alt="msa2" width="300" height="234" />If you have two or more related sequences (DNA or Protein) and you want to examine the relationship between them, use this function. Choose <em>File-&gt;New-&gt;Protein Alignment</em> (or <em>File-&gt;New-&gt;Nucleic Acid Alignment</em>) to create an empty MSA window. Add sequences to the alignment by using the <em>Edit-&gt;Add Sequences from File</em> menu item then click on the <em>Align</em> toolbar button to automatically align the sequences using <em>ClustalW</em>. Click on the <em>Prefs</em> toolbar button to control the appearance and behavior of the data in each of tabs that represent different views or analyses of the alignment. This functionality is most suited for protein alignments, or for nucleic acid sequences where you are interested in examining phylogenetic relationships. If you wish to compare two or more DNA sequences, you should definitely consider if one of the other alignment functions may be more suitable.</p>
<p><strong>Align to Reference</strong></p>
<div id="attachment_92" class="wp-caption alignleft" style="width: 310px"><img class="size-medium wp-image-92" src="http://macvector.com/blog/wp-content/uploads/2009/06/align2ref.jpg" alt="The Align to Reference Editor window" width="300" height="260" /><p class="wp-caption-text">The Align to Reference Editor window</p></div>
<p>Use this if you have a reference sequence and you want to align one or more DNA sequences against it. A typical use would be in resequencing e.g. sequencing a cloned PCR fragment to check no errors were introduced, sequencing across end junctions, scanning for successful mutagenesis clones etc. In each case, open the file that represents the parent or &#8220;reference&#8221; sequence, then choose <em>Analyze-&gt;Align to Reference</em>. In the window that opens, click on the &#8220;+&#8221; button to add sequences from disk &#8211; these can be in any format that MacVector can read &#8211; typically ABI or SCF chromatogram files, but you can add plain sequences as well. When you click on the <em>Align</em> button, choose the <em>Sequence Confirmation</em> algorithm &#8211; this is tuned to expect the small insertions/deletions you would expect in raw chromatogram files. Compared to <em>ClustalW</em>, Align to Reference has the advantage that it will automatically &#8220;flip&#8221; sequences to guarantee optimal alignment.</p>
<p><em>Align to Reference</em> can also be used to align cDNA clones against a genome sequence. The steps are similar &#8211; use the genomic sequence as the reference, then add one or more cDNA clones to the alignment. Again, these can be chromatogram files. Now choose the <em>cDNA Alignment </em>algorithm when you <em>Align</em> &#8211; this is tuned to expect large insertions representing the intron regions.</p>
<p><strong>Pustell Matrix</strong></p>
<div id="attachment_83" class="wp-caption alignleft" style="width: 310px"><img class="size-medium wp-image-83" title="dotplotrepeats" src="http://macvector.com/blog/wp-content/uploads/2009/06/dotplotrepeats-300x233.png" alt="Repetitive sequence elements identified using a dot plot" width="300" height="233" /><p class="wp-caption-text">Repetitive sequence elements identified using a dot plot</p></div>
<p>This &#8220;Dot Plot&#8221; function is great for identifying weak regions of similarity between two sequences. It is not designed to show full-length alignments between two sequences, but instead shows shorter segments of direct or inverted similarity. You can use this to identify shorter regions of similarity, then copy those sections to new sequence windows for more in depth analysis using <em>ClustalW</em> or <em>Align to Reference</em>. Dot Plots are also the best way of identifying sequence rearrangements &#8211; the display clearly shows insertions and deletions (the main diagonal will be broken and have an offset) and even inversions (the inverted diagonal will run bottom left to top right and be colored blue). Finally you can use it to identify repetitive regions which appear as parallel diagonals offset from the main diagonal. <em>Pustell Matrix</em> can be used not only to compare DNA:DNA and Protein:Protein, but you can also use it to compare DNA:Protein where the algorithm will translate the DNA in all 6 frames before aligning to the protein.</p>
<p><strong>Internet BLAST</strong></p>
<p>Use this to identify and align a test sequence to the databases at the NCBI using the popular BLAST algorithm. One slightly hidden function in MacVector is that you can select sequences in the &#8220;hitlist&#8221; and then choose <em>Database-&gt;Retrieve to Disk</em> or <em>Database-&gt;Retrieve to Desktop</em> to download the matching sequences from the NCBI. You don&#8217;t even need to select the entire line &#8211; just select part of a line and use the <em>Retrieve</em> menu item.</p>
<p><strong>Align to Folder</strong></p>
<p>This allows you to scan a local folder full of sequences (in any format MacVector can recognize) and align them using the FastA alignment algorithm. Its kind of like a local BLAST, but more sensitive. Like the <em>Pustell Matrix</em>, you can choose to search DNA with Protein and <em>vice versa</em>. Many users like this function because the text alignment output also shows the features in the test sequence. This can be very useful for demonstrating the differences between your sequence and other sequences for patent purposes.</p>
<p><strong>Contig Assembly</strong></p>
<p>This requires our optional Assembler add-on. Use this if you want to align two or more DNA sequences with the idea of assembling them into a longer sequence with a consensus. Its is primarily designed for <em>de novo</em> sequencing, where you have no reference or scaffold sequence to align the individual sequences to. The MacVector implementation uses the popular phred, phrap and cross_match algorithms from the University of Washington that use quality values for improved accuracy of assembly. While you can use this for resequencing, you should consider whether the <em>Align to Reference</em> function might be a better choice.</p>
<p><strong>Tutorials</strong></p>
<p>There are tutorials for Sequence Confirmation and Contig Assembly in the Documentation folder of your MacVector installation. You can also download copies from our website.</p>
<p>So there we have at least five ways to align sequences using MacVector. Now if I can just find another 4 ways of skinning a catfish (or even just ONE thats easier than my current method) then I&#8217;ll be all set.</p>
<div id="tweetbutton64" class="tw_button" style=""><a href="http://twitter.com/share?url=http%3A%2F%2Fmacvector.com%2Fblog%2F2009%2F06%2Falignments-in-macvector%2F&amp;via=macvector&amp;text=Alignments%20in%20MacVector&amp;related=macvector&amp;lang=en&amp;count=horizontal&amp;counturl=http%3A%2F%2Fmacvector.com%2Fblog%2F2009%2F06%2Falignments-in-macvector%2F" class="twitter-share-button"  style="width:55px;height:22px;background:transparent url('http://macvector.com/blog/wp-content/plugins/wp-tweet-button/tweetn.png') no-repeat  0 0;text-align:left;text-indent:-9999px;display:block;">Tweet</a></div>]]></content:encoded>
			<wfw:commentRss>http://macvector.com/blog/2009/06/alignments-in-macvector/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

