<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>MacVector talk &#187; Development</title>
	<atom:link href="http://macvector.com/blog/category/development/feed/" rel="self" type="application/rss+xml" />
	<link>http://macvector.com/blog</link>
	<description>General musings from the MacVector team about sequence analysis, molecular biology, the Mac in general and of course your favorite sequence analysis app for the Mac!</description>
	<lastBuildDate>Mon, 14 May 2012 14:40:56 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	
		<item>
		<title>MacVector 12.5: Interface improvements and redesign</title>
		<link>http://macvector.com/blog/2011/11/macvector-12-5-interface-improvements-and-redesign/</link>
		<comments>http://macvector.com/blog/2011/11/macvector-12-5-interface-improvements-and-redesign/#comments</comments>
		<pubDate>Thu, 24 Nov 2011 14:52:10 +0000</pubDate>
		<dc:creator>Chris</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[Releases]]></category>
		<category><![CDATA[assembler]]></category>

		<guid isPermaLink="false">http://macvector.com/blog/?p=688</guid>
		<description><![CDATA[This is part of a series of posts about the shortly to be released MacVector 12.5. With each release of MacVector as well as adding new features we continually redesign and rewrite many existing parts of MacVector. This is done for a number of reasons. For example: to accommodate changes in OS X, where generally [...]]]></description>
			<content:encoded><![CDATA[<p><em>This is part of a series of posts about the shortly to be released MacVector 12.5.</em></p>
<p>With each release of MacVector as well as adding new features we continually redesign and rewrite many existing parts of MacVector. This is done for a number of reasons. For example:</p>
<li>to accommodate changes in OS X, where generally with every major release they make major changes under the hood.</li>
<li>to optimise workflows, where we try to respond to user feedback and make analyses both easier to use and quicker without removing any functionality.</li>
<li>to redesign dialogues and tools to make them easier to use whilst keeping all the options for the power user.</li>
<li>to take advantage of new OSX features to make life easier.</li>
<p>As a recent example you can now run a primer analysis and literally <a href="http://macvector.com/blog/2011/07/macvector-12-drag-and-drop-annotation-of-your-sequences/">drag and drop</a> a primer pair or their product to the sequence window to annotate those features to the sequence.</p>
<p><img src="http://macvector.com/blog/wp-content/uploads/2011/01/CreateResults.png" alt="CreateResults.png" title="CreateResults.png" border="0" width="230" height="150" style="float:center;" /></p>
<p>Here&#8217;s a few of those general improvements that have been included in MacVector 12.5.</p>
<h3>Interface improvements</h3>
<li>
<p>In MacVector 12.5 a number of editors and lists have been rewritten in <a href="http://en.wikipedia.org/wiki/Cocoa_(API)">Cocoa</a>. The restriction enzyme, proteolytic enzyme, DNA and protein subsequence editors and the Matrix editor have all been rewritten using Cocoa table views. Apart from an improved appearance, this change makes it easier to navigate and select entries in the editors, simplifying the creation of subsets (you can copy and paste between windows). In addition, you can sort the columns by clicking on the titles.</li>
</p>
<p><img src="http://macvector.com/blog/wp-content/uploads/2011/11/MV125EntrezBrowser.png" alt="MV125EntrezBrowser" title="MV125EntrezBrowser.png" border="0" width="400" height="302" style="float:right;" /></p>
<li>The Entrez browser has been rewritten and now permits an unlimited number of search terms to be combined for a search. So you can search for a specific gene in two different organisms at the same time. For example in the screenshot we are searching for the gene &#8220;ADH1&#8243; in two different yeasts,<br />
Sacchoromyces cervisiae and Schizosaccharomyces pombe</p>
</li>
<li>
<p>Many dialogues have had error handling code added so that red information text will be displayed explaining why the OK button is not enabled if invalid parameters have been entered. This helps understand how to optimise an analysis without having to read up in depth about the various parameters.</li>
<h3>Sequence Window Enhancements</h3>
<li>
<p>There are new “Segmented Hollow Arrow”, “Segmented Hollow Box” and “Segmented Full Height Hollow Box” symbol. You can use these for segmented features (e.g. CDS features containing introns) in the Map tab and the exon segments will be shown joined by a line. See the screenshot below for an example of the end of a multi exon segmented feature.</p>
</li>
<p><img src="http://macvector.com/blog/wp-content/uploads/2011/11/MV125_SegmentedFeatures.png" alt="MV125 SegmentedFeatures" title="MV125_SegmentedFeatures.png" border="0" width="400" height="260" style="float:center;" /></p>
<li>
<p>The Editor tab can now display the translations of CDS features above or below the sequence – turn this on using the Strands button.</p>
</li>
<h3>Miscellaneous Enhancements</h3>
<li>
<p>Segmented subsequence matches now have all segments displayed in the text and graphical outputs. For example you can now search for canonical <em>E.coli</em> promoter sequences and see both the -35 and -10 regions displayed in the Map.</li>
<li>
<p>The Align to Reference algorithm now runs as a standard job so that you can switch to other windows and carry on using MacVector while waiting for lengthy alignments to complete.</p>
</li>
<li>
<p>The maximum number of results that can be returned by the Align to Folder function has been increased to 32,767.</p>
</li>
<li>
<p>The SNP tab of the Align to Reference Window now lists all of the changes between the consensus and the reference sequence and includes the codon change and amino acid change of any CDS feature crossing each SNP.</p>
</li>
<p><strong>..and remember if you want to add Assembler before MacVector 12.5 is released you can do so at <a href="http://macvector.com/blog/2011/10/macvector-12-5-get-assembler-for-half-price/">half price</a>. You can also purchase an upgrade or a new license before the release of MacVector 12.5 and get a 10% discount. Both these offers include a free upgrade to MacVector 12.5 when it is released. Please <a href="http://www.macvector.com/initquoterequest.php">request a quote</a> now. Don&#8217;t forget to quote the promotional code of <em><strong>&#8220;Assembler50%&#8221;</strong></em> or <em>&#8220;MV12510&#8243;</em> for the discounts.</p>
<p></strong></p>
<p><!-- Technorati Tags Start --></p>
<p>Technorati Tags: <a href="http://technorati.com/tag/bowtie" rel="tag">bowtie</a>, <a href="http://technorati.com/tag/MacVector" rel="tag">MacVector</a></p>
<p><!-- Technorati Tags End --></p>
<div id="tweetbutton688" class="tw_button" style=""><a href="http://twitter.com/share?url=http%3A%2F%2Fmacvector.com%2Fblog%2F2011%2F11%2Fmacvector-12-5-interface-improvements-and-redesign%2F&amp;via=macvector&amp;text=MacVector%2012.5%3A%20Interface%20improvements%20and%20redesign&amp;related=macvector&amp;lang=en&amp;count=horizontal&amp;counturl=http%3A%2F%2Fmacvector.com%2Fblog%2F2011%2F11%2Fmacvector-12-5-interface-improvements-and-redesign%2F" class="twitter-share-button"  style="width:55px;height:22px;background:transparent url('http://macvector.com/blog/wp-content/plugins/wp-tweet-button/tweetn.png') no-repeat  0 0;text-align:left;text-indent:-9999px;display:block;">Tweet</a></div>]]></content:encoded>
			<wfw:commentRss>http://macvector.com/blog/2011/11/macvector-12-5-interface-improvements-and-redesign/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Growl and MacVector</title>
		<link>http://macvector.com/blog/2010/01/growl-and-macvector/</link>
		<comments>http://macvector.com/blog/2010/01/growl-and-macvector/#comments</comments>
		<pubDate>Mon, 25 Jan 2010 18:42:03 +0000</pubDate>
		<dc:creator>Chris</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[Releases]]></category>

		<guid isPermaLink="false">http://macvector.com/blog/?p=241</guid>
		<description><![CDATA[Over the next few weeks I&#8217;m going to be highlighting some of the new features that are being added to MacVector 11.1. The first of these is support for a great notification system that we all use at MacVector. Some analyses with MacVector will take seconds and as such are done in the foreground. However, [...]]]></description>
			<content:encoded><![CDATA[<p>Over the next few weeks I&#8217;m going to be highlighting some of the new features that are being added to MacVector 11.1. The first of these is support for a great notification system that we all use at MacVector.</p>
<p>Some analyses with MacVector will take seconds and as such are done in the foreground.  However, some types of analysis can take literally hours and if performed in the foreground they could tie up your Mac for a long time. MacVector has a tool called the Job Manager, whose sole purpose is to control such long running tasks. Once such a task is run MacVector will pass over control directly to the Job Manager, allowing the user to continue to use MacVector. As subsequent jobs are submitted it will store them in the job queue, only allowing them to run when there is a spare CPU (or core) to do so! Blast, alignments, primer design and sequence assembly are all examples of jobs that are submitted to the Job Manager. </p>
<p>Usually once a job has completed the Job Manager will notify the user using in the usual way by bouncing the MacVector icon in the Dock.</p>
<p><a href="http://growl.info/">Growl</a> is a third party open notification framework for OS X. It allows the basic notification system used by OS X (i.e. the bouncing icon!), to be replaced by a much more flexible, and potentially less intrusive system. It is widely supported by many OS X apps as a complete replacement for the OS X one.</p>
<p>We&#8217;re pleased to say that with MacVector 11.1 we&#8217;ve added support for Growl to the Job Manager.</p>
<p><img src="http://macvector.com/blog/wp-content/uploads/2010/01/GrowlNotification.png" alt="GrowlNotification.png" border="0" width="300" height="61" align="center" /></p>
<p>Now whenever a job, that is running in the Job Manger completes it notifies you with a non-intrusive way. For example in the image above a Blast job has finished and Growl notifies you with the default theme, &#8220;Smoke&#8221;. If you click on the dialog then it will take you to the Job Manager where you can display the results. Otherwise it will quietly fade away. Although you&#8217;ll never forget you have results to view as the MacVector icon in the Dock will show the number of completed jobs available in the Job Manager!</p>
<p>MacVector 11.1 will be released very shortly.</p>
<p><!-- Technorati Tags Start --></p>
<p>Technorati Tags:<br />
<a href="http://technorati.com/tag/MacVector" rel="tag">MacVector</a>, <a href="http://technorati.com/tag/growl" rel="tag">growl</a>
</p>
<p><!-- Technorati Tags End --></p>
<div id="tweetbutton241" class="tw_button" style=""><a href="http://twitter.com/share?url=http%3A%2F%2Fmacvector.com%2Fblog%2F2010%2F01%2Fgrowl-and-macvector%2F&amp;via=macvector&amp;text=Growl%20and%20MacVector&amp;related=macvector&amp;lang=en&amp;count=horizontal&amp;counturl=http%3A%2F%2Fmacvector.com%2Fblog%2F2010%2F01%2Fgrowl-and-macvector%2F" class="twitter-share-button"  style="width:55px;height:22px;background:transparent url('http://macvector.com/blog/wp-content/plugins/wp-tweet-button/tweetn.png') no-repeat  0 0;text-align:left;text-indent:-9999px;display:block;">Tweet</a></div>]]></content:encoded>
			<wfw:commentRss>http://macvector.com/blog/2010/01/growl-and-macvector/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Mac OS X updates and MacVector</title>
		<link>http://macvector.com/blog/2009/10/mac-os-x-updates-and-macvector/</link>
		<comments>http://macvector.com/blog/2009/10/mac-os-x-updates-and-macvector/#comments</comments>
		<pubDate>Thu, 15 Oct 2009 22:16:10 +0000</pubDate>
		<dc:creator>Chris</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[Releases]]></category>

		<guid isPermaLink="false">http://macvector.com/blog/?p=198</guid>
		<description><![CDATA[One of the great things about OS X is that updates are really painless to install! Actually come to mention it upgrading the OS between major versions is usually trouble free too! For minor in version updates System Update will easily download and update the OS, leaving you the user to just reboot. So really [...]]]></description>
			<content:encoded><![CDATA[<p>One of the great things about OS X is that updates are really painless to install! Actually come to mention it upgrading the OS between major versions is usually trouble free too! For minor in version updates System Update will easily download and update the OS, leaving you the user to just reboot.  So really there&#8217;s no excuse to run an older version of an OS! Of course as well as being painless OS updates generally include many important bugs found by Apple.</p>
<p>An good example of one of these bugs is a recurring one we&#8217;ve found with early versions of OS X 10.4 (Tiger).  We&#8217;ve had many users report this one.   The bug in the operating system causes MacVector to crash when loading the &#8220;online update&#8221; message from our website. Quite annoying and not our fault we assure you! Updating to 10.4.11 (the last update of this OS) solves the problem.</p>
<p>A more current example is a bug that we have just found in the search function of the file open dialogue.  If you regularly use this to open files, and you have recently upgraded to Snow Leopard, then it is likely that you will start experiencing some hangs.  Unfortunately once again the bug appears with OS X and not with MacVector, so there&#8217;s not a lot we can do!  We&#8217;ve reported the bug to Apple. In the meantime we recommend using a different way to open your files until the bug is fixed!</p>
<div id="tweetbutton198" class="tw_button" style=""><a href="http://twitter.com/share?url=http%3A%2F%2Fmacvector.com%2Fblog%2F2009%2F10%2Fmac-os-x-updates-and-macvector%2F&amp;via=macvector&amp;text=Mac%20OS%20X%20updates%20and%20MacVector&amp;related=macvector&amp;lang=en&amp;count=horizontal&amp;counturl=http%3A%2F%2Fmacvector.com%2Fblog%2F2009%2F10%2Fmac-os-x-updates-and-macvector%2F" class="twitter-share-button"  style="width:55px;height:22px;background:transparent url('http://macvector.com/blog/wp-content/plugins/wp-tweet-button/tweetn.png') no-repeat  0 0;text-align:left;text-indent:-9999px;display:block;">Tweet</a></div>]]></content:encoded>
			<wfw:commentRss>http://macvector.com/blog/2009/10/mac-os-x-updates-and-macvector/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Topology of a sequence and interface changes</title>
		<link>http://macvector.com/blog/2009/09/topology-of-a-sequence-and-interface-changes/</link>
		<comments>http://macvector.com/blog/2009/09/topology-of-a-sequence-and-interface-changes/#comments</comments>
		<pubDate>Sun, 27 Sep 2009 20:45:54 +0000</pubDate>
		<dc:creator>Chris</dc:creator>
				<category><![CDATA[Algorithms]]></category>
		<category><![CDATA[Development]]></category>
		<category><![CDATA[Releases]]></category>

		<guid isPermaLink="false">http://macvector.com/blog/?p=168</guid>
		<description><![CDATA[With each new release of MacVector we have always strived to be the easiest to use sequence analysis application. We have made some fairly radical changes to the interface over the last few years, and inevitably there are some changes that you know will confuse long time users but are needed to make the interface [...]]]></description>
			<content:encoded><![CDATA[<p>With each new release of MacVector we have always strived to be the easiest to use sequence analysis application. We have made some fairly radical changes to the interface over the last few years, and inevitably there are some changes that you know will confuse long time users but are needed to make the interface more consistent and logical.</p>
<p>One of the major changes was in MacVector 10. Prior to this release single sequences were displayed in the sequence editor with static views of the graphical map and feature table available from this single window. The actual sequence was only represented in the editor.  In MacVector 10 this changed to a tabbed window with multiple dynamic views representing the same sequence model underneath. With this more flexible approach regardless of which view you are looking at, or modifying, you are acting on the sequence directly. Furthermore because of this single sequence model all windows are linked, and you can view multiple windows (we call them replicas) of that same sequence. For example with three replicas open, one showing the sequence, another the feature table and the third the map view, then if you select a gene feature in the Feature table, then the sequence representing that gene will be highlighted, and the feature in the Map window will also be highlighted.  This is great for visualising your sequences. Especially with large ones.</p>
<p>However, this major change has left quite a few legacy functions that do not fit the new model so well.  One such function is the linear/circular button in the floating graphics palette. This button has always been a visual function, and it does not affect the real topology of the sequence underneath. There are times when it is nice to display a circular sequence in linear mode &#8211; when you want to zoom in on a crowded region for example, or looking at the MCS of a plasmid whilst cloning fragments. </p>
<p>This made complete sense with the old sequence model of MacVector. However with the new unified tabbed windows viewing a circular view of a linear sequence gives the incorrect impression that you are viewing a circular sequence.  Thus meaning that some algorithms would provide unexpected results.   A classic example is the cloning vector pBR322, which has an Eco<i>R</i>I site that crosses the origin. As a plasmid the sequence&#8217;s topology should be circular. However, if you have toggled the topology button in the sequence window, the sequence is now linear and the Eco<i>R</i> I site disappears. Prior to MacVector 11 if you now viewed this as a circular plasmid the restriction site is still absent, yet the molecule appears to be circular. That&#8217;s going to produce a confusing set of fragments on a gel!</p>
<div style="text-align:center;"><img src="http://macvector.com/blog/wp-content/uploads/2009/09/pBR322_Topology.png" alt="pBR322_Topology.png" border="0" width="371" height="487" /></div>
<p>We thought that it was safer to change this behaviour in MacVector 11, and so now although you can view a circular molecule as linear, you cannot view a linear molecule as a circular one. We could not think of any user need for viewing a linear sequence as circular, although there are many advantages for viewing a circular sequence as linear.  e.g. the aforementioned example of viewing the MCS of a plasmid.  So now whenever you view a linear sequence the floating graphics palette linear/circular button is disabled to prevent you from inadvertently changing the Map and *thinking* you are dealing with a circular molecule. It&#8217;s a fairly small change, but unfortunately it has confused quite a few users! We do think that it is safer to have made this change. Displaying misleading results is something that we definitely do not want to do!</p>
<p>Currently, only the restriction enzyme, subsequence searching and (most) nucleic acid toolbox plots consider linearity/circularity in their algorithms. However, we plan on adding much better support for circular sequences in MacVector 11.5, so we thought it important to clean up this ambiguous linear/circular issue in this release. To this end, you will also see that the Topology button has now been added to the default toolbar button set of all the sequence window tabs to help indicate it is an important property of the sequence.</p>
<div id="tweetbutton168" class="tw_button" style=""><a href="http://twitter.com/share?url=http%3A%2F%2Fmacvector.com%2Fblog%2F2009%2F09%2Ftopology-of-a-sequence-and-interface-changes%2F&amp;via=macvector&amp;text=Topology%20of%20a%20sequence%20and%20interface%20changes&amp;related=macvector&amp;lang=en&amp;count=horizontal&amp;counturl=http%3A%2F%2Fmacvector.com%2Fblog%2F2009%2F09%2Ftopology-of-a-sequence-and-interface-changes%2F" class="twitter-share-button"  style="width:55px;height:22px;background:transparent url('http://macvector.com/blog/wp-content/plugins/wp-tweet-button/tweetn.png') no-repeat  0 0;text-align:left;text-indent:-9999px;display:block;">Tweet</a></div>]]></content:encoded>
			<wfw:commentRss>http://macvector.com/blog/2009/09/topology-of-a-sequence-and-interface-changes/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>MacVector 11.0 and Mac OS X 10.6 Snow Leopard</title>
		<link>http://macvector.com/blog/2009/08/macvector-110-and-mac-os-x-106-snow-leopard/</link>
		<comments>http://macvector.com/blog/2009/08/macvector-110-and-mac-os-x-106-snow-leopard/#comments</comments>
		<pubDate>Fri, 28 Aug 2009 04:46:18 +0000</pubDate>
		<dc:creator>Nivek</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[Releases]]></category>
		<category><![CDATA[OSX]]></category>

		<guid isPermaLink="false">http://macvector.com/blog/?p=118</guid>
		<description><![CDATA[MacVector 11.0 and Mac OS X 10.6 Snow Leopard (Uncia uncia or Panthera uncia) On August 28th Apple Computer, Inc. publicly release Mac OS X 10.6 also known as Snow Leopard. With any new release of the operating system (OS) there is always the question of whether the applications you own will continue to work. [...]]]></description>
			<content:encoded><![CDATA[<p>MacVector 11.0 and <a href="http://www.apple.com/macosx/">Mac OS X 10.6 Snow Leopard</a> (Uncia uncia or Panthera uncia)</p>
<p>On August 28th <a href="http://www.apple.com/">Apple Computer, Inc.</a> publicly release Mac OS X 10.6 also known as Snow Leopard. With any new release of the operating system (OS) there is always the question of whether the applications you own will continue to work. Often companies will release a new version of their application that fixes any issues that it might have with the new OS. At MacVector, Inc. we work to make sure that our applications already support the new OS releases. MacVector 11.0 and MacVector Assembler 11.0 have been tested with Snow Leopard and any necessary fixes have already been made to keep our applications running on the latest OS from Apple Computer, Inc.</p>

<a href='http://macvector.com/blog/2009/08/macvector-110-and-mac-os-x-106-snow-leopard/mv11sl2/' title='mv11sl2'><img width="150" height="150" src="http://macvector.com/blog/wp-content/uploads/2009/08/mv11sl2-150x150.png" class="attachment-thumbnail" alt="mv11sl2" title="mv11sl2" /></a>
<a href='http://macvector.com/blog/2009/08/macvector-110-and-mac-os-x-106-snow-leopard/mv11sl1/' title='mv11sl1'><img width="150" height="150" src="http://macvector.com/blog/wp-content/uploads/2009/08/mv11sl1-150x150.png" class="attachment-thumbnail" alt="mv11sl1" title="mv11sl1" /></a>

<p>As ardent Mac fans we&#8217;re as excited as ever about this new OS, and even though we&#8217;re running development builds already, we&#8217;ll be first in line on Friday to purchase our shrink wrapped copies!</p>
<p>Talking of releases don&#8217;t forget if you are still running an old copy of MacVector you can upgrade at a highly discounted price <a href="http://macvector.com/blog/?p=108">here</a>.</p>
<div id="tweetbutton118" class="tw_button" style=""><a href="http://twitter.com/share?url=http%3A%2F%2Fmacvector.com%2Fblog%2F2009%2F08%2Fmacvector-110-and-mac-os-x-106-snow-leopard%2F&amp;via=macvector&amp;text=MacVector%2011.0%20and%20Mac%20OS%20X%2010.6%20Snow%20Leopard&amp;related=macvector&amp;lang=en&amp;count=horizontal&amp;counturl=http%3A%2F%2Fmacvector.com%2Fblog%2F2009%2F08%2Fmacvector-110-and-mac-os-x-106-snow-leopard%2F" class="twitter-share-button"  style="width:55px;height:22px;background:transparent url('http://macvector.com/blog/wp-content/plugins/wp-tweet-button/tweetn.png') no-repeat  0 0;text-align:left;text-indent:-9999px;display:block;">Tweet</a></div>]]></content:encoded>
			<wfw:commentRss>http://macvector.com/blog/2009/08/macvector-110-and-mac-os-x-106-snow-leopard/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Next Generation sequencing formats</title>
		<link>http://macvector.com/blog/2009/06/next-generation-sequencing-formats/</link>
		<comments>http://macvector.com/blog/2009/06/next-generation-sequencing-formats/#comments</comments>
		<pubDate>Wed, 24 Jun 2009 13:26:19 +0000</pubDate>
		<dc:creator>Chris</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[NGS]]></category>

		<guid isPermaLink="false">http://macvector.com/blog/?p=100</guid>
		<description><![CDATA[As is common with the lack of standards seen with most emerging technologies there are many different and competing types of sequencing file formats for storage of short read or next generation sequencing data. All these formats try to solve the same question of storing an almost unprecedented amount of sequence data in a useable [...]]]></description>
			<content:encoded><![CDATA[<p>As is common with the lack of standards seen with most emerging technologies there are many different and competing types of sequencing file formats for storage of short read or next generation sequencing data. All these formats try to solve the same question of storing an almost unprecedented amount of sequence data in a useable and complete format. However, one emerging format that seems very appropriate for this type of data is <a href="http://en.wikipedia.org/wiki/FASTQ_format">Fastq</a>.  </p>
<p><a href="http://en.wikipedia.org/wiki/Phrap">Phrap</a> was one of the first real high throughput assemblers that could also deal with quality scores (generated by its stablemate <a href="http://en.wikipedia.org/wiki/Phred_base_calling">Phred</a>). The general input files to Phrap are a single Fasta file containing the reads, and an associated Qual file that contains quality scores for each and every read in the Fasta file. I&#8217;m not sure what it is about bioinformaticians, but they always feel the need to add Yet Another Format rather than reuse one of the many decent formats.  However, in this case Fastq is a logical progression of the Fasta + Qual format in that the two individual files are now merged. That is each read comprises of four lines; A label header, a sequence, and second label header and a quality score line.  </p>
<p><em>Here&#8217;s an example of such a file. Note that this example has a single &#8216;+&#8217; character to indicate the quality label line, rather than a duplicate of the label.</em></p>
<pre>@ERR000955.3982 IL6_1091:3:1:210:502/1
TCCAAACACACTTTGTGTAGAATCTGCAAGTGGAGAT
+
>>>>>>>>>>>>>>>>>>>;>>>>><>>>;>>;><>></pre>
<p>Many of the raw file formats (sff/ SRF etc) are big! They contain the raw image files as well as basecalled sequence and quality data.  Fastq files are at the complete opposite end.  They are small, and only contain minimal data. They may contain millions of reads, yet are still in the less than a tenth of a Gigabyte range for a single run&#8217;s data. It is worthwhile noting that the various Short Read Archives (NCBI, EBI etc.) require the submission of original raw image files, but only allow the reads to be downloaded in fastq format.</p>
<p>The quality line must comprise of the same number of characters as bases. i.e. one quality character per base. However, most quality scores are double digits. Fastq gets around this by using an ASCII character to encode the quality score. However, here&#8217;s where consistency fails. The quaility line mat be one of three different types.  Sanger format will encode a <a href="http://en.wikipedia.org/wiki/Phred_quality_score">Phred quality score</a> of 0 &#8211; 60 using the ASCII characters 33 to 93. The latest Illumina 1.3 format will also contain a Phred Quality score from 0 to 40 however, this time encoded using ASCII 64 to 104. Finally the older Illumina (nee Solexa) 1.0 format has its own Solexa/Illumina quality score from -5 to 40 encoded using ASCII 59 to 104.  Of course this does now pose problems, as unless you know which quality score was used, there is now way of knowing without guesswork, which it is.</p>
<p>There are also other issues with this format. It could be said that the label line for the quality score line is redundant, and the filesize could be reduced by 25% if this was removed. Some applications do generate and accept fastq files that have a single &#8216;@&#8217; or &#8216;+&#8217; in place of the quality label line. </p>
<p>It would  be helpful to see a tightening of this format and indeed there is a <a href="http://illumina.ucr.edu/ht/documentation/standardized-fastq-format-aka-fastq2">fastq2</a> format that does not have these weaknesses.</p>
<p>With the next release of MacVector and Assembler during the Summer we will be adding support for the Fastq format. We will also be adding support for <i>de novo</i> assembly of short read data. This release is currently in internal beta testing, and will be out for a public beta trial soon.</p>
<div id="tweetbutton100" class="tw_button" style=""><a href="http://twitter.com/share?url=http%3A%2F%2Fmacvector.com%2Fblog%2F2009%2F06%2Fnext-generation-sequencing-formats%2F&amp;via=macvector&amp;text=Next%20Generation%20sequencing%20formats&amp;related=macvector&amp;lang=en&amp;count=horizontal&amp;counturl=http%3A%2F%2Fmacvector.com%2Fblog%2F2009%2F06%2Fnext-generation-sequencing-formats%2F" class="twitter-share-button"  style="width:55px;height:22px;background:transparent url('http://macvector.com/blog/wp-content/plugins/wp-tweet-button/tweetn.png') no-repeat  0 0;text-align:left;text-indent:-9999px;display:block;">Tweet</a></div>]]></content:encoded>
			<wfw:commentRss>http://macvector.com/blog/2009/06/next-generation-sequencing-formats/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>MacVector, Universal Type Identifiers and File Extensions</title>
		<link>http://macvector.com/blog/2009/05/macvector-universal-type-identifiers-and-file-extensions/</link>
		<comments>http://macvector.com/blog/2009/05/macvector-universal-type-identifiers-and-file-extensions/#comments</comments>
		<pubDate>Sat, 02 May 2009 22:32:40 +0000</pubDate>
		<dc:creator>Nivek</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://macvector.com/blog/?p=14</guid>
		<description><![CDATA[One feature that Macintosh applications have typically provided since the very beginning has been the ability to associate a document with an application such that opening the document (double clicking) would launch the appropriate application. Historically, on the Mac OS, this has been accomplished by including two additional pieces of information stored in the directory [...]]]></description>
			<content:encoded><![CDATA[<div>One feature that Macintosh applications have typically provided since the very beginning has been the ability to associate a document with an application such that opening the document (double clicking) would launch the appropriate application. Historically, on the Mac OS, this has been accomplished by including two additional pieces of information stored in the directory entry for the document: type and creator.  The document type is a 4 character identifier such as &#8216;TEXT&#8217; or &#8216;NUCL&#8217; which identifies the format of the document. The document creator is another 4 character identifier that is unique to an application &#8211; for example, MacVector uses the identifier &#8216;MVTR&#8217;. Although for the most part this has worked well as a Macintosh specific solution, there are a number of issues that have arisen, such as what should happen when a document was on one floppy disk and the application was on another.</div>
<div>A greater problem has arisen with the increase in computer networking and the integration of Macintosh computers into mixed computing environments. These days, files often come from many different computers using differing protocols to move the data between machines.  To solve this problem, Apple Computer developed a new technology known as <a title="Uniform Type Identifiers" href="http://developer.apple.com/documentation/Carbon/Conceptual/understanding_utis/understand_utis_intro/understand_utis_intro.html#//apple_ref/doc/uid/TP40001319-CH201-SW1" target="_blank">Uniform Type Identifiers</a>, or UTIs for short, that help software developers associate documents with their application even in a multi-platofrm environment. UTIs allow documents created on a Macintosh that might use the old type and creator, files created on a PC that might use file extensions or even files on served up by a web server using <a title="MIME" href="http://en.wikipedia.org/wiki/MIME" target="_blank">Multipurpose Internet Mail Extensions</a> (MIME type) to all be associated with a specific application.</div>
<div>MacVector 10.5 uses UTIs and adds the common extensions and MIME types used by bioinformaticians to simplify the handling of sequence files obtained from different sources. So, for example, a UTI called &#8220;public.text&#8221; can have a type of &#8216;TEXT&#8217;, an extension of .txt or .text and a MIME type of &#8220;text/plain&#8221;. While there are no official assignments for sequence files, there are a number of common conventions. Again, for an example, a UTI &#8220;biosequence.genbank&#8221; might have a type &#8216;NUCL&#8217;, extensions .gb, .gen or .gbk, and a MIME type of &#8220;chemical/seq-na-genbank&#8221; or &#8220;chemical/x-genbank&#8221;. The system is still not perfect &#8211; for example, many sequence formats are represented as simple text files with a .txt extension, so MacVector will typically not be the default application to open such files. In these cases, you need to launch MacVector and open the file directly from the File menu. MacVector will then directly examine the contents of the file to determine the &#8220;real&#8221; type of the file.</div>
<div>If you are using MacVector on a Macintosh in an environment where many of your colleagues are using PCs, Unix machines or downloading sequence data from the Internet, the support for UTIs in MacVector 10.5 should make it so much easier to share data with your collaborators. You can find the list of UTIs used by MacVector in the <a title="knowledge base" href="http://www.macvector.com/KnowledgeBase/macvectorfiletypes.html" target="_blank">knowledge base</a> section of our web site.</div>
<p><!-- Technorati Tags Start --></p>
<p>Technorati Tags:<br />
<a rel="tag" href="http://technorati.com/tag/MacVector">MacVector</a></p>
<p><!-- Technorati Tags End --></p>
<div id="tweetbutton14" class="tw_button" style=""><a href="http://twitter.com/share?url=http%3A%2F%2Fmacvector.com%2Fblog%2F2009%2F05%2Fmacvector-universal-type-identifiers-and-file-extensions%2F&amp;via=macvector&amp;text=MacVector%2C%20Universal%20Type%20Identifiers%20and%20File%20Extensions&amp;related=macvector&amp;lang=en&amp;count=horizontal&amp;counturl=http%3A%2F%2Fmacvector.com%2Fblog%2F2009%2F05%2Fmacvector-universal-type-identifiers-and-file-extensions%2F" class="twitter-share-button"  style="width:55px;height:22px;background:transparent url('http://macvector.com/blog/wp-content/plugins/wp-tweet-button/tweetn.png') no-repeat  0 0;text-align:left;text-indent:-9999px;display:block;">Tweet</a></div>]]></content:encoded>
			<wfw:commentRss>http://macvector.com/blog/2009/05/macvector-universal-type-identifiers-and-file-extensions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Patch Releases</title>
		<link>http://macvector.com/blog/2009/04/patch-releases/</link>
		<comments>http://macvector.com/blog/2009/04/patch-releases/#comments</comments>
		<pubDate>Fri, 17 Apr 2009 15:58:06 +0000</pubDate>
		<dc:creator>Kevin</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[releases]]></category>

		<guid isPermaLink="false">http://macvector.com/blog/?p=24</guid>
		<description><![CDATA[A trail of ants came marching though my kitchen last week. I found the hole they came in and plugged it up. There, that’s sorted them. Two days later they were back, this time through a different hole. Bugs – I hate them. Just like software. There are probably as many different approaches to software [...]]]></description>
			<content:encoded><![CDATA[<p>A trail of ants came marching though my kitchen last week. I found the hole they came in and plugged it up. There, that’s sorted them. Two days later they were back, this time through a different hole. Bugs – I hate them. Just like software.</p>
<p>There are probably as many different approaches to software bugs as there are software applications. And none of them are perfect – I can’t think of a single application or OS that hasn’t crashed on me at least once over the years. So given that bugs are a reality in most (all?) software products, what is the best approach to reducing their impact once the product has been released? As a Product Manager over the past 10+ years, I’ve been involved in 20 to 30 major releases and probably twice as many minor ones across a variety of products and platforms. They all had one thing in common – within 2 weeks of release, at least one customer reported a bug that, had we known about it before release, we would have fixed before it went out.</p>
<p>At one company I was at, the policy was to “test, test, test” with the QA department somewhat removed from the R&amp;D department.  A huge battery of mostly manual tests was run on every release. If a single line of code was changed, all had to be run again, on every combination of OS and platform supported. Each round of testing took a minimum of a week with every developer pulled off coding to help. Finally, the product was released and, inevitably, a bug report would come in almost straight away.. But, because of the extended release cycle, fixing that bug would take all of the developers and testers time for two weeks, during which time other bug reports would come in. In this way, four months pass before a patch was released, during which time no progress has been made on the next major release. This starts a vicious cycle where at the release post mortem, the QA department insists on more testing for the next version, so that release gets delayed even longer and when the inevitable bug reports come in, an even longer delay before that patch is released.</p>
<p>At MacVector, Inc, we’ve taken a more streamlined approach. We certainly do a lot of testing, but we rely on a much tighter integration between the QA and R+D departments to only retest areas that the developer believes would have been affected by the bug fix. That reduces our bug fix development cycle time dramatically, so we can have patch releases posted on the web site within a week of the initial report. The risks, of course, are those “Whack-a-Mole” bug fixes where the code to fix one bug reveals another bug elsewhere in the application. But if that happens, we know we can turn around a second bug fix in a week or so, not the six months that my old company took.</p>
<p>So the take home lessons are (a) check back to the <a title="MacVector, Inc." href="http://www.macvector.com">MacVector web site</a> often to pick up any new patches and (b) I really need to find out where those ants are coming from…</p>
<div id="tweetbutton24" class="tw_button" style=""><a href="http://twitter.com/share?url=http%3A%2F%2Fmacvector.com%2Fblog%2F2009%2F04%2Fpatch-releases%2F&amp;via=macvector&amp;text=Patch%20Releases&amp;related=macvector&amp;lang=en&amp;count=horizontal&amp;counturl=http%3A%2F%2Fmacvector.com%2Fblog%2F2009%2F04%2Fpatch-releases%2F" class="twitter-share-button"  style="width:55px;height:22px;background:transparent url('http://macvector.com/blog/wp-content/plugins/wp-tweet-button/tweetn.png') no-repeat  0 0;text-align:left;text-indent:-9999px;display:block;">Tweet</a></div>]]></content:encoded>
			<wfw:commentRss>http://macvector.com/blog/2009/04/patch-releases/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

