MacVector and Assembler 12.5 are released

MacVector 12.5 was released yesterday. A few days late but we’d always prefer to wait rather than release with annoying bugs.

MV125 BowtieOverview

This is a major release with a lot of new features. Many parts of MacVector have been substantially rewritten both to keep up with OS X as well as many dialogs that have been redesigned.

The main new features are for sequence alignments. We’ve added Bowtie to Assembler for further support of NGS data assemblies. Along with variant detection using Samtools to match the existing SNP report tab.

We’ve also added additional alignment algorithms. Both Muscle and T-coffee have been added to the Multiple Sequence Alignment analysis interface, complementing the existing ClustalW algorithm. We’ve wanted both of these for a while now and judging from the results of last year’s survey so have many users.

There’s also enhanced support for segmented features as well as being able to display translated CDS features directly in the sequence editor as well as the Map view. There’s SNP reporting for the Align to Reference editor as well as greatly improved performance for its algorithm and also for the Auto Annotation tool.

MV125 TranslatedCDS inthe Editor
MacVector 12.5 is fully Mac OS X 10.7 (Lion) compatible as well as being supported on OS X 10.5 (Leopard) and OS X 10.6 (Snow Leopard). However, MacVector 12.5 is not supported on OS X 10.4 (Tiger). If you are still using Tiger (from our last survey much less than 1% of our users still use Tiger) MacVector 12.0.6 will still remain supported.

This is just a short overview, if you have a valid license download the updater or if you do not download a 21 day trial.

Technorati Tags:

Posted in Releases | Tagged | Comments closed

MacVector 12.5: Interface improvements and redesign

This is part of a series of posts about the shortly to be released MacVector 12.5.

With each release of MacVector as well as adding new features we continually redesign and rewrite many existing parts of MacVector. This is done for a number of reasons. For example:

  • to accommodate changes in OS X, where generally with every major release they make major changes under the hood.
  • to optimise workflows, where we try to respond to user feedback and make analyses both easier to use and quicker without removing any functionality.
  • to redesign dialogues and tools to make them easier to use whilst keeping all the options for the power user.
  • to take advantage of new OSX features to make life easier.
  • As a recent example you can now run a primer analysis and literally drag and drop a primer pair or their product to the sequence window to annotate those features to the sequence.

    CreateResults.png

    Here’s a few of those general improvements that have been included in MacVector 12.5.

    Interface improvements

  • In MacVector 12.5 a number of editors and lists have been rewritten in Cocoa. The restriction enzyme, proteolytic enzyme, DNA and protein subsequence editors and the Matrix editor have all been rewritten using Cocoa table views. Apart from an improved appearance, this change makes it easier to navigate and select entries in the editors, simplifying the creation of subsets (you can copy and paste between windows). In addition, you can sort the columns by clicking on the titles.

  • MV125EntrezBrowser

  • The Entrez browser has been rewritten and now permits an unlimited number of search terms to be combined for a search. So you can search for a specific gene in two different organisms at the same time. For example in the screenshot we are searching for the gene “ADH1” in two different yeasts,
    Sacchoromyces cervisiae and Schizosaccharomyces pombe

  • Many dialogues have had error handling code added so that red information text will be displayed explaining why the OK button is not enabled if invalid parameters have been entered. This helps understand how to optimise an analysis without having to read up in depth about the various parameters.

  • Sequence Window Enhancements

  • There are new “Segmented Hollow Arrow”, “Segmented Hollow Box” and “Segmented Full Height Hollow Box” symbol. You can use these for segmented features (e.g. CDS features containing introns) in the Map tab and the exon segments will be shown joined by a line. See the screenshot below for an example of the end of a multi exon segmented feature.

  • MV125 SegmentedFeatures

  • The Editor tab can now display the translations of CDS features above or below the sequence – turn this on using the Strands button.

  • Miscellaneous Enhancements

  • Segmented subsequence matches now have all segments displayed in the text and graphical outputs. For example you can now search for canonical E.coli promoter sequences and see both the -35 and -10 regions displayed in the Map.

  • The Align to Reference algorithm now runs as a standard job so that you can switch to other windows and carry on using MacVector while waiting for lengthy alignments to complete.

  • The maximum number of results that can be returned by the Align to Folder function has been increased to 32,767.

  • The SNP tab of the Align to Reference Window now lists all of the changes between the consensus and the reference sequence and includes the codon change and amino acid change of any CDS feature crossing each SNP.

  • ..and remember if you want to add Assembler before MacVector 12.5 is released you can do so at half price. You can also purchase an upgrade or a new license before the release of MacVector 12.5 and get a 10% discount. Both these offers include a free upgrade to MacVector 12.5 when it is released. Please request a quote now. Don’t forget to quote the promotional code of “Assembler50%” or “MV12510” for the discounts.

    Technorati Tags: ,

    Posted in Development, Releases | Tagged | Comments closed

    MacVector 12.5: Creating reference assemblies with Bowtie

    This is part of a series of posts about and leading up to the release of MacVector 12.5.

    With Assembler 12.5 our developers have come up with an affordable and straightforward solution for assembling and visualizing your NGS data. Generating sequencing data is cheaper than it has ever been, however, with this increase in data has come a problem with analysis. Assembler will now create reference assemblies with just a few mouse clicks using Bowtie. Instead of sending your millions of reads away to be assembled or delving into complicated software tools you’ll be able to align millions of NGS reads to multi megabase reference sequences in literally minutes. Bowtie is a fast algorithm, and although it’s an ungapped assembler, what it loses in accuracy it makes up for in speed. You do not need a 32GB 8core Mac Pro to assemble your data. In addition to the existing phrap/phred tools this makes Assembler a simple, cost-effective solution to analyzing your Next Generation Sequencing reads.

    Creating a Reference Assembly

  • – Chose File | New | Assembly Project to create a new empty project file.
  • – Click on the Add Reads tool bar button, then select the sequence files you wish to assemble and click on the Open button. Read(s) file(s) can also be drag and dropped on the open Assembly Project window.
  • – Click on the Add Ref tool bar button, then select the sequence file you wish to align the reads against and click on the Open button.
  • – Choose Analyze | Bowtie to run the Bowtie algorithm on all of the sequences in the project. Note that if no sequences are selected, Bowtie will be run on ALL of the files in the project. However, if any sequences are selected then the reference sequence and at least one reads file must be selected.
  • Your reference sequence can be in any “openable” format. However, your reads need to be in FASTQ format.

    Hit Reporting

    In the dialogue you’ll see an important setting called Hit Reporting. Bowtie uses a concept of strata to score alignments. A stratum is defined by all reads that contain the same number of mismatches in the seed (the seed is the first “n” bases of a read which is given higher priority in scoring than the entire read). You can either show ALL ALIGNMENTS, REPORT BEST ALIGNMENT ONLY (show the best alignment in the stratum with the least amount of mismatches) or REPORT ALL BEST ALIGNMENTS (which shows the best alignment in all strata). Which you choose depends on a few factors. For example how many references you have, how many repeated regions you expect, whether you are using a reference sequence from the same organism or a related one, and many others. Generally start with show all alignments, which is the quickest, and work from there.

    Analysis

    ..and that’s how easy it is. Of course generating results is always easier than analysing them and to help analyse your reference contig Assembler has a few useful tools. We’ll talk about variant detection in a later blog post, but the coverage map is one of the first tools that you will see upon completing an assembly.

    Using the Coverage Map

    It is extremely useful to be know the depth of reads that are aligned on your reference. Areas of low coverage indicate that you need further sequencing and peaks of high coverage can be indicative of repeats. The Map view of a reference contig will show details of the depth of reads in a coverage map with four statistics. A single plot line shows a running average of the number of reads at that point. However, an average plot is not very sensitive when viewed at a high level and so two shaded areas indicate the maximum value and the minimum value of the averaged reads at that point. As the coverage map is viewed at a lower level these three values will become increasingly closer to the extent that when viewed at, or close to, residue level these three plots will become identical. Areas of zero coverage are shown in light grey. Note that these areas are always displayed even when they are disproportionate to the level of magnification.

    MV125 ReferenceContigCoverageMapSymbols

    Multiple reference sequences

    You can add multiple reference sequences and depending on the settings reads will be aligned against the best match or against multiple ones. This is great for such tasks as identifying a sequenced isolate amongst a series of closely related strains of virus or bacteria. Having multiple reference sequences helps determine which is most closely related (or identical) to the isolate.

    Paired end reads

    Paired end reads are very useful for improving the accuracy of alignments and also for indel detection. Paired end reads are created by sequencing both ends of the same DNA molecule, with known fragment size. Since the two reads are now separated by a known distance assembly and orientation of the two reads is less complicated. For Assembler if your reads are paired end all you need to do is ensure that the same filenames but appended with version numbers and Paired End assembly is enabled.

    e.g.

    READS_1.fastq
    READS_2.fastq

    You’ll also need to input the fragment size.

    In the next Assembler post we’ll talk about variant detection.


    ..and remember if you purchase an upgrade or a new license before the release of MacVector 12.5 you can get Assembler with a 50% discount and a free upgrade to MacVector 12.5 when it is released. This offer ends on 1st December. Please request a quote now. Don’t forget to quote the promotional code of “Assembler50%”

    Technorati Tags: ,

    Posted in Releases | Tagged , , | Comments closed

    MacVector 12.5: Get Assembler for half price!

    Our latest version, MacVector 12.5, will be released on the 1st of December, 2011. There’s some great new features in this release with the addition of extra alignment algorithms, Muscle and T-coffee, to the Multiple Sequence Alignment analysis interface, as well as many interface enhancements and performance improvements.

    MV125 MuscleTcoffee

    However, the most important new feature in MacVector 12.5 is the addition of Bowtie to MacVector’s sequence assembly module, Assembler to create reference assemblies with your NGS data. Additionally, the Contig Editor has been enhanced with some useful features to visualise your genome sized alignments. With these improvements, our developers have come up with an affordable and straightforward solution to assembling your NGS data. Generating sequencing data is cheaper than it has ever been, however, analyzing it is not. Instead of sending your millions of reads away to be assembled at great expense, or spending time on the computer trying to assemble your reads, Assembler will now create reference assemblies with just a few mouse clicks.

    To mark the release of MacVector 12.5 and this great new functionality in Assembler we’re offering users an opportunity to add Assembler to your MacVector license for half price. Not only that but if you have an older license of MacVector and want to upgrade your license then you’ll still be able to add Assembler at half price. If you do not have MacVector and want this great new functionality then you’ll still be able to add Assembler to a new license for half price.

    If you take advantage of this offer, you will not only get a great price on Assembler, but you will get MacVector 12.5 and a free upgrade to MacVector 12.6 a few months after. MacVector 12.6 will include even more NGS support as well as a Quick-test function to easily analyze your existing primer sequences, and more. Plus, your license will include a year’s maintenance which will not start until MacVector 12.5 is officially released on the 1st December, 2011.

    Remember that this half-price offer ends on 1st December, and the price of Assembler is going up next year. Please request a quote now. Don’t forget to quote the promotional code of “Assembler50%”

    If you want to see the new functionality then we have a prerelease preview available for download. This is not yet a release candidate but is part of our external beta program

    ..and remember both MacVector 12.0.6 and MacVector 12.5 are fully supported on OS X Lion.

    MacVector 12.5 will be released on the 1st of December, 2011

    MV125 BowtieOverview

    Technorati Tags:

    Posted in Releases | Tagged , , , | Comments closed

    MacVector 12.5: Sequence Assembly made easy.

    This is part of a series of posts about and leading up to the release of MacVector 12.5.

    Assembler has always made it easy to assemble your sequencing projects. It hides the complicated algorithms and provides a point and click interface to show you the results. With the release of MacVector 12.5 the range of tasks that Assembler will perform has been greatly extended. Now with the addition of high throughput reference assembly using the popular Bowtie algorithm, Assembler can now support the alignment of many millions of NGS reads against genome sized references. Assembler will also generate output in the popular BAM format. Collections of contigs can be exported in FastQ format for additional analysis. Additionally the interface has been enhanced to increase the number of reads that can be submitted for de novo assembly. ..and to analyse your assemblies SNP detection and reporting has been enhanced with VCF output from Bowtie alignments and listing of all the codon and amino acid changes between the consensus and reference sequence alignments.

    Here’s a selection of tasks that Assembler will make easy for you

    De novo assembly of Sanger trace files

    Add in your Sanger sequencing trace files, basecall the reads to improve accuracy then assemble using quality scores.

    de novo short read Assembly

    Add short reads from a variety of sources in FASTQ format as well as Sanger sequencing. Great for Hybrid assemblies

    MV125 BowtieOverview

    Reference assembly to identify SNPs in a bacterial/viral isolate

    Add a Reference sequence and an NGS file(s) representing the sequence of an individual isolate and assemble using Bowtie. View a report listing all the potential SNPs based on the differences between the consensus and the reference. Be able to quickly identify the genes that the SNPs lie in and drill down to view the nucleotide and amino acid changes.

    De novo bacterial assembly assisted by a Reference scaffold

    Create a reference assembly using Bowtie, then take the individual contig consensus sequences along with all the input NGS Reads that did not assemble and assemble with a de novo assembler (Phrap) directly from within a new Assembly Project.

    Assembly to multiple similar references

    Take reference sequences from a series of closely related strains of virus or bacteria. The reads come from a single isolate. Use Bowtie to assemble these against the collection of References to determine which is most closely related (or identical) to the isolate.

    Assembly to multiple dissimilar references to identify SNPs

    Essentially similar to the bacterial SNP assembly, but using yeast or some other organism with multiple genomes (or even some bacteria that have multiple chromosomes or large plasmids).

    Exome (transcriptome) sequencing)

    Use the genomic sequence of an organism as a reference and align reads from sequenced mRNA, cDNA or total RNA of that same organism. Uses are splice site junction identification, novel gene identification amongst many others.

    MV125 ReferenceContigCoverageMapSymbols


    ..and remember if you purchase an upgrade or a new license before the release of MacVector 12.5 you will get a 10% discount and a free upgrade to MacVector 12.5 when it is released. Contact Sales@macvector.com and quote “MV12510” for the discount.

    Technorati Tags: ,

    Posted in Releases, Techniques | Tagged , , | Comments closed

    Zooming to the sequence level in the Map view

    MacVector11MapSequenceLevel.png

    ClickCloningStaggeredEditorEnds_MV12.png

    When zoomed to the residue level it’s easy to check restriction enzyme ends.

     

    Technorati Tags:

     

    Posted in Techniques | Comments closed

    MacVector 12.5 new features

    This is the first in a series of posts about and leading up to the release of MacVector 12.5.

    MacVector 12.5 will be released shortly. As long as testing shows up no obstacles it will be soon entering the external beta stage. If you want to participate in the beta program please do email support.

    This is a major release with a lot of new features. Many parts of MacVector have been substantially rewritten both to keep up with OS X as well as many dialogs that have been redesigned.

    MacVector 12.5 will be fully Mac OS X 10.7 (Lion) compatible (MacVector 12.0.6 is too). It will also be supported on OS X 10.5 (Leopard) and OS X 10.6 (Snow Leopard). However, MacVector 12.5 will not be supported on OS X 10.4 (Tiger). If you are still using Tiger (from our last survey less than 1% of our users still use Tiger) MacVector 12.0.6 will still remain supported.

    The main new features are for sequence alignments. We’ve added Bowtie to Assembler for further support of NGS data assemblies. Also extra alignment algorithms,Muscle and T-coffee, have been added to the Multiple Sequence Alignment analysis interface, complementing the existing ClustalW algorithm. We’ve wanted both of these for a while now and judging from the results of last year’s survey so have many users.

    Watch the blog for posts detailing these and other new features over the next few weeks,


    ..and remember if you purchase an upgrade or a new license before the release of MacVector 12.5 you will get a 10% discount and a free upgrade to MacVector 12.5 when it is released. Contact Sales@macvector.com and quote “MV12510” for the discount.

    MV125 ReferenceContigEDITOR

    Technorati Tags:

    Posted in Releases | Tagged , | Comments closed

    Problems with DMG appearing not to mount.

    Sometimes when you download the MacVector disk image and try to mount it nothing appears to happen.

    Generally this is because although it has been mounted a Finder window has not popped up. It also affects those of us with messy desktop (not me of course!). If you have many files then you might miss the MacVector CD icon.

    The default behaviour for when you mount a disc image file (DMG) is to show a Finder window with that mounted DMG. However, this is configurable and not from an obvious place. You need to launch Disk Utility and go to PREFERENCES. Ensure it’s selected to “AUTOMATICALLY OPEN ……..”.

    Do that and remount your DMG and it will appear.

    DISKUTILITY Prefs

    Technorati Tags:

    Posted in Tips | Tagged | Comments closed

    Steve Jobs RIP

    Steve Jobs RIP

    612px Steve Jobs Headshot 2010 CROP

    Credit: Matthew Yohe

    Technorati Tags:

    Posted in General | Tagged | Comments closed

    MacVector 12.0.6

    MacVector 12.0.6 was released a few days ago. This has a few small but significant bug fixes. If you are running MacVector 12 it’s a recommended upgrade.

    Posted in Releases | Tagged | Comments closed