General musings from the MacVector team about sequence analysis, molecular biology, the Mac in general and of course your favorite sequence analysis app for the Mac!

Simple Assembly of Sanger Sequencing Files with MacVector Assembler

With MacVector Assembler, assembling ABI Sanger Sequencing files is simple, fast and accurate. MacVector uses the popular phred/phrap/cross_match set of tools from the University of Washington. To improve accuracy, and to help resolve repeats, these tools use “quality scores” (popularly known as “phred scores”), giving them an advantage over many other methods. To assemble two or more ABI files, follow these steps.

  • Use File | New | Assembly Project to create a new project
  • Click on the Add Seqs toolbar button and select all of your ABI (or SCF) chromatogram files to import
  • Click on the phred toolbar button – this re-calls the traces and generates quality scores (no need to select any items in the project, though you can to run phred on specific files)
  • Click on the phrap toolbar button and accept the defaults
  • After phrap has run, you will be presented with one or more contigs (assuming your ABI reads actually overlap). If you double-click on one of those, a contig editor will open letting you view and edit the actual alignments.


    Not sure if you have Assembler? Choose MacVector | About MacVector. If the screen that appears says “MacVector with Assembler, Pro Edition” then you have it. If not, you can sign up for a fully functional 21 day trial version.

    Posted in Tips | Tagged , , , | Comments closed

    An overview of assembling sequencing data with MacVector’s Assembler plugin

    To assemble various types of sequencing reads, follow these steps.


    • Choose File | New | Assembly Project to create a new empty project file.

    Then follow one of the following:

    To create a de novo assembly from Sanger reads

    • Click on the Add Reads tool bar button, then select the sequence files you wish to assemble and click on the Open button. Read(s) file(s) can also be drag and dropped on the open Assembly Project window.
    • Click Phred to basecall sequences in the project. Note that if no sequences are selected, phred will be run on ALL of the files in the project.
    • Click Phrap to assemble the reads.

    To create a de novo assembly from NGS datasets

    • Click on the Add Reads tool bar button, then select the sequence files you wish to assemble and click on the Open button. File(s) can also be drag and dropped on the open Assembly Project window. Paired reads files are automatically detected.
    • Choose either SPAdes or Velvet to assemble the reads.

    To create a reference assembly

    • Click the Add Reads button, then select the sequence files you wish to assemble and click on the Open button.
    • Click Add Ref, select the sequence file(s) you wish to align the reads against and click on the Open button.
    • Click Bowtie to map all read files against all of the reference sequences in the project.

    Not sure if you have Assembler? Choose MacVector | About MacVector. If the screen that appears says MacVector with Assembler, Pro Edition then you have it. If not, you can sign up for a fully functional 21 day trial version

    Posted in Techniques, Tips | Tagged , , , , , | Comments closed

    MacVector video tutorials on You Tube

    There are short video tips on using MacVector on our blog and our YouTube channel. Each one is less than 2 minutes and generally shorter than that! There will be a new screencast every few weeks, so please subscribe to our YouTube channel so you don’t miss any!

    The latest screencasts include quickly annotating a gene to a sequence, confirming a small sequencing project against a reference and checking the orientation of a ligated insert using MacVector’s Restriction Digest and Agarose Gel tools.

    If there’s any tools or features you’d like to see a screencast about please do let us know…. and always remember we will come and run real live workshops for you too!

    Posted in Tips | Tagged , | Comments closed

    Viewing external database entries for features in a sequence.

    Sequences, or regions of sequences, can be linked to external databases. For example an entire sequence entry or for when annotation tools are used to annotate proteins with domain or motif information (for example InterProScan). Very useful for when you want to view more detailed or updated information. Within the Genbank specification, which MacVector extensively uses, an external database entry can be stored in a /DB_XREF qualifier. This allows the database entry to be easily viewed. The Genbank (and Genpept) specification allow for many different databases to be accessed using this qualifier.


    In MacVector the original database entry can easily be viewed in a web browser by selecting, then right clicking the feature entry in the Features tab and viewing the available DB_XREF entries. Selecting one will load it in your web browser.


    Posted in Tips | Tagged , , , | Comments closed

    Use the Replica Button For Synchronized Views

    Most primary MacVector windows (Nucleic Acid Sequence, Protein Sequence, Multiple Sequence Alignment, Align To Reference, Contig Assembly etc.) have a Replica toolbar button. If you click that button, a second window will open, potentially set to a different tab. The key to this functionality is that the two windows are linked – any selections you make in one window will reflect in the other. In most cases this means that if you select an object in, for example, the Map tab of one window, the Editor tab of the other window will actually scroll to display the selected region.

    Here’s an example where clicking on an aligned read in an Align To Reference Map tab has automatically scrolled the replica Editor tab to show the selected sequence.


    Posted in Tips | Tagged , , , , | Comments closed

    How to Identify Bacterial Promoters Using MacVector

    MacVector’s Subsequence tool is a very flexible search function that can be used for a variety of tasks. MacVector itself has a built-in variant of the function for maintaining and search primer databases (Analyze | Primer Database Search…). Each entry in the file MacVector uses as a source of subsequence data can have up to 3 segments, with variable length between the segments, along with a defined number of permitted mismatches and even a system for requiring that specific residues must match. That makes it ideal for searching for bacterial promoters. For example, the canonical Escherichia coli promoter sequence is a “-35” region TTGACA, then a gap of 16 to 18 residues, then a “-10” region “TATAAT”. You will find there is an EcoliPromoter.nsub file in the /MacVector/Subsequences/ folder. If not, you can download it. If you open the file in MacVector, you can see this.


    You can see that the file has four entries – each of these has two segments representing the -35 and -10 region, but each has additional settings that control how close a match has to be before it is reported. The names give some idea of the stringency of the match – Perfect, Probable, Possible and Weak. If you double-click on the Probable item, you get this editor.

    Posted in Tips | Tagged , , , | Comments closed

    Import Multi-Sequence Genbank Files into an Assembly Project for easy access to Features

    There are many genomes in the Genbank database that cannot be downloaded as single annotated sequences. These might be large multi-chromosome eukaryotic genomes, but, increasingly, partially sequenced bacterial chromosomes where the major contigs have been annotated using the NCBI annotation pipeline. Typically, when you encounter these, there are options to download annotated versions of these as multi-sequence Genbank formatted files. MacVector has the option to open any file containing multiple sequences as either a Multiple Sequence Alignment document or as individual Sequence documents. This is not always optimal if you have more than a handful of sequences in the file. However, if you use MacVector with Assembler, you can import these sequences into a project using the Add Ref toolbar button and the individual sequences will not only be displayed in the project window, but, if you double-click on one, the complete annotated sequence will be opened.GenbankintoAssemblyProject

    This is a great way to view and/or sort collections of annotated sequences in Genbank format that cannot be done directly through the Apple Finder. Once opened, you can Export… any sequence in another format if you wish.

    Posted in Tips | Tagged , , , , | Comments closed

    Opening multiple sequences as alignments or individual sequences

    Many sequence formats contain multiple concatenated sequence entries. For example FASTA and Genbank are two formats capable of storing multiple individual sequences.

    By default MacVector will treat such sequences as alignments and open them in the Multiple Sequence Alignment editor. Most users who want to open such a file do want to see an alignment. Additionally if the default behaviour was to open as individual sequences, then accidentally clicking on a large alignment would result in many hundreds of individual sequence windows opening up on your desktop (do remember that holding down the OPTION key and clicking on the close button will close all open sequences).

    If you need to open such a sequence file as individual sequences, then there’s a simple option that you need to check in the FILE | OPEN dialog. This behaviour has not changed for quite some time. However, back in MacVector 13 the appearance of the dialog changed, due to a change in Apple’s current guidelines on file dialogs. Whereas the older dialog had an obvious way to see this dropdown menu, now all you see is a small OPTIONS button in the bottom left hand corner.


    To open multiple sequence files as individual files you need to check an option in the FILE | OPEN dialog.

  • Click FILE | OPEN
  • In the dialog click OPTIONS (bottom left corner)
  • Click OPEN
  • Posted in Tips | Tagged , , | Comments closed

    Restoring file associations when MacVector no longer opens your sequences

    Macs are pretty good at choosing the right application to open a document. For example when you double click on a .nucl document then it will open in MacVector. However, sometimes this file association breaks. Applications should coexist peacefully on a Mac, but sometimes a misbehaving app will corrupt these file associations and you will find that your sequence displays a generic document icon (or what’s worse a different application!). When you double click on the icon, it will no longer open in your favourite DNA sequence analysis tool!

    Luckily this is easily fixable:


    1. Select a .nucl file in the Finder
    2. Choose File | Get Info (or use command-I).
    3. In the “Open with” section, click on the popup menu and select MacVector
    4. Then click on the Change All… button to apply the change to all files.
    5. Repeat for all file types used by MacVector that are not opening correctly (e.g. .prot, .msan, .msap, .axml)
    Posted in Tips | Tagged , | Comments closed

    What can MacVector do for me?

    Here’s what MacVector can do for your lab.

    Comparing sequences

    Whatever type of alignment your sequence needs, there’s a tool in MacVector.


    CRISPR Indel Analysis: Identify insertions and deletions following CRISPR editing of a target.

    Sequence assembly of NGS data against a reference genome or compare your sequencing against your new construct.

    Translated Multiple Sequence Alignments: Align DNA sequences based on their translations.

    Align proteins against a reference great for comparing known proteins against an unknown one.

    Auto Annotation of common plasmid features to blank sequences.

    InterProScan: Scan proteins for functional domains against many databases.



    Design Cloning workflows

    As simple as dragging a fragment to a cloning vector.

    Flexible Cloning Subclone with restriction enzymes, Gibson cloning, Gateway and more.

    Cloning history Every step is documented.

    Agarose Gel to run out digested sequences. Easily identify site(s) to differentiate successful clones.

    Primer Design

    Design primers with ease.

    QuickTest Primer changes primer design. Hairpin? Nudge your primer until it goes.

    Add tails to your primers with silent restriction sites/mismatches and view reading frame changes.

    Quickly design pairs of primers click a region to get the best primer pairs to amplify it.

    Primer Database store your primers and scans sequences for potential binding sites.


    Posted in General | Tagged , , , , | Comments closed