101 things you (maybe) didn’t know about MacVector: #41 – Extracting raw data from chromatogram files

Have you ever wanted to know exactly what the total signal value is for individual peaks in a chromatogram file? Perhaps you are looking for mixtures of residues at a particular location and want to get some idea of the relative proportions? You can open .ab1 and .scf chromatogram files directly in MacVector and view the “areas under the curve” in the Raw Data tab. You can open a chromatogram file by selecting File | Open in MacVector, or by dragging the file onto the MacVector application icon, whether it be on the dock, on your desktop, or in the MacVector application folder. Double-clicking on an .ab1 or .scf file *may* open that file in MacVector, depending on the other applications you have installed on your machine.

D01a scf Editor

When you click on the Raw Data tab you get a textual representation of the data.

D01a Raw Data

The first four columns are fairly self explanatory – the number of the residue, the residue at that position (A, G. C, T etc), the quality value if set (e.g. after a phred basecall) and the position of the peak in the chromatogram.

The “Tot. Area” value is the total value of all four traces under the curve for that peak. It should be noted that MacVector calculates the sides of each peak as the mid way point between the current peak and the peaks on either side. It does not attempt to account for the “wide base” you might see when an isolated residue’s trace bleeds into adjacent peaks.

Following on from that, the “A Area” is the value under the peak for the “A” trace, and the “A%” is the percentage of the entire signal that the “A” area represents. This repeats for the “C”, “G” and “T” traces.

Finally, the “mix” columns list the traces that exceed the % mix at that position to help you identify potential mixed alleles at each position. So, if you see “AGC” at the “15% mix” level, that means the A, G and C traces both represented more than 15% of the signal at that position. The “45%” mix is the most stringent – if you see a pair of residues in that column, that is a good sign that this is a position you might investigate in more detail if you are looking for allelic differences in a sample. Of course, because of sequencing artifacts, the beginning (as in this example) and end of sequencing runs are most likely to contain errors.

The entire plot can be selected, copied, and pasted into Microsoft Excel if you want to analyze the data in more detail. The data is presented in tab-delimited format so that when you copy and paste, each value gets pasted into a separate cell.



This is an article in a long running series of tips to help you get the most out of MacVector. If you want to get notified every time a new tip gets published, follow us @MacVector on twitter (or check the feed for the hashtag #101MacVectorTips) or like us on Facebook.

Posted in 101 Tips | Tagged | Comments closed

UNIX line endings for plain text files

Sometimes you may need to export data out of MacVector for further analysis with a command line tool or sending a file to a colleague who does not have MacVector). For example exporting the VCF report out of a reference assembly or exporting a sequence into Genbank format.

If these files are plain text, then you need to be careful about the format of the text. Mainly the control characters used to indicate the end of a line (EOL) have varied between different operating systems.

Historically Macs (pre OS X) used CR, DOS/Windows/PC used CR and LF and UNIX computers used LF.

Nowadays OS X uses UNIX line endings, it is UNIX underneath after all!

These different characters show their printer heritage when you look at what they actually mean:

CR = Carriage Return
LF = Line Feed.

MacVector has a setting to control what format is used. By default this will be LF (i.e. UNIX) and that version is most suitable for command line analysis on other OS X or UNIX machines.

General

However, many versions ago the default was CR (i.e. OS 9). This means that if you have been using MacVector for many years then it’s possible that this setting has stuck (MacVector does try to preserve your user settings between versions).

You can easily change it using the dialogue above. Alternatively you can reset (and refresh) your settings with this tool.

…and tell your colleagues without MacVector, that MacVector Free allows you to open, view and print sequences from MacVector files. It has a surprising amount of functionality!

Posted in Support, Tips | Tagged | Comments closed

101 things you (maybe) didn’t know about MacVector: #40 – Removing gaps from a DNA or protein sequence

There are often times when you end up with a sequence containing gaps, especially if you make extensive use of the Align To Reference, Contig Assembly or Multiple Sequence Alignment interfaces to generate consensus sequences. You can select and copy the consensus sequence, or even individual aligned reads, from the Align To Reference and Contig editor tabs. The copied sequence can then be pasted into a sequence editor window, or a new document can be created by choosing File | New From Clipboard. In either case, if any gaps were present in the copied sequence, these will be preserved when pasted. MacVector does this because there are times when you might want those gaps preserved and, as you will see, it is a lot easier to remove them after the paste than it would be to type them back in by hand!

You can search for gaps and remove them using the Edit | Find | Find… menu item. The key to this functionality is that you must first check the Literal box before you can add a gap character in the Find box;

FindGaps

Just leave the Replace box empty, then when you click on the Replace All button, all of the gaps will be deleted. Note that if the Replace All button is inactive, that usually means that no gaps are actually present in your sequence.



This is an article in a long running series of tips to help you get the most out of MacVector. If you want to get notified every time a new tip gets published, follow us @MacVector on twitter (or check the feed for the hashtag #101MacVectorTips) or like us on Facebook.

Posted in 101 Tips | Tagged | Comments closed

MacVector 13.5.3 is out

There’s a new release of MacVector out.

If you are on OS X 10.10 to OS X 10.7 then just update through Sparkle as normal.

However, we have seen issues doing that on Snow Leopard, so please use the downloadable installer instead.

Ironically some of the bug fixes are for OS X 10.6, but there’s a few other important ones as well.

Posted in Uncategorized | Tagged | Comments closed

MacVector Talk: January 2015: A review of 2014 and what’s coming in 2015.

MacVector and Assembler: What’s Next!

We strive to make MacVector the best sequence analysis app for the Mac. This means our developers are continually at work redesigning every aspect of MacVector, not only to improve functionality, but also to ensure that MacVector is always optimized for the latest versions of OS X. In addition we add new tools, and redesign old ones, to keep ahead of new trends in the molecular biology lab.

This results in multiple releases every year as well as many minor bug fix releases. As a small company, we are agile and able to respond to bug reports very quickly. As far as software development goes, bigger is not always better.

Here’s a short review of what we did last year and what is on our “to do” list for this year. We hope you find this useful and informative.

The MacVector team.

2014 Review:

There were two major releases of MacVector in 2014: MacVector 13 at the beginning of the year and MacVector 13.5 towards the end.

As well as the more visible new tools, a lot of small enhancements and a great deal of under the hood work was completed last year. The Mavericks and Yosemite OS X releases introduced a lot of changes, not least the new muted flat look, and we ensured MacVector remained fully compatible and redesigned the toolbar icons to complement that look. The redesign and rewrite of the old style results windows was completed and general performance tuning ensured the last release of 2014 was the fastest yet.

Our Quicktest Primers tool was enhanced to show restriction enzyme sites around the binding site of the primer. Potential “one-out” sites are shown and are colored to show whether they will affect encoded protein sequences.

Velvet was added to Assembler for de novo assembly of NGS data. Reference assemblies show the read coverage of every CDS and gene for comparison of expression levels across samples for RNA-Seq and ChIP-Seq.

The performance of the Align to Reference and Align to Folder algorithms was enhanced and you can retrieve and/or export hits in Align to Folder. These enhancements allow easier processing of NGS reads if you are just analyzing a single gene.

What’s coming in 2015?

During 2015 there will be multiple releases. The first release will be MacVector 14 and is very significant as it will be a fully 64 bit application. This has performance advantages when run on modern Macs, particularly the ability to handle longer sequences, larger alignments and bigger fastq files. MacVector’s rich graphics view is great for viewing your sequences and with MacVector 14 it will be optimized even for chromosome sized sequences.

We are very excited about this release and also for what this means for future releases. The transition to 64 bit is the cumulation of a lot of hard work behind the scenes that has gone into development over the past few years. Being a 64 bit application means MacVector will be better able to exploit the larger RAM capacities of modern Macs. This mean smoother and more responsive graphics, faster algorithms and many limitations, when working with very large datasets, will be gone. We are sure that you will be excited to see the levels of performance that you will see on your desktop!

Other new features in the 2015 releases will include:

  • Protein annotation: You’ll be able to scan sequences against InterProScan, which has a wide variety of protein sequence, protein families, domains and motifs databases.
  • Primer Database: All primer tools will now connect to a database of primers. You will be able to scan sequences against the database to quickly identify binding sites for your primers (available in MacVector 14).
  • Applescript support will be added for Auto-Annotate. Do you’ll be able to batch annotate folders of blank sequences using a curated library of sequences.
  • Assembler: the focus will again be on better visualization of your assembly results. You will have expression level comparison across contigs from separate samples.
  • Lastly, 2015 will see a solution for those users who only need a straightforward sequence analysis application to design primers and clone genes. A new edition of MacVector will fill the needs of these researchers.

    At MacVector we greatly appreciate hearing from users on what they want to see, so we always encourage you to keep us updated on your needs and what you need in a sequence analysis application. As always, our mission is to keep MacVector the easiest to use and best sequence analysis application for the Mac.

    Posted in newsletter | Tagged , , , | Comments closed

    Restriction enzyme sites and tooltips.

    EDIT: April 12, 2019 From MacVector 16 and later the setting for the Automatic RE Analysis is in: PREFERENCES | SCAN DNA | RESTRICTION SITES

    Quickly viewing the recognition sequence and cut site of a restriction site is very easy in the Map tab.

    If you hover your mouse over a restriction site in the Map tab then a tooltip will show you the restriction enzyme recognition site, the location of the cut site, and number of times that enzyme cuts your sequence.

    – Make sure PREFERENCES | MAP VIEW | AUTOMATIC RE ANALYSIS is turned on

    You can also see the full sequence and cut site when zoomed to sequence level in the Map tab too.

    – In the Graphics Palette click the Zoom to Residue button.

    FragmentPredictions

    Posted in 101 Tips, Tips | Tagged , | Comments closed

    101 things you (maybe) didn’t know about MacVector: #39 – Viewing sequence residues in the Map view

    OK, so this is a very simple tip that many users already know about. But we get enough queries about this functionality that its useful to go over it. The Editor tab has a fair amount of functionality, but it really is designed for editing of sequences at the residue level – we don’t directly show features or restriction enzyme cut sites in the Editor tab. However, although the Map tab normally shows a graphical overview of the sequence, you can zoom all the way in to the residue level simply by clicking on the Zoom To Residue button in the floating Graphics Palette;

    Graphics Palette

    When you click on that, you can not only see the residues and graphical representations of the features, but also the cut sites of any restriction enzymes on the sequence, including the structure of any staggered ends;

    ZoomedToResidueLevel

    If you have the Show residue letters if room option set in the Symbols editor;

    ShowResidueLetters

    Then you can actually see the translations of open reading frames in the display;

    MapTranslations



    This is an article in a long running series of tips to help you get the most out of MacVector. If you want to get notified every time a new tip gets published, follow us @MacVector on twitter (or check the feed for the hashtag #101MacVectorTips) or like us on Facebook.

    Posted in 101 Tips | Tagged | Comments closed

    MacVector, Yosemite and increasing dialog window sizes!

    There’s a small bug in Yosemite that causes the size of the dialog windows to increase in size.

    MacVector is affected by this problem.

    Basically every time you use a dialog its size increases slightly. Over time the dialog becomes too large to fit on the screen and you will have problems clicking the buttons.

    The easy workaround is to hold down SHIFT and resize the window from the edge. The new size will be saved in the preferences when you close MacVector.

    Hopefully we’ll see a fix to this in OS X 10.10.2

    Posted in Tips | Tagged | Comments closed

    A Black Friday/Cyber Monday special….Buy MacVector 13.5 and get one free.

    MacVector 13.5 is the best release yet for comparing large sequences and handling NGS data. The new interface is even more polished and further performance tuning has ensured this release of MacVector is faster than ever.

    If you do not yet have MacVector then the next seven days are just for you. Starting from today, Black Friday, for one week only, if you purchase one license you will get one free.

    Personal licenses

    If your lab has two Macs then purchase two licenses at 50% of the cost! That’s a license per Mac at a very good price.

    Standard licenses

    If you have more than two Macs then purchase two standard licenses, for the price of one, that you can use at home or at a conference and share with your colleagues back in the lab.

    Don’t miss out on this offer! Request a quote today.

    CircularSequenceSample Alignment axml Map

    We’ve worked hard to bring you MacVector 13.5 and we really hope that you’ll take advantage of this offer so that you can get these enhanced tools.

    If you are not convinced then download the trial now. However, do not forget that the offer will only be valid until the 5th of December.

    Posted in Releases | Tagged | Comments closed

    Keep Blast running on MacVector 12.7.5 and earlier versions.

    If you are running MacVector 12.7.5 or earlier Blast will stop working for you from the end of this month. The fix for this is below.

    MacVector makes it easy to run a Blast search from your Desktop, and retrieve the hits directly back into MacVector. MacVector does this by submitting these Blast searches directly to the NCBI’s Blast server in a similar way to accessing the website directly.

    A while ago the NCBI changed the URL for these searches and from the 1st of December, 2014 the deprecated URL for the Blast server at the NCBI will finally cease to function.

    We changed the URL back in MacVector 13.0. So if you are running MacVector 13.0 or later then you do not need to do anything.

    However, if you are running an earlier version then you will need to modify the URL in order to keep MacVector working.

    The old address is www.ncbi.nlm.nih.gov

    The official URL for BLAST searches is blast.ncbi.nlm.nih.gov

    How to change the URL in MacVector

  • Open MACVECTOR > PREFERENCES
  • Switch to the INTERNET tab
  • Enter the new URL into the Server pane
  • Click Apply
  • Now MacVector will use the new URL for all Blast searches.

    Of course you could just upgrade…..! MacVector 13.5 is now out and the new functionality should give you many reasons to upgrade!

    Internet

    Posted in General, Tips | Tagged , | Comments closed