GCG Wisconsin Reviewed by William C. Ray | |
| Overall scores | ||
| Installation | Excellent | |
| Learning curve (beginner who can Web surf and word process) | Shallow, but extremely long | |
| Technical support | Very good | |
| Features | Very good | |
| Customizability | Excellent | |
| Utility to biologists | Very good | |
| Value for money | Very good | |
Overview
The GCG Wisconsin Package is an integrated package featuring a comprehensive collection of DNA-, RNA-, and protein-sequence-analysis tools. Its functions span the range of sequence-analysis tasks from sequencing-run assembly, sequence editing, database searching, and pattern finding to nucleotide-chain folding. The Wisconsin Package is to the sequence-analysis world as Photoshop is to the graphics-editing world, Mathematica to the symbolic-mathematics world, and PowerPoint to the desktop-presentation world - a de facto standard that most serious users expect to be present, and some can scarcely function without.
The Wisconsin Package's greatest strength is that it is a collection of relevant pieces of software gathered into a suite. In-house programs, as well as software and algorithms adopted from other researchers, are made available under a unified interface. When new and useful software is published, Genetics Computer Group licenses, adapts, and adopts the software into the Wisconsin Package. Recent examples include the MEME/MAST suite of de novo pattern-discovery programs, the Staden sequence-analysis package, the HMMER Markov-model construction suite, and the PFAM protein-motif database. This umbrella approach to providing sequence-analysis tools removes the need for the constant collection of new software by the end user, and the requirement for users to learn new interfaces as new software is released. A frequently overlooked aspect is that the components of the Wisconsin Package are all functional as command-line programs in the Unix environment, which allows the user to automate complex tasks. Unfortunately, this feature is often considered to be an impediment by beginning users, who prefer to use the X Window-based SeqLab interface. For new users, investing as much effort in learning the Unix environment as in learning the Wisconsin Package will pay off in considerably more than doubled efficiency in their use of the software.
Available platforms | Alpha - Digital Unix 4.0E and later |
System requirements | Complete installation of the software portion of the package requires 70 to 100 Mb RAM, depending on the platform. GCG recommends a minimum of 128 Mb physical memory and 250 Mb virtual memory. Use of the optional graphical interface (SeqLab) requires X Window server software on the user's terminal. The database portion of the system requires 50 Gb hard-drive space as of February, 2001, with a doubling time of less than 12 months. The most current hardware and software requirements for the GCG Wisconsin package are available online. Depending on the intended usage, a minimally configured Unix box sufficient to run the required operating system may be all that is necessary to meet the user's needs. Many portions of the system will perform adequately with hardware considered antique by today's standards. On the other hand, certain analytical routines can take many hours, even on top-end hardware. |
Test platforms | DEC AlphaServer, 500 Mhz Alpha, 512 Mb main memory, 512 Mb swap, Digital Unix 4.0 SGI O2, 233 Mhz R5000, 256 Mb main memory, 1 Gb swap, IRIX 6.5 SGI Indigo, 50 Mhz R4000, 64 Mb main memory, 256 Mb swap, IRIX 6.5 |
Price | $18,000 (commercial) |
How Long Did It Take to Learn to Use It Productively?
The Wisconsin Package takes a day to learn, and considerably more time to master. Most novice users usually can learn to use any given component of the package to solve a particular problem in an hour or so. Learning all the optional parameters that can be used to control a component takes longer. With devoted study, it's possible to make rapid progress at learning the system as a whole, but the practical reality is that many users take months before they begin to learn to link the many components together and to automate repetitive tasks. An additional layer of complexity in the learning curve is the requirement to have a working knowledge of Unix to make full use of the system.
Product Quality
| Ease of installation | Very good |
| User friendliness | Fair |
| Interface | X-Window-based GUI, csh- and ksh-based command line, and, with the optional SeqWeb module, Web-based GUI |
| Intuitiveness of design | If you find Unix intuitive, you'll find use of the command-line interface intuitive. The GUI interface will be more intuitive for fans of Macintosh or Windows, but the desktop-computer experience does not directly translate to the X-Window environment. |
Customizability
Highly customizable, partly through customization of the program, and partly through the inherent customizability of Unix itself.
Ability to Program in Scripts, Add Extension Modules, etc.
As a collection of independent pieces of software under Unix, the Wisconsin Package suite of programs submits very nicely to control through shell scripting, the insertion of custom modules, and the addition of user-written routines.
Ability to Import and Export in Different File Formats
The majority of the programs in the Wisconsin Package use an internal, GCG-specific set of file formats, but there is a "reformat" command that reads a plethora of sequence formats and rewrites the data into GCG format. Output options are somewhat more limited. Users should be aware that GCG-supplied versions of various adopted software packages such as FASTA, have been adapted to read only the GCG format, and not the file formats that they were originally designed to use.
Useful or Unusual Features
The database subscription, an optional add-on to the basic software package, allows fast local searches of all the major public databases and is one of the most noticeable timesaving features. The real power of the GCG Wisconsin Package, however, lies in a feature that many users seem to try to avoid - the fact that it is a cooperating collection of independent pieces of software operating on top of the Unix operating system. This allows the operation of the software to be customized and extended in a way that cannot be matched by any desktop-based system.
As an example of the flexibility and power that this approach to software provides, consider a researcher examining a set of genes that appear by expression analysis to be coordinately regulated. If the researcher is interested in examining the regulation scheme and whether it is conserved outside the organism in which it was observed, GCG commands can be combined to help answer the question. The MEME motif-finding program can be used to search the upstream regions of the genes for potential regulatory motifs. The results of this search can then be fed into the MotifSearch database-search tool to find other genes in the nonredundant database that share these potential regulatory motifs. The genes found to share the motifs can be retrieved from the database and aligned with the PileUp command, pairwise distances extracted from the alignment with the Distances command, and the final results fed into the GrowTree command to see whether the genes appear to fall into related groups, possibly indicating conservation of regulation in a metabolic pathway.
At first glance, that example may seem a somewhat daunting collection of tasks, and the reader may imagine a considerable amount of pointing, clicking, typing, and general unhappiness to be associated with the process. Fortunately, although it seems daunting, the reality of the matter is that through the power of Unix and the integrated design of the programs provided by the Wisconsin Package, this process can actually be condensed to a series of commands on a single Unix command line. For the user who needs to do similar analyses repeatedly, application of only the slightest bit of Unix shell programming would condense it all down to a single reusable command.
Limitations
As might be expected, since the Wisconsin Package is primarily a command-line-based suite of programs, the GUI interface does not give the user access to the complete power of the command line, nor to the complete set of programs. A limitation that will not be expected, and one that is a continuing source of annoyance to many users, is that many programs in the suite place a limit on the maximum size of the sequences that they can handle (350 Kb) and require that longer sequences be broken up into segments under this length. GCG reports tha this limitation will be removed in version 11.
Comparisons with Similar Software
The GCG Wisconsin Package really has little competition that could be considered similar. Packages such as the DNAStar software suite available for Mac OS and Windows, while they perform a number of similar tasks and are designed around a more integrated interface, do not provide the flexibility or automation that make the Wisconsin Package so powerful. Other Unix-based sequence-analysis packages are not as complete.
Technical Support and Documentation
The documentation is styled in a form similar to Unix man pages, being mostly reference, with minimal tutorial or example information. Online searchable access to the documentation is included, although oddly it is not available through the conventional Unix "man" command. Genetics Computer Group personnel have always been very helpful, and they should be willing to assist in choosing appropriate hardware to meet the intended level of use. Technical support seems willing to assist with the creation of custom scripts to help automate individual end-user projects.
Target Users
Although the Wisconsin Package works well in a teaching environment, the typical user is a researcher needing powerful and highly configurable sequence-analysis software. The expense and learning curve do not lend themselves to one-time or casual use, being more appropriate for users looking for a long-term commitment to a powerful package.
Publisher information |
Genetics Computer Group 575 Science Drive Madison, WI 53711 Tel: (608) 231-5200 Fax: (608) 231-5202 Email: info@gcg.com Web: www.gcg.com | |||||||||||||||||||||||||||||||||||
Pricing structure |
(1) Includes the GCG Wisconsin Package, documentation, SeqLab X-Window GUI interface, and one-year software maintainance. (2) Provides Web-based access to the Wisconsin Package suite. (3) GenBank, EMBL, SP-TREMBL, PIR, SWISS-PROT, PROSITE, and others, updated every two months. Some other pricing options and several add-on services for the system are also available. All products include free technical support. A price-list or quote request form is available. | |||||||||||||||||||||||||||||||||||
Software class | Molecular biology |
|||||||||||||||||||||||||||||||||||
Will Ray is a mathematician turned computer scientist turned biophysicist who is now living as a postdoctoral fellow developing bioinformatics tools for sequence annotation.
Primer Premier 5
OMIGA 2.0
DnaSP 3.50
REALbasic 2.1.2
WebLab ViewerPro 3.7
AxoGraph 4.6 (with optional Data Acquisition Package)