Software and Databases
Software and databases contributed by NESCent Informatics staff, sponsored scientists, and collaborators. If you have products as results from NESCent-sponsored projects, please report your products through our online web application.
Recent Products (25 products found)
- Species Autocomplete. Vaidya, G. 2013. Species Autocomplete. This tool provides basic Javascript and PHP code to run an autocompletion script. An included Perl script allows the ITIS-DwCA resource to be used to generate the autocompletion database (in SQLite). Further improvements may allow this software program to be integrated into websites which would like to validate the input of scientific names, either for data entry or to power searches.
- A server side javascript controller for phylotastic products following the MVC model. A PoC javascript framework that invokes the right Phylotastic REST services at the right time.
- CDAO ontology release using OBO library conventions. The CDAO Comparative Data Analysis Ontology was revised to meet OBO library ontology standards, such as numeric class identifiers.
- CMAPOBO. Midford, P.E., 2012, CMAPOBO, This is a simple command line tool to generate simple OBO format (syntax version 1.2) files from ontology graphs created in the CMAP tool.
- Data from: Testing models of sex ratio evolution in a gynodioecious plant: female frequency covaries with the cost of male fertility restoration.. Caruso CM, Case A (2012) Data from: Testing models of sex ratio evolution in a gynodioecious plant: female frequency covaries with the cost of male fertility restoration. Dryad Digital Repository. doi:10.5061/dryad.v5n09
- DateLife. O'Meara, B.C., L. Harmon, J. Eastman, P. Midford, T. Heath, J. Brown, M. Pennell, M. Alfaro. 2012. DateLife.org. Online database and software for returning ages of most recent common ancestors given a string of taxon names.
- ITIS-DwCA. The TNRS group contacted David Mitchell, ITIS Data Specialist at the NMNH, who gave us permission to make the entire ITIS database available as a DarwinCore Archive. This file was generated by dwca-hunter (https://github.com/GlobalNamesArchitecture/dwca-hunter), a Ruby program developed by the Global Names Architecture. I wrote a small script to automatically organize dwca-hunter's results, and have been running it regularly and uploading the results to http://gaurav.github.com/itis-dwca/.
- Mesquite-o-tastic - a Mesquite package for retrieving trees from Phylotastic. Midford, P. E. 2012. Mesquite-o-tastic - a Mesquite package for retrieving trees from Phylotastic. This is a prototype package that allows a user to retrieve a tree from phylotastic that matches the taxa present in a Mesquite character matrix. See the demo video by Arlin Stoltzfus at http://www.youtube.com/watch?v=Lak-zjwFuhQ&feature=youtube_gdata_player
- Newick to CDAO Ingestor. The Newick to CDAO ingestor is a Perl module. The module takes as input a tree described in Newick format and produces a CDAO representation of the tree. The module is capable of contacting the Phylotastic TNRS to resolve names and adds the result of the name resolution to the CDAO representation of the tree.
- obo-lint. Plugin for the Protege ontology editing environment which displays axioms which can not be represented in an OBO format serialization of the ontology.
- Ontology and RDF model for Taxonomic Name Resolution Service results. . Lapp H. 2012. Ontology and RDF model for Taxonomic Name Resolution Service (TNRS) results. The ontology describes the entities that make up a TNRS result and the relationship between those and those between an OTU and a TNRS resolution result. The RDF model is accompanied by an instance document and a graph visualization.
- OpenSim grid hosting on Amazon's EC2.. Using the OpenSim software, a 6 region grid was set up on virtual machine instances at Amazon's EC2 to host the 3D environments for this project. The EC2 API tools were used to automate starting and stopping the server instances in order to reduce costs by not running the instances at night when they are not used. Monitoring software was set up to restart the OpenSim software in case of errors.
- Phenex 1.2. New release of the Phenex annotation software with new features supporting collaborative editing across file sharing systems such as Dropbox.
- Phylogeny pruner based on MapReduce with web page and services interface. Rutger Vos, 2012. A phylotastic pruning service based on MapReduce. HIP working group of NESCent. This pruner was developed to provide automated pruning services, as part of the Phylotastic project. Given a set { S } of OTU names, and the name of a source tree, the pruner returns a topology for the OTUs that it can match from { S }. This kind of pruning can be done by recursive calls into a database (which probably would need to hit the database many times) or by loading the whole tree into memory (which might take a while to read in the file, and cost a bit of memory). The way it is done here is much cooler, because it never requires the whole tree to be in memory or in a database: the pruning is done in parallel using MapReduce. Some tests on the entire dump of the Tree of Life Web Project showed that this returns a pruned subtree within a few seconds, fast enough for a web service. The pruner has two interfaces, a web forms interface (with explanatory text and examples) and a web-services interface. The code and some documentation is available at a location indicated on the web page.
- Phylomatic v3. A major upgrade to this well-used tool. Phylomatic version 3 is a fork of the phylocom phylomatic code and exists only as a web service. I can now: read in trees as NeXML and CDAO, graft in taxa to the megatree, and write out in a number of formats.
- PhyloTastic services in Galaxy. To provide end-users with a familiar graphical user interface with which to access PhyloTastic services I have developed several wrapper classes that enable interaction with TNRS, DateLife, BabelPhysh and pruning functionality within the Galaxy web application. A demo Galaxy instance is available at http://galaxy.phylotastic.org, the source code at https://github.com/phylotastic/arch-galaxy and a screen cast that demonstrates the currently available functionality is available at http://youtu.be/kMME658xOu4
- Phylotastic tree-pruning SADI service. A proof-of-concept SADI-based web service which uses RDF and SPARQL to return subtrees from larger phylogenetic trees.
- PhylotasticTreeStore. Jim Balhoff, Karen A. Cranston, Mark T. Holder, Hilmar Lapp, Emily J. McTavish, and Enrico Pontelli. June, 2012. PhylotasticTreeStore. PhylotasticTreeStore RESTful adaptor for RDF-based tree store. It is a web2py application, which provides a RESTful interface by translating queries for trees into SPARQL queries to a triple store and then using DendroPy to translate the resulting RDF to NeXML. Built at the Phylotastic hackathon. Christopher Baron, Jeet Sukumaran, and Cam Webb provided helpful feedback.
- Phylotastic Website. Landing page for the Phylotastic project
- prototype controller as Perl CGI script . The Perl controller coordinates stub CGI implementations of the Phylotastic TNRS, tree store, topology, and branch length services which produce correct output for one example input. However, the user may substitute real service implementations into the controller workflow via CGI parameters, allowing the services to be tested for conformance to the Phylotastic specification. Usage instructions for the CGI controller, along with example input/output files for the stub services, are provided at https://github.com/phylotastic/cgi.
- Reconciliotastic. Reconcili-o-tastic starts with a gene tree, discovers the species sources, gets a tree for the species on the fly (phylotastically), then runs reconciliation software to identify which branchings represent speciations vs duplications.
- Taxonomy Ontology Tool (VTOTool). Midford, P. E. 2012. Taxonomy Ontology Tool (VTOTool). This tool constructs taxonomic ontologies (e.g., Phenoscape's VTO) from multiple taxonomic sources - existing ontologies (e.g., TTO) as well as other taxonomic sources (e.g., NCBI, CoL, PBDB). As of July 2012 it generates the ontologies in an OBO format where taxonomic groups are classes and follows the conventions of the OBO rendering of the full NCBI taxonomic ontology. A earlier version of this tool was used in support of a working group.
- Taxosaurus: A Phylotastic Taxonomic Name Resolution Service. Taxosaurus is a meta TNRS that implements the TNRastic API. It's composed by 2 main modules (the handler_library and the processor) that sit behind an http handler. The handler_library implements the TNRastic API whereas the processor coordinates the execution of the downstream calls to the sources. The processor itself has a modular design that allows the addition of new service via adaptors that are registered through a simple JSON description.
- TNRastic API. The TNRastic API is a lightweight RESTful API specification that provides a generalized framework to access Taxonomic Name Resolution Services. It's composed of a set of services that are essential for name resolution.
- Tools to convert trees of Goloboff, et al. (2009) from TNT to Newick format. The phylogeny from Goloboff, et al. "Phylogenetic analysis of 73 060 taxa corroborates major eukaryotic groups" (Cladistics 2009, 25:211-230) is a valuable resource, useful for the Phylotastic project. However, it is available only in TNT format, a nested-parenthesis format like Newick. The tree file has only numeric codes, with names encoded in a separate file. Converting this information into a single Newick tree is a 3-step process: 1. convert the TNT trees (in any of the *.tre files) to newick 2. pick the tree you want and put it in a file by itself 3. replace the numeric codes in the tree with species names from Taxon_Names_Only.tnt I developed and tested Perl scripts for steps 1 and 3. The scripts contain documentation using POD. They are available from a public repository using the URL below.
