Dropbox is an on-line file storage and sharing service. 2GB of Dropbox is available for free, with subscriptions up to 100GB available. Shared folders allow people to work together on the same projects and documents.
Dropbox files are also available off-line, and folders can be synced between multiple computers and mobile devices. Dropbox therefore can be used as a backup mechanism for important files, although it is by no means a complete solution.
A distributed version control system (DCVS). Git provides a distributed development, giving each developer/user a local copy of a repository, which includes the entire revision history. Changes are copied from one repository to another. Branching and merging are easy to do. Users are not dependent on network access or a central server so Git is very fast and scales well when working with large projects.
Mathematica is a computational platform used by scientists, engineers and mathematicians. Mathematica has support for equation solving, numerical analysis, as well as graphing and visualization. Mathematica has import and export filters for tabular data, images, video, sound, CAD, GIS documents and biomedical formats. There is support for data mining tools such as cluster analysis, sequence alignment and pattern matching as well as text mining support.
MATLAB is an interactive data analysis and visualization environment that can be used to perform computationally-intense operations on large data sets efficiently. MATLAB also provides a high level programming language that supports rapid development of work-flow scripts and Graphical User Interface applications to automate repetitive tasks.
Mercurial is a free, distributed source control management tool and is used for version control of files. Mercurial is distributed, giving each developer a local copy of the entire development history.
OpenMI provides users with a standard interface that allows the construction of modeling workflows. OpenMI allows models to exchange data with each other and other modeling tools as they run, facilitating the modeling of process interactions. Models may come from many different sources, represent processes from different scientific domains, have different spatial and temporal resolutions, and have different spatial domains/representations.
Pegasus encompases a set of technologies that help workflow-based applications execute in a number of different environments including desktops, campus clusters, grids, and now clouds. Scientific workflows allow users to easily express multi-step computations, for example retrieve data from a database, reformat the data, and run an analysis.
Platform LSF is a workload manager designed for use in large, high-performance computing environments. This commercial tool can be used to schedule complex scientific workflows and manage very large (up to petaFLOP scale) compute resources. It provides application support across distributed and heterogeneous platforms.
Project Trident is a scientific workflow workbench that allows users to author workflows visually by using a catalog of existing activities and complete workflows. The workflow workbench provides a tiered library that hides the complexity of different workflow activities and services for ease of use. Trident supports: analysis and visualization worksflows; composing, running, cataloging experiments as workflows, as well as capturing of provenance information.
The Predictive Ecosystem Analyzer (PEcAn) is an integrated ecological bioinformatics toolbox and data assimilation system that synthesizes information contained in ecological models, data, and expert knowledge. This is done using modern statistical methods and state-of-the art ecosystem models. PEcAn has a web interface that enables users to run ecosystem models, as well as a suite of R packages that can be used for model-data fusion and more sophisticated analysis.
Tika java class library available through the Apache group. It supports media type detection based on file type signatures, metadata extraction and text parsing and extraction.
Supported Document Formats:
Webex is a proprietary web collaboration and meeting environment. WebEx allows users to host and join web video- and tele-conferences. WebEx requires client-side Java, a browser, and a plugin, but once installed, users can host and join meetings with a browser and no additional software. WebEx web conferencing allows any user to become the "presenter" can share an application or their entire desktop over the web with other meeting participants.
WEKA is a data mining tool. It is a collection of standard machine learning algorithms organized and presented to the user as a workbench. The algorithms can be applied directly to a dataset from the workbench or called from Java code. New classifiers, filters etc can be added through the GUI.
WEKA is written in Java and runs on platforms that support Java. It is available under the GNU Public License (GPL).