GRIDtoday Logo SUN

DAILY NEWS AND INFORMATION FOR THE GLOBAL GRID COMMUNITY / MARCH 10, 2003: VOL. 2 NO. 10

( Previous Article )   ( Table of Contents )   ( Next Article )

Systems/Enterprise:

A TURBOWORX INFORMATICS REVOLUTION
By Jeff Augen, PhD, President, TurboWorx

An Informatics Revolution from Within TurboWorx integrates disparate applications and heterogeneous computing resources into an accelerated pipelining and workflow solution without forcing businesses to change the way their IT professionals do their jobs. By Jeff Augen, PhD, President, TurboWorx

Almost every scientific, technical and research-intensive organization supports its own distinctive culture of IT personnel, practices and policies that governs how it goes about its business. That culture determines everything from hardware selection to whether the organization deploys commercial software or relies on its own resources to program and maintain Open Source Software (OSS) applications. Typically, the IT culture becomes so firmly embedded throughout an organization that it cannot be changed without serious disruption and expense.

This leads to an inherent conflict. Research and operations computing problems change and grow all the time. These mission-critical problems are complex- solutions often involve complex sequences of computational steps with large quantities of data passing from one step to the next. Further complicating the process is the heterogeneous nature of the data; most often derived from numerous different sources. In addition, the individual computational steps themselves are often compute intensive and very time-consuming. An IT culture must be able to respond rapidly and efficiently with high performance solutions to meet the demands posed by this new class of computing problems, or it will become a barrier to business success.

Current budgetary constraints only worsen the difficulties that businesses face in seeking to meet the multiplicity of computing demands. The conflict between an entrenched IT culture and the need for flexibility, responsiveness and growth in computing capability is one of the sharpest that any organization faces.

Previously, only revolutionary cultural change-accompanied by costly replacement of legacy systems-could break these bottlenecks. Today, however, grid and cluster computing networks present a breakthrough opportunity that allows virtually any organization or department within an enterprise to build its own high performance computing environment by evolving the IT culture instead of destroying it. Through innovative application integration tool sets and the use of automated workflow software such as that from TurboWorx, organizations can leverage existing computing resources and capabilities to achieve levels of computing performance that previously required massive investment in new hardware and staff. In most cases, the gains can be achieved while actually easing the burdens on software developers and system administrators.

Changing Computing without Changing Cultures

Advances in grid and cluster computing have made it possible to construct "virtual supercomputers" from networks of computing resources These new capabilities have played an important role in a wide variety of industries, but especially in the life sciences where compute demands continue to grow at a rate exceeding Moore's Law. Harnessing computing power into virtual supercomputers is not for the unschooled. It is difficult to achieve productive uses of clusters and grids without specialized knowledge. Such platforms are far too complex for effective use of ad hoc manual and OSS approaches to platform management or application optimization and tuning. To use these virtual supercomputers effectively most decisions must be made automatically by specialized software systems.

TurboWorx has stepped in to harness compute networks into virtual supercomputers that can be run and modified even by users with minimal programming skill. With our application integration and workflow pipelining software, users throughout an enterprise can now have unprecedented compute resources at their desktop workstation. For the first time it has become possible to meet all of an organization's IT needs by leveraging existing resources without extraordinary upheaval in its IT culture.

As a result, any business can make changes in its practices that measurably impact productivity, speed-to-market and job performance with minimal investment and, perhaps most importantly, without cultural disruption. Business organizations can undergo an informatics revolution from within.

Rethinking Open Source

The IT cultures of research-intensive organizations, especially those in the life sciences, reflect both the academic backgrounds of many bench scientists and the complexity of the compute tasks they face. These organizations encounter significant informatics challenges as the complexity and volume of experimental data needing to be processed and interpreted grows rapidly. As research projects change in scope and size, the applications and data formats required also change. The conventional approach is to write custom open source Perl programs designed to glue together applications and databases for individual projects. As soon as any aspect of the project changes, skilled programmers are required to modify and rewrite code.

I have personally been a longtime supporter of a number of open source initiatives. As Director of Strategy at IBM Life Sciences, I was an early advocate for Linux and campaigned aggressively to launch an open source initiative inside IBM as early as 1998. IBM derived great business benefit from that move, and the community of Linux users has experienced enormous growth as a result. Nowhere is the OSS community more active than in bioinformatics. For academic and government laboratories with unique, highly specialized application needs, OSS has allowed scientists to develop exactly the application programs they require for their research. IDC research estimates that 25 percent of all software used in the life sciences is open source.

While OSS has opened up the computationally intensive world of bioscientific research to laboratories with limited financial resources, it should be well understood that open source software is not free. Particularly when OSS is used as the core of the computing infrastructure, the costs of choosing OSS for harnessing the power of virtual supercomputers are often far greater than the costs of relying on commercial software. When working with computing grids and clusters linked into virtual supercomputers, the relative costs of relying on an OSS platform to automate and integrate applications into effective high performance workflows can dwarf any initial OSS price advantage. An organization must be ready to maintain and upgrade software that they did not develop and to rework it as their workflow needs change. This is a high-cost way to go about supporting mission-critical operations.

More significantly, the cumbersome and costly process of dealing with OSS can tax the IT resources of even the most well-endowed research facilities. The programming dexterity and efficiency of the IT staff, rather than the talents and capabilities of bench scientists, becomes the critical step in innovation. As a result, growing computing bottlenecks slow scientific and drug discovery or, in other industries, limit the productive use of computing power.

Automated application integration, pipelining and workflow software, such as ours, can significantly accelerate drug discovery, scientific advances and other business computing processes. The beauty of the software is that it allows users to leverage the power of OSS-with a simple drag-and-click, new OSS software modules that perform important computations can be added to any workflow without programming intervention. TurboWorx's platform will quickly pay for itself while those who rely solely on OSS and internal development for workflow management are likely to generate additional costs each time they modify a workflow, thwarting productivity and business-critical operations Finally, open source design methodologies are best suited to tasks that benefit from the simultaneous efforts of hundreds, even thousands of developers. The opposite is true of architectural-design initiatives such as our TurboBench platform. The development of mechanisms for application parallelization and integration does not fit well with open source methodologies because it does not scale well across large numbers of developers-these tasks are best left to small development teams with very specialized skills.

Breaking Computing Bottlenecks

TurboWorx is a best-of-breed software company specializing in application management tools and middleware for high-performance technical computing and workflows on clusters and computing grids. Designed by a small team of recognized experts in autoparallelization and application integration, our technologies and products represent 25 man-years of investment in their development. They address the key issues of application integration, workflow automation and parallel acceleration that must be handled in order to solve the new class of mission-critical technical computing problems found in many industries-including biotechnology, pharmaceutical, financial, geophysical and energy, automotive and manufacturing, and aerospace and defense industries.

To use clusters and computer grids as "virtual supercomputers" requires middleware and management tools that address three major tasks: resource management, data management and application management. Our TurboBench platform deals with all three of these business-critical functions in ways that require remarkably little training for new users or change in existing systems. It can be bolted on, embedded or coexist with any legacy system or set of applications, whether commercially developed or OSS.

The TurboBench platform integrates your chosen applications and data sets into computational workflows that exploit your heterogeneous network of PC workstations, servers, and rack-mounted blades without the need for complex, source-level programming. TurboBench completely automates workflow execution by handling all of the details of running distributed workflows on large quantities of data processed through multiple applications. Proven in leading life sciences laboratories around the world, TurboBench can optimize computational performance in any compute-intensive environment.

With its powerful graphical user interface, TurboBench provides users, including those with little programming skill, with an intuitive tool set so easy to use that a pilot system can be up-and-running in literally minutes. TurboBench's Java-based, XML integration can create automated workflows that take only a simple drag-and-click to modify. The system also includes a complete, easily mastered parallel programming interface that enables the creation of sophisticated parallel applications in any compute-intensive environment.

TurboWorx has partnered with the leading industry IT solution providers, including Dell, IBM, SUN and Apple, to assure smooth and simple integration as well as reliable service and superior support, so that any organization's IT resources can be leveraged to work at optimal performance levels now and in the future.

TurboBench Features

TurboBench software provides users with:

  • Easy-to-use Wizard to create simple building blocks called components.
  • Drop-and-drag feature to link components into workflows.
  • The capability to share, modify and debug components.
  • State-of-the-art scheduling capability for setting job priorities and running components on specific, particularly well-suited machines.
  • Plug-and-play scalability. As jobs require more computing horsepower, researchers simply add more computers to the network to improve performance.
  • Programs are automatically distributed among computers ("Workers") and executed in parallel.
  • Exceptional fault tolerance. If a component fails while executing a task, the task is automatically picked up by another Worker and executed.
  • Process monitoring to view and list job details and terminate jobs.
  • A single, virtual data repository that acts as a pointer for all data sources.
  • Tiered and controlled multi-user access. TurboBench supports different levels of user access: end-users to execute components and workflows, designers to design and execute workflows and components, and administrators to manage system setup and maintenance.

Web site: www.turboworx.com

( Top of Page )

( Previous Article )   ( Table of Contents )   ( Next Article )