GRIDtoday Logo UD

DAILY NEWS AND INFORMATION FOR THE GLOBAL GRID COMMUNITY / APRIL 21, 2003: VOL. 2 NO. 16

( Previous Article )   ( Table of Contents )   ( Next Article )

Special Features:

GRID DELIVERY: A NEW TECHNIQUE FOR CONTENT IN THE ENTERPRISE
By Daniel Sapir, Bandwiz, Inc.

1. The Need

Efficient and rapid delivery of information of all sorts to employees, distributors, and corporate partners that can be anywhere in the world is essential to meeting business goals. Employees, partners and distributors need to be trained on products, especially at the time of new product launches. Support personnel need to get detailed training on products if they are expected to effectively support these products. New regulations need to be distributed to make sure employees know about them, e.g. sexual harassment policies and OSHA safety regulations. Organizational goals need to be clearly distributed to all employees to make sure the organization is moving together as a team to meet corporate goals.

No organization can be successful with factions moving in different directions. One Texas CEO, upon being asked what his toughest challenge was, replied "keeping the herd moving approximately west."

Customers also need to be courted to gain loyalty and repeat sales. The way to do that today is largely by electronic delivery of news, market programs, and other material as well as with personal contact.

To be effective, the information content that is delivered is getting richer, is updated more frequently, and is sent to ever larger numbers of recipients. However, rich content means larger sizes. Our computers can usually cope as memory available on PCs is growing rapidly and PCs typically have much underutilized memory.

However, enterprise networks are usually not up to the task. If any two of the characteristics above are true (i.e. large content plus frequent delivery or frequent delivery to large numbers of recipients, or large content delivery to large numbers of recipients even if infrequent), there will be a content delivery problem in that organization.

At busy times of the day, the corporate network is often congested leading to long delays. Corporate IT personnel put in place policies that forbid downloading large content in attempts to prevent congestion in the corporate network. Upgrades to the network are generally out of the question; budgets are flat to down and IT staff is stretched thin.

2. Solutions

There are three basic approaches that can be taken to improve this situation:

1 Upgrade the links with more bandwidth – this is generally too expensive and is virtually never done.

2. Adapt an Internet Content Delivery Network (CDN) solution to the enterprise by adding proxy caches to store content at the edges closer to consumers. This has merit as it trades off bandwidth for storage; a good tradeoff given storage is about 100X cheaper than bandwidth. But, this approach was designed for the Internet environment, which is very different than the corporate Intranet.

3. Use a "Grid Delivery"* approach, a new technique specifically targeted at providing solutions for the enterprise.

The CDN and Grid Delivery approaches are discussed in more detail.

2.1 The Internet CDN Approach (using proxy caches)

Before discussing the use of proxy caches in the enterprise, it is worth examining the differences between the public Internet and the corporate Intranet. Proxy caches, the primary component used in Internet Content Delivery Networks, were designed for the Internet environment to speed up Web experiences.

The table below lists some of the primary differences between the Internet and corporate intranets:

  • Corporate Intranet
    • Low bandwidth WAN links
    • High bandwidth local networks
    • One or a small number of domains
    • Centralized control and management
    • Security and authentication is important
  • Public Intranet
    • High bandwidth backbones
    • Low bandwidth last mile links
    • Thousands of domains
    • Distributed control and management
    • Security and authentication only needed in some applications, e.g. transactions

Content Delivery Networks basically speed up Web traffic by trading off storage for bandwidth by storing popular Web content "closer" to the user in a proxy cache. Closer, in Internet terms, means within the same domain since the largest sources of congestion in the Internet aside from last mile constrictions are the junction points between domains. It is not unusual for traffic to traverse many domains to reach the recipient.

A mechanism is put into place (usually using the WCCP protocol in the local router) that redirects http and ftp requests from browser users to the local proxy cache. If the local cache has the content requested, it delivers it to the requestor. If it does not have it, it fetches it either from another cache further up in a hierarchy of caches or from the origin. It delivers it to the requestor and stores it locally so that it is immediately available to the next requestor.

Although cache capabilities have been upgraded to support streaming media and to be able to pre-populate them before a request is made for the content, the fact remains that they operate basically on the pull paradigm, i.e. content is stored at the edge based on a request from a consumer located at that edge. Additionally, they were designed for Web traffic and other content is dealt with as an afterthought if at all. Finally, there is no centralized management; they operate autonomously, a requirement for operation in the Internet.

However, proxy caches can be positioned within an enterprise at remote offices to speed up content delivery to users of the corporate portal at that site. However, this is a limited solution for the enterprise that is also expensive, as each site needs to add a proxy cache, a new piece of hardware. A summary of these limitations is shown below:

1. Proxy cache hardware costs about $2000-$5000 each, prohibitive when there are thousands of remote sites.

2. Addition of remote hardware adds support costs in the field, very undesirable when enterprise support personnel are typically overworked and centrally located.

3. Caches by nature store content in the clear open to all. Enterprises often want to secure the content.

4. Caches are autonomous devices, whereas the enterprise desires centralized management and control of remote devices.

5. Enterprises want a "push" solution, where content can be sent to recipients without requiring a request for content from the consumer. They can do this to some extent with e-mail attachments but content is getting too large and another way is desired. Caches do not address this need at all.

6. Enterprises often require usage reporting, i.e. reports of content actually received and opened. This is also not addressed by caches.

There is a fundamental difference between content delivery in the Internet and in the corporate Intranet. Consumers go to the Internet to find content of interest making it inherently a "pull" model, whereas in the corporate intranet, the supervisor of employees or the corporation may wish to "push" content to their employees for use in their work.

2.2 Grid Delivery

Enter "Grid Delivery", a new system first coined by the Gartner Group that is designed specifically to address the needs of the Enterprise. The term "Grid Delivery" is used since many desktop and possibly even laptop computers cooperate to perform the delivery in similar fashion as many computers can cooperate to perform "grid computing" tasks.

The intent of a grid delivery system is to utilize network and computing resources already present in the enterprise more efficiently. Enterprise networks only become congested at busy hours leaving other times with capacity. Additionally, there are computing and storage resources in the desktop and laptop computers throughout the organization that can be tapped. The key trick is to be able smooth out traffic on the network and tap into the underutilized computing power and storage available in desktop and laptop computers.

The key technology is called "mediated peer-peer". Desktop and laptop computers at remote sites are loaded with a piece of software that operates in the background and is used to provide four key functions:

1. Application layer routing forwarders. Efficient one-many delivery (application layer multicast) can be achieved without having to touch networking devices in the network, while at the same time gaining the efficiencies promised by network layer multicast routing.

2. Delayed time forwarding, to optimally utilize low network usage times to deliver content over the corporate intranet.

3. Caching of content on multiple desktop/laptop computers locally, to provide rapid delivery of content to local consumers of the content.

4. Reporting of content delivery and usage.

Additionally, since content may reside on desktop/laptop computers and may be confidential, security is an inherent requirement of the system. By default, all content is encrypted during delivery and when under local storage.

Grid Delivery is also inherently cost effective and support friendly since no new hardware is deployed in the field, only small software agents that reside on the organizations’ desktop and laptop computers. This software is all managed centrally.

A basic system would work with one remote site. Although there are many client computers at the remote site, content is only delivered once to that site from the central site. Within that site, application layer forwarders distribute the content to the other computers at the site. In some cases, all the computers may receive the content; in others, only a few and if a computer without the content wants it, it retrieves if from one or more of the local clients. Sometimes, there are downstream sites; in this case, the application layer forwarders send it off to the satellite sites, again with only one transmission per wide area link.

The remote peers also act as a distributed storage cache for the site, providing local (read rapid) access for content of interest to consumers at that site. Also, since content is often delivered to the actual desktop, reporting of delivery and usage by the consumer can be accomplished, important for enterprises.

A grid delivery system also has the flexibility to interface to applications already operating in the enterprise, removing any need for training by users of the system.

Some common applications are the following:

1. Pre-loading of content posted on the corporate portal (internal web site). In this application, content that management wants to deliver to its employees, partners, or customers is pre-downloaded to the site(s). When the consumer goes to the corporate portal to get it after notification of its availability, he is re-directed locally and fetches it locally providing rapid response.

2. Offloading of e-mail attachments that cannot be handled by the corporate e- mail system. Attachments not able to be carried by e-mail are stripped from the e-mail system and delivered by the grid delivery system; the result is the user can use a system for push delivery that he is very familiar with.

3. Secure delivery of sensitive content. By default, all content delivered is encrypted providing secure delivery of sensitive information.

4. Local caching of Web content without need for additional hardware. Desktop and laptop computers have a browser cache in each computer; by communicating with other local computers to discover content residing on their local browser cache, a virtual cache server for the site is created without the need for additional hardware.

5. Simulated live multimedia streaming media to the desktop. Streaming media sourced at corporate headquarters is usually not feasible due to the constriction of the wide area network links. With simulated live, the media is delivered as large files at times the network can support it and then is streamed locally over the high bandwidth local network at the multiple sites. This can be scheduled for a time to start the streaming, hence the term "simulated live".

3. Conclusions

"Grid Delivery" is a new type of system for content delivery focused on the needs of the enterprise. It is a flexible system that utilizes the network and computing resources already present in the enterprise much more efficiently, extremely important in this age of tight IT budgets and staffing. It solves all of the content delivery needs of the enterprise and is centrally managed and controlled, which is important for enterprise IT staffs. Finally, it can be used in both intranets and extranets, so that content can be delivered to partners and customers as well as to internal employees.

Expect Grid Delivery systems to enter the mainstream in the near future. Its benefits to the enterprise are too high for it to be ignored.

About the Author:

Daniel Sapir is VP of business development at Bandwiz (www.bandwiz.com), a knowledge logistics software company focusing on the distribution and management of content throughout extended enterprise environments. He was formerly a vice president at Ascential Software, a provider of enterprise data integration solutions. Previously, he founded iCentric Solutions, Inc., an eBusiness integrator. Sapir established and served as vice president of the enterprise solutions business unit of Data General Corporation. Sapir has also served as a director of sales for McDonnell Douglas Information Systems and Allerion Corporation, as well as product manager at Pertec Computer Corporation and engineering/QA project leader for MIS Data Corporation. Reach him at DSapir@bandwiz.com.

( Top of Page )

( Previous Article )   ( Table of Contents )   ( Next Article )