GRIDtoday Logo Digipede

DAILY NEWS AND INFORMATION FOR THE GLOBAL GRID COMMUNITY /

   ( Table of Contents )   

Special Features:

CONCLUSION: 7 STEPS TO SUCCESSFUL GRID COMPUTING
By Brooklin J. Gore, Senior Fellow, Micron Technology Inc

This report along with two other new, groundbreaking Grid reports from IDC and the Economic Strategy Institute will be released at Gt'04 and included with conference registration. www.gridtoday.com/04/conference/index.html

This is the final installment of a three-part series derived from the report "Enterprise Grids For General Purpose Computing" by Brooklin J. Gore.


Web services, utility computing, .NET, CPU harvesting and distributed computing are just a few of the technologies that fall under the Grid computing umbrella. Gt04 -- a premiere enterprise Grid computing conference targeting industrial and commercial users -- will gather experts, and outline strategies and road maps for Grid deployment. For more information, visit www.gt04.com.

Grid computing is here!


Lack of developer trust and reluctance to change work to slow new technology deployment. Moreover, rightly so. If your Grid is not as robust and reliable as your traditional compute platforms, your Grid efforts will become untrusted. It is hard enough to get people to take a chance on something new, but when they do, you must ensure them a positive experience. The low hanging fruit, education and integration keys (5, 6 and 7) above come to play here. Start with developers and applications that can be flexible regarding uptime and use your experience with these to build a solid production class Grid with a well-integrated support model.

Understand thick client and master worker program models of software development. While many "off the shelf" programs will run on the Grid, in the grand scheme of things the Grid is a jungle. It can be a tangle of misconfigured machines, machines that lie about their capabilities and machines that just go away while running a job. Grid software does a great job of dealing with these challenges, but ultimately for maximum reliability and use of resources Grid programs should be able to handle a variety of error conditions they might not be used to. Further, Grid jobs must learn to be nice. Nobody worries about a single ant in his or her home -- it is the colony we worry about. Well, Grid jobs generally appear in colonies and as such need to behave. They run on borrowed hardware and shouldn't make a mess of it. Collectively they can overwhelm a database server and should learn when to wait to connect. These are just a few challenges of Grid programming. It is more difficult to get Grid developers to adopt this new way of thinking than for them to actually implement these new ideas. We reiterate keys 4 and 6 in articulating the value proposition and education.

Not many organizations have experience managing collections of ten thousand machines with hundreds of users and millions of jobs per day in a single environment. Will Grid software be able to ensure the right user's jobs are getting the resources they need at these orders of magnitude? By the end of 2004, we expect our largest collection to be right around 10,000 machines. This author has as much pioneering spirit as the next, but he is not excited to be the first guy to discover a new flesh-eating virus at the heart of the Amazon. Are you? Therefore, here's a challenge to the Grid Software folks: we're using this stuff in ways you may have never imagined and we love what we see -- please listen to our needs and be prepared to meet them.

These challenges can and will be overcome to yield great rewards. In August of 2003, over 96,000 hours of work were completed on the Grid. Assuming $1,000 PC equivalent systems performing work full time (720 hours a month), this works out to 133 machines -- that didn't have to be purchase. That's a $133,000 savings. Of course, additional value is provided by making decisions on better data that is delivered more quickly. Costs for training, pool administration and developer support have been negligible.

Next Steps

If you do not have an active Grid Computing program in your organization, it is time to start one. If you have a fledgling program, hopefully, some of the tips provided in this article will accelerate your success and that the application examples have provided new incentives for the broad applicability of your Grid. If your Grid program is a great success you've probably been smiling a lot while reading this article as I'm sure much of it is familiar ground to you by now.

We continue to grow our machine collections aggressively and at the same time work with an increasing number of developers who want to Grid-enable their software. We also continue spreading the value proposition of grid computing and educating administrators and developers worldwide. Most importantly, we are working with our Grid software supplier to ensure they are aware of our current usage and future plans so they may anticipate potential problems before they manifest themselves in a production environment.

Acknowledgements

The Author would like to thank the following people for their pioneering spirit and efforts to make our Grid Computing efforts at Micron a success: Mike Dorough, Sam Evans, Todd Evans, Eric Fletcher, Chin Le, Michael Lee, Tim Long, Joe Lundgren, Ed Mahoney, Bryan Mann, Melissa Norgard, Adriana Sanchez, Mike Steele, Mike Urizar, Doug Warner and Bridger Wood.

Grid Primer

A wide selection of Grid software is available from open source and commercial providers. Condor from the University of Wisconsin, Sun ONE Grid Engine from Sun Microsystems and LSF from Platform Computing are examples that span the open source to commercial range.

A well-deployed and functioning Grid provides the following benefits:

  • Lowers computing costs by more efficiently using existing resources.
  • Increases application reliability by dynamically directing jobs to available resources.
  • Improves decision making by delivering huge amounts of raw computing power.

About Brooklin J. Gore

Brooklin Gore has been researching and implementing enterprise grid technologies for the past three years to create Micron's global grid infrastructure which runs over 15 production applications today. Brooklin has been with Micron for 16 years. In that time he served as a product engineer, Computer Aided Design group manager, network manager and General Manager of Micron's Internet Services Division. Brooklin has been issued several US patents and is a Senior Member of the IEEE. He holds Bachelor of Science degrees in Computer Science and Electrical Engineering from the University of Idaho and a Masters of Science in Computer Science from the National Technological University.

( Top of Page )

   ( Table of Contents )