Purdue Plans a 1-Day Supercomputer "Barnraising"

Want to read Slashdot from your mobile device? Point it at m.slashdot.org and keep reading!

Purdue Plans a 1-Day Supercomputer "Barnraising" 97

Posted by timothy on Thursday May 01, 2008 @06:03PM from the bring-some-ethernet-and-some-lemonade dept.

An anonymous reader points out an article which says that "Purdue University says it will only need one day to install the largest supercomputer on a Big Ten campus. The so-called 'electronic barn-raising' will take place May 5 and involved more than 200 employees. The computer will be about the size of a semi trailer. Vice President for Information Technology at Purdue Gerry McCartney says it will be built in a single day to keep science and engineering researchers from facing a lengthy downtime." Another anonymous reader adds "To generate interest on campus, the organizers created a spoof movie trailer called 'Installation Day.'"

This discussion has been archived. No new comments can be posted.

Purdue Plans a 1-Day Supercomputer "Barnraising"

Load All Comments

Search 97 Comments Log In/Create an Account

Comments Filter:

Amish (Score:1)

by SBrach ( 1073190 ) writes:

The Amish are great "barn raisers" maybe they can help.
- Re: (Score:2)
  
  by mikael ( 484 ) writes:
  
  Are the builders of this system required to wear beards and black hats?
  
  I've seen the websites where the Amish organise barn-raising parties. It's quite impressive. The womenfolk make sandwiches and other light meals, while the menfolk completely construct and assemble the parts to make a three or four floor structure. Presumably they can construct a house in the same amount of time?
  - Re: (Score:2)
    
    by silas_moeckel ( 234313 ) writes:
    
    They can frame a house in about the same amount of time. There is a lot of work to get the foundation ready and to finish the outside. A normal 4-6 man crew can frame a 3k square foot house and get it weather tight in about a week. They do about the same in a day or two.
  - Amish use websites? (Score:1)
    
    by Ocker3 ( 1232550 ) writes:
    
    Wait, you're alleging the Amish(!) use a website to organise barn-raisings? Linky to prove?
    - Re: (Score:2)
      
      by mikael ( 484 ) writes:
      
      No, I've just visited photography web-pages dedicated to the craft of barn-raising:
      
      Amish Barn-Raising [amishphoto.com]
      
      A discussion [ittoolbox.com]
      
      I'm amazed that so many people can be coordinated in such a confined space. There's a new building being built on my local campus. At most there are never more than 10 workmen on site at any time, and even then, they are always working in separate areas, operating machinery (elevators, cranes, clamps for plate glass).
- - Re: (Score:2)
    
    by gumpish ( 682245 ) writes:
    
    Is it any shock?
    
    This site is written in fucking PERL for christ's sake.
    
    It's a wonder that it ever works.
    - Re: (Score:2)
      
      by DAldredge ( 2353 ) writes:
      
      Please go back to digg and leave the adults alone.
- Re: (Score:2)
  
  by MiKM ( 752717 ) writes:
  
  ...there's no Heaven, it's easy if you try?
Abe is the biggest cluster on a BigTen campus (Score:2)

by jpu8086 ( 682572 ) writes:

Biggest on Big10 campus is a lie.

The article lists BigRed at Indiana (#43 on Top500) based on a technicality. But even the technicality is incorrect. The ABE cluster at NCSA@UIUC (#14 on Top500) is literally on the UIUC campus.

I doubt the Purdue one will beat Abe on the Top500 list.
- Re: (Score:2, Informative)
  
  by navygeek ( 1044768 ) writes:
  
  Someone need to go back and read (re-read?) the article. It says ABE is the biggest on a Big Ten campus. Purdue's will be the largest not connected to a national center. A semantic? Maybe, but it doesn't invalidate the claim.
- Re: (Score:1)
  
  by navygeek ( 1044768 ) writes:
  
  Purdue does have (at least) one very cool thing UIUC doesn't have - an operational nuclear reactor. Sure, sure UIUC may still have the facilities, but it's under a decommission order and will be shut down soon.
Not Funny (Score:1)

by IKILLEDTROTSKY ( 1197753 ) writes:

Making fun of the Amish on the internet is like mooning a blind guy.
- - Re: (Score:2)
    
    by TheLinuxSRC ( 683475 ) * writes:
    
    Well... farts smell so that deaf people can enjoy them as well; I am not sure what that has to do with a blind person though.
- pfft... ever hear of mopery? (Score:2)
  
  by way2trivial ( 601132 ) writes:
  
  http://www.imdb.com/title/tt0088000/quotes [imdb.com]
  
  turn in yer nerd ID please....
  
  if the crime has a name, someone thought of it....
- Re: (Score:2)
  
  by Gat0r30y ( 957941 ) writes:
  
  Dude, that's what makes it funny. No harm no foul, and we all get a good chuckle
Dumb (Score:4, Insightful)

by Spazmania ( 174582 ) writes: on Thursday May 01, 2008 @06:24PM (#23268962) Homepage

built in a single day to keep science and engineering researchers from facing a lengthy downtime

Sounds like poor planning to me. The correct way to keep science and engineering researchers from facing a lengthy downtime: don't turn off the old computer until the new one is running and tested.

Share
twitter facebook
- Re: (Score:1)
  
  by maxume ( 22995 ) writes:
  
  I'm sure if you built them a redundant building with proper environmental control systems (that is, cooling), that they would be happy to keep everything online while they are putting in the new one.
- - Re: (Score:2)
    
    by Spazmania ( 174582 ) writes:
    
    So what you're saying is that it's poorly planned AND underfunded?
    - Re: (Score:2)
      
      by slimjim8094 ( 941042 ) writes:
      
      Because every university is going to need a dedicated room to replace their fucking supercomputer every 10 years...
      
      Chill out - in this case, it's easier than any other options. Plus it got them on Slashdot.
      - Re: (Score:2)
        
        by Spazmania ( 174582 ) writes:
        
        I'm just sayin': it looks to me like a primo example of "work harder not smarter." There are other ways this could have been done than by having 200 folks play rack-and-stack at the same time. The breakage from this is gonna be out of sight.
        
        Re: (Score:1)
        
        by gregsv ( 631463 ) writes:
        
        There are other ways this could have been done than by having 200 folks play rack-and-stack at the same time.
        How, exactly? If the old clusters being replaced were taking all available space, power, and/or cooling in the data center, that makes it pretty hard to build a new cluster without first turning them off. And new data centers are not exactly cheap.
        
        Re: (Score:2)
        
        by Spazmania ( 174582 ) writes:
        
        If they're out of physical space (not just power and cooling) then the facility is way oversubscribed and they'll tend to suffer failures as a result. They should have taken some of the money spent on the machine and used it to improve the facility.
        
        If they're not out of physical space then they could have built the cabinets ahead of time, powered them up one at a time to verify correct cabling, hardware operation and software installation and then rolled them off into a corner. On the cutover day you'd then
        
        Re: (Score:1)
        
        by schmeckendeugler ( 864881 ) writes:
        
        Jesus christ, you're just full of easy answers, aren't you? Well, why don't you come on over to our campus and just fix everything right up? I'll get you an interview with the CIO right away!! You can convince the board of directors that our data center needs more space, cooling, and millions of dollars spent on it (in addition to the millions we've already spent)! Obviously nobody has thought of THAT, yet! Freaking Moron.
        
        Re: (Score:2)
        
        by Spazmania ( 174582 ) writes:
        
        Full != Oversubscribed.
        
        Yes, it does. You're supposed to have about a 20% reserve slack on space and power cabling, and the "+1" of n+1 reserve for battery, hvac and generator systems. That covers instant failures of those systems, but it also covers maneuvering room when its time to upgrade.
        
        Re: (Score:2)
        
        by poot_rootbeer ( 188613 ) writes:
        
        If they're out of physical space (not just power and cooling) then the facility is way oversubscribed
        
        Assuming that the new supercomputer has the same space needs as the one it's replacing, what you're telling us is that any machine room running at more than 50% volumetric capacity is oversubscribed. I can't agree with that.
        
        Re: (Score:1)
        
        by puriots0 ( 173203 ) writes:
        
        Skip the raised floor and other stuff that isn't critical, shop carefully and buy the battery system used and you can put it together for well under $100k.
        The cost of the additional building power transformers and switchgear *alone* would cost well over $100k, for the amount of power necessary. Add to that at least 85 tons of air conditioning necessary to cool the cluster (assuming 100% effiency, and no redundancy). At about $40k for each 30ton Liebert chilled/potable water DX unit, that's another $120k...
        
        Also, if you think that we can afford to put either a UPS or generator backup under this whole mess, you don't seem to understand what kind of minimal fu
        
        Re: (Score:2)
        
        by Spazmania ( 174582 ) writes:
        
        Add to that at least 85 tons of air conditioning
        
        85 tons? So you're budgeting 350 watts of actual (not max or rated) draw per node plus an extra 15kw overhead? That sounds a little high to me.
        
        Even if you're right on target, it's all in the shopping. If you go brand new, top of the line Lieberts, raised floor and all the frills you'll chew through half a mil easy. On the other hand, I once bought a nice redundant 3+3 ton data center A/C unit for $50 at auction. Datacenter A/C units sell for dirt at auctions b
        
        Re: (Score:1)
        
        by tinkergeek ( 800871 ) writes:
        
        Wow, you have seriously no idea about High Performance Computing do you?! First off, even the distance between buildings increases the latency of the network beyond what is useful for multi-node programs. Yes our campus has a wonderful fiber plant. No, it can't be used for HPC. Speed of light issues dude. Next, keeping batteries under a cluster at full load is not very doable. Each rack pulls 13+kW, that's a ton o' battery. (Not to mention you also *need* to keep the cooling going during this time as well
      - Re: (Score:1)
        
        by kyle6477 ( 1174231 ) * writes:
        
        Because every university is going to need a dedicated room to replace their fucking supercomputer every 10 years... Chill out - in this case, it's easier than any other options. Plus it got them on Slashdot.
        Who really needs to chill out here?
- Re: (Score:2)
  
  by Corbets ( 169101 ) writes:
  
  Sure. If you have lots of space, enough resources to cover the cost of maintaining dual systems, etc. etc. etc.
  
  Sounds to me like you've never had to upgrade servers in an already overloaded data center. ;-)
  - Re: (Score:2)
    
    by Spazmania ( 174582 ) writes:
    
    Sounds to me like you've never had to upgrade servers in an already overloaded data center. ;-)
    
    Sure I have. I solved the problem by moving to a data center that wasn't overloaded.
    
    When you're installing that expensive a piece of hardware, you don't try to fit it to the environment; you fit the environment to it.
- Re: (Score:1)
  
  by schmeckendeugler ( 864881 ) writes:
  
  I am a participant in this event. Spazmania, that is the usual modus operandi; however, there is limited space, and there is nowhere to put one thousand rack mounted mahcines while the previous 75+ RACKS are emptied. Secondly, all users are QUITE aware, I'm sure, that their jobs are being temporarily placed on hold so that they can install this cluster. Sometimes, my friend, out here on the bleeding edge, exceptions must be made.
- Re: (Score:1)
  
  by mscman ( 1102471 ) writes:
  
  Well if we had the space, we would have. Unfortunately, our data center is rather small and fairly stressed on cooling and power. This was the only way possible to fit the new cluster in.
  - Re: (Score:2)
    
    by Spazmania ( 174582 ) writes:
    
    Like I said elsewhere in the thread: you'd have been better off sacrificing a few machines in the cluster and spending the money improving the space instead. Reliable computing starts with reliable infrastructure. If you're running that close to the edge then you don't have reliable infrastructure.
    - Re: (Score:1)
      
      by mscman ( 1102471 ) writes:
      
      And as others have said elsewhere, if you would like to come convince Purdue's Board of Trustees along with our CIO to give us that money, we would be happy to. The reality is that everyone wants better computers, but can't afford a new facility. If you're interested in making a large donation... :)
      - Re: (Score:2)
        
        by Spazmania ( 174582 ) writes:
        
        if you would like to come convince Purdue's Board of Trustees along with our CIO to give us that money, we would be happy to
        
        No thanks. I was in my element at the DNC, but university politics are deadly. ;)
    - Re: (Score:1)
      
      by puriots0 ( 173203 ) writes:
      
      First, the amount of time wasted by trying to do this incrementally would be a much bigger hassle than doing this all at once. The last cluster we built a rack at a time was 512 nodes (16 racks plus one network rack), and required about 2 months to construct. Even if we could achieve something like a zero-downtime switchover, the months it would take to assemble the systems and test them out incrementally would be completely unacceptable to our customers.
      
      With the 1 week downtime, we were able to clean out
Thin on details (Score:2, Interesting)

by NotBornYesterday ( 1093817 ) writes:

TFA mentioned the Dell 2*quad Xeon hardware, but failed to mention what kind of storage will be attached to it, what kind of network(s) they plan to use to rope it all together, what OS & filesystem they plan to use, & other stuff that would be fun to know.

If they don't tell us what they're using, how can we have flame wars over whose technology really should have been used in it? We'll be stuck with nothing to do but make up bad car analogies.

It would be like, "GM is announcing a barnraising
- Re: (Score:1)
  
  by pathological liar ( 659969 ) writes:
  
  You can make a pretty good guess at interconnect based on the cost (if it's there, I don't care enough to read the article) ... remember to add a factor of 2 or 3 to the price to account for the edu discount...
- Re: (Score:2)
  
  by Digi-John ( 692918 ) writes:
  
  I'd say it's highly likely that the interconnect will be Infiniband. As for storage... when you get that big, I think there are generally "service nodes" that connect to the storage systems on behalf of the compute nodes; I'm just gonna go out on a limb and say they'll use either Lustre or NFS. I wish there was more information somewhere...
  - Re: (Score:2)
    
    by SanityInAnarchy ( 655584 ) writes:
    
    I'd guess either Lustre or gFarm -- I really don't see NFS working. Maybe NFS4 does more than I think it does?
    - Re: (Score:2)
      
      by Troy Baer ( 1395 ) writes:
      
      NFSv3 can scale this big for home directories if you spread the namespace and load across several beefy servers, especially if you also train your users to stage data in and out of parallel file systems (GPFS, Lustre, PVFS, etc.) and/or node-local file systems for I/O intensive jobs. There's no "silver bullet" file system that does everything well*, and there's no shame in using multiple file systems for different parts of your workload where they will work well.
      - Re: (Score:2)
        
        by SanityInAnarchy ( 655584 ) writes:
        
        I thought we were talking about a giant supercomputer, though -- I don't think we're talking about home directories.
        
        Also, what's the footnote on your "no silver bullet" line?
        
        Re: (Score:2)
        
        by Troy Baer ( 1395 ) writes:
        
        Er, supercomputers do have home directories, or at least rationally administered ones do.
        The footnote I'd intended to put in was "There are, however, several file systems that do everything poorly", but I figured I'd be in trouble with several vendors if I gave specific examples...
        
        Re: (Score:2)
        
        by petermgreen ( 876956 ) writes:
        
        I thought we were talking about a giant supercomputer, though -- I don't think we're talking about home directories.
        Well users of the supercomputer need somewhere to keep files that their jobs on the supercomputer will need to access. Sure you could use the users central campus home directories but that is likely to be bad for performance and may also cause other issues (for example some universities are pretty tight when it comes to quotas for central storage).
        
        Re: (Score:2)
        
        by SanityInAnarchy ( 655584 ) writes:
        
        Fair enough. I'm not so much debating that there will be any home directories...
        
        I'm suggesting that the NFS solution may well work for central campus home directories, but looks like it would not work well at all for the kinds of files you'd be dealing with on a supercomputer.
  - Re: (Score:1)
    
    by gregsv ( 631463 ) writes:
    
    A subsection of the cluster will have Infiniband interconnect. Most nodes will be GigE connected. Storage will be NFS, served from several very high end dedicated NFS servers. The cluster will run RedHat Enterprise Linux.
- Re: (Score:1)
  
  by tinkergeek ( 800871 ) writes:
  
  The majority of the cluster will use Ethernet for the networking. (All machines will connect to the *same* switch.) A small number will use our existing Infiniband infrastructure. The machines will all have a single 160GB SATA disk, formatted with ext3, and run RHEL4. Half of the cluster will have 16GB of RAM and the other 32GB of RAM. All clusters are served networked storage over NFSv3 from four BlueArc NAS devices.
- Re: (Score:1)
  
  by puriots0 ( 173203 ) writes:
  
  The storage will be provided by our already-in-place BlueArc Titan 2200 and 2500 systems. Both are two-head clusters with a few racks of disk behind each, the 2200 with 4Gb of uplink per head for home directory storage, and the 2500s each with 10Gb of uplink for scratch (high-speed) storage. They export their filesystems using NFS v3. We also provide an archival storage system using EMC's DXUL and a tape library that has a capacity of a couple of petabytes. We've tested the BlueArc Titans (which are FPG
  - Re: (Score:1)
    
    by NotBornYesterday ( 1093817 ) writes:
    
    Whoa. Cool.
    
    Thanks for the info, all of you. Enjoy your new toy. How's it running so far?
- Re: (Score:1)
  
  by siphonophore ( 158996 ) writes:
  
  Perhaps you're referring to our Biofuels Conference [purdue.edu]?
- Re: (Score:1)
  
  by navygeek ( 1044768 ) writes:
  
  Awww... Sounds like someone didn't get into their first choice college or got kicked on a weed-out class. ;-)
Hey editors (Score:1)

by pathological liar ( 659969 ) writes:

"The so-called 'electronic barn-raising' will take place May 5 and involved more than 200 employees."

Either the date or the tense is wrong.
- Re: (Score:2)
  
  by Gat0r30y ( 957941 ) writes:
  
  Free tequila and beers with limes will be offered as compensation upon completion. How else could you motivate 200 college students?
  - Re: (Score:2)
    
    by Vancorps ( 746090 ) writes:
    
    You mean how else could you motivate 200 college students on Cinco de Mayo!
  - Re: (Score:2)
    
    by halcyon1234 ( 834388 ) writes:
    
    ...200 college students?
    
    Which one of you fuckers if fake-lifting?!?
- Re: (Score:1)
  
  by oodaloop ( 1229816 ) writes:
  
  Either the date or the tense is wrong.
  Or it was in a previous year.
Oblig. Simpsons (Score:2)

by B3ryllium ( 571199 ) writes:

'tis a fine pool, english, but a supercomputer it ain't.
- Re: (Score:1)
  
  by jimrob ( 1092327 ) writes:
  
  (nitpick)
  
  Actually, since the referenced line is to the effect of, "'Tis a fine barn, English, but 'tis no pool." the line should read:
  
  "'Tis a fine barn, English, but 'tis no supercomputer."
  
  (/nitpick)
  - Re: (Score:2)
    
    by B3ryllium ( 571199 ) writes:
    
    I think you mean "obligatory nitpick".
SySadmin + Cinco de Miyo = Bad News (Score:2)

by Picass0 ( 147474 ) writes:

Whay a crappy day to pick for such a big job.
- - Re: (Score:2)
    
    by gatzke ( 2977 ) writes:
    
    I spent a Cinqo de Mayo in West Lafayette a few years back... We hit the Taco bell after a few rounds of margaritas. Good times (for Indiana)
  - Re: (Score:2)
    
    by catdevnull ( 531283 ) writes:
    
    Cinco de Mayo--it's like the Mexican version of St. Patrick's Day. Evidently, there was a big military victory at Puebla in the mid 1800s and the date stuck. Now it's more of an Americanized holiday/celebration of culture. It's not like the 4th of July--Mexican Independence day is in September.
    
    It's really big in TX, NM, AZ, and CA. Here in Texas, it's a pretty big deal--but mostly for the Tex-Mex restaurants and the cultural nazis.
    
    Gringos usually celebrate by eating "Mexican" food (i.e., something with chee
Moderation broken? (Score:1, Offtopic)

by slashflood ( 697891 ) writes:

Is it me or is/was the moderation system broken? At least all of the comments in the earlier story about SCO were unmoderated.
Downtime (Score:2)

by eap ( 91469 ) writes:

"...it will be built in a single day to keep science and engineering researchers from facing a lengthy downtime."
I'm afraid the damage has already been done on the downtime front, since it has not existed up until this point in time.
Just imangine (Score:1)

by ThePawArmy ( 952965 ) writes:

a beowulf cluster of these...
Grammar Nazi says: (Score:2, Insightful)

by siphonophore ( 158996 ) writes:

The singular of "megaflops" is not "megaflop".

//pet peeve
Which raises the question... (Score:1)

by R2.0 ( 532027 ) writes:

Now that there will be a working Babbage engine around, can the Amish use it?
- Re: (Score:2)
  
  by justinlee37 ( 993373 ) writes:
  
  They could just use a Curta [wikipedia.org].
Loose defenition (Score:1)

by twistah ( 194990 ) writes:

When did we start calling clusters "supercomputers"?
Time off from my normal duties. (Score:1)

by westerman ( 1283038 ) writes:

As a Purdue IT employee but not one associated with central IT (who are creating the cluster) I am approaching this as a way to take part of the day off to rub shoulders with other geeks. It should be fun and maybe even informative. They are even providing a probably poor but at least free lunch. :-) As for what central IT gets aside from a bunch of free skilled labor is good publicity and a sense of community. The borg gets to inspect its potential minions.
Having installed Supercomputers... (Score:1)

by vil3nr0b ( 930195 ) writes:

I know 200 people is going to be a disaster. Can you guarantee me Jim Stoner and his buddies can assemble rail kits or anything else? Wait until they get to the infiniband cabling. One bend in that cable at 100 dollars a foot will cause all kinds of problems for the budget. No Thanks. Instead give me a software engineer and four hardware techs three days to do it properly. I guarantee you at least 195 of these folks have never installed one and the concept scares me.
- Re: (Score:2)
  
  by tuomoks ( 246421 ) writes:
  
  Having installed computer centers.. It is scary, it is not so much that a bend cable (fiber?) cost but do you have a spare? Are you sure that the electricity is connect correctly? Will the cooling work? Hopefully it doesn't include the fire extinguisher system installation - a scary thought! Did you remember to secure the raised floor? Did you label everything - correctly? And so on..
  
  We used to do that over weekends in very large installations but it was scary every time. Too many things which could have go

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Amish (Score:1)

Re: (Score:2)

Re: (Score:2)

Amish use websites? (Score:1)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Abe is the biggest cluster on a BigTen campus (Score:2)

Re: (Score:2, Informative)

Re: (Score:1)

Not Funny (Score:1)

Re: (Score:2)

pfft... ever hear of mopery? (Score:2)

Re: (Score:2)

Dumb (Score:4, Insightful)

Re: (Score:1)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:1)

Re: (Score:2)

Re: (Score:1)

Re: (Score:2)

Re: (Score:2)

Re: (Score:1)

Re: (Score:2)

Re: (Score:1)

Re: (Score:1)

Re: (Score:2)

Re: (Score:2)

Re: (Score:1)

Re: (Score:1)

Re: (Score:2)

Re: (Score:1)

Re: (Score:2)

Re: (Score:1)

Thin on details (Score:2, Interesting)

Re: (Score:1)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:1)

Re: (Score:1)

Re: (Score:1)

Re: (Score:1)

Re: (Score:1)

Re: (Score:1)

Hey editors (Score:1)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:1)

Oblig. Simpsons (Score:2)

Re: (Score:1)

Re: (Score:2)

SySadmin + Cinco de Miyo = Bad News (Score:2)

Re: (Score:2)

Re: (Score:2)

Moderation broken? (Score:1, Offtopic)

Downtime (Score:2)

Just imangine (Score:1)

Grammar Nazi says: (Score:2, Insightful)

Which raises the question... (Score:1)

Re: (Score:2)

Loose defenition (Score:1)

Time off from my normal duties. (Score:1)

Having installed Supercomputers... (Score:1)

Re: (Score:2)

Related Links Top of the: day, week, month.

Slashdot Top Deals