Supercomputer

From JimboWiki
Revision as of 20:23, 11 November 2009 by J (Talk | contribs) (New page: ==Purpose== I have a bunch of older machines. There's nothing wrong with them, and they're not really all that old. I have a need for a multi-CPU system for some development work and for...)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Purpose

I have a bunch of older machines. There's nothing wrong with them, and they're not really all that old. I have a need for a multi-CPU system for some development work and for running Virtual Machines. Cluster computing and parallel processing have always interested me, so why not build a supercomputer in my basement?

Choosing A Cluster Platform

There are quite a few open source cluster platforms. My requirements include:

  • Must be free (as in no monetary cost), preferably Open Source
  • Must not require any special compilers, procedures, or programming languages - I want to run programs and have them execute somewhere in the cloud automatically, and transparently.
  • Must support a graphical environment - I want to have a multi-monitor setup (starting with 2, maybe up to 6 at a later date) to fit all the programs I'm running in parallel and to impress the less technically savvy.
  • Preferably Linux based - since I know it, and it is no monetary cost
  • Preferably supports running virtual machines that can use cores from multiple cluster nodes simultaneously, or at least allows some way to specify that the virtual machine's process should consume all available resources on one cluster node

Kerrighed

OpenSSI

OpenMosix

LinuxPMI

PelicanHPC

Chronium

Sun Grid Engine

Implementation Plan

Stage 1: Proof of Concept

This stage will use minimal hardware (1 storage server, 2 cluster nodes, 10/100 networking) to demonstrate that the solution can meet the requirements set forward. An installation procedure will also be produced during this stage.

Stage 2: Basic Implementation

This stage will use expanded hardware (maximum number of cluster nodes available - probably about 6, Gigabit networking) to demonstrate the full potential of the cluster. This stage will start with a fresh install, using the procedure created in the Proof-of-concept.

Stage 3: Expanded Implementation

This stage will build on the implementation from Stage 2 - no new computing hardware will be added. The graphical environment will be installed and tested. Multi-monitor support will be added.

Stage 4: Performance Evaluation

Benchmark tests will be run to test the system's capability. If problems are found, they will be addressed and the tests will be run again.