How Do We Achieve High-Performance Computing?

Good Algorithms

Adaptive algorithms
Load balancing
Available parallelism (Amdahl's law)
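
Amdahl's law bounds the speedup that available parallelism can deliver: if a fraction p of the run time can be spread over n processors, the ideal speedup is 1 / ((1 - p) + p / n). A minimal sketch follows; the function name and the 90% workload are illustrative assumptions, not values from these notes.

```python
def amdahl_speedup(parallel_fraction: float, processors: int) -> float:
    """Ideal speedup when `parallel_fraction` of the run time scales across `processors`."""
    if not 0.0 <= parallel_fraction <= 1.0:
        raise ValueError("parallel_fraction must be in [0, 1]")
    if processors < 1:
        raise ValueError("processors must be at least 1")
    # The serial part of the work does not shrink as processors are added.
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / processors)


if __name__ == "__main__":
    # Hypothetical workload: 90% of the run time parallelizes.
    for n in (1, 10, 100, 1000):
        print(n, round(amdahl_speedup(0.9, n), 2))
```

With p = 0.9 the speedup can never exceed 1 / (1 - p) = 10, however many processors are added, which is why reducing the serial fraction matters as much as adding hardware.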

Fast computer architectures

Moore's law:

The number of transistors on a chip doubles roughly every 18 to 24 months
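
The doubling rule is exponential growth, N(t) = N0 * 2^(t / T), where T is the doubling period. The sketch below treats T as a parameter (18 months by default; 24 months is also commonly quoted). The function name and the starting count in the usage line are assumptions for illustration only.

```python
def projected_transistors(initial_count: float, months_elapsed: float,
                          doubling_period_months: float = 18.0) -> float:
    """Project a transistor count forward under a fixed doubling period."""
    if doubling_period_months <= 0:
        raise ValueError("doubling_period_months must be positive")
    # One doubling per elapsed period: N0 * 2^(t / T).
    return initial_count * 2.0 ** (months_elapsed / doubling_period_months)


if __name__ == "__main__":
    # Hypothetical chip with 1 million transistors, projected 15 years ahead.
    print(projected_transistors(1e6, 180))
```

Over 15 years (180 months) an 18-month period gives ten doublings, a factor of 2^10 = 1024.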

Limit of latency:

Speed of light ≈ 3 × 10^8 m/s
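
The speed of light puts a hard floor under latency: no signal can cross a given distance faster than distance / c. A rough sketch of what that means per clock cycle, using the rounded value above; the function names and the 3 GHz clock are illustrative assumptions.

```python
SPEED_OF_LIGHT_M_PER_S = 3e8  # rounded value, as quoted in the notes


def max_distance_per_cycle_m(clock_hz: float) -> float:
    """Farthest a signal could travel in one clock cycle, at the speed of light."""
    if clock_hz <= 0:
        raise ValueError("clock_hz must be positive")
    return SPEED_OF_LIGHT_M_PER_S / clock_hz


def min_round_trip_s(distance_m: float) -> float:
    """Lower bound on request/response latency across `distance_m` metres."""
    return 2.0 * distance_m / SPEED_OF_LIGHT_M_PER_S


if __name__ == "__main__":
    # Hypothetical 3 GHz clock.
    print(max_distance_per_cycle_m(3e9), "m per cycle")
```

At 3 GHz the bound is about 10 cm per cycle, and real electrical signals are slower than light in vacuum, so the practical reach is shorter still.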

Well Optimized Code

Locality
Independent Basic Blocks
Latency Tolerant
Efficient Compiler
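
Locality can be made concrete by looking at the order in which a loop nest touches memory. The sketch below assumes a 2-D array stored row-major in one flat buffer (an assumption; the notes do not fix a layout) and lists the offsets visited by a row-wise and a column-wise traversal. The helper names are hypothetical.

```python
def flat_offsets(rows: int, cols: int, order: str) -> list:
    """Offsets into a row-major buffer, in the order a traversal visits them."""
    if order == "row":
        # Inner loop over columns: consecutive accesses are adjacent in memory.
        return [r * cols + c for r in range(rows) for c in range(cols)]
    if order == "column":
        # Inner loop over rows: consecutive accesses jump by a whole row.
        return [r * cols + c for c in range(cols) for r in range(rows)]
    raise ValueError("order must be 'row' or 'column'")


def strides(offsets: list) -> list:
    """Distance between each access and the next one."""
    return [b - a for a, b in zip(offsets, offsets[1:])]


if __name__ == "__main__":
    for order in ("row", "column"):
        print(order, strides(flat_offsets(2, 3, order)))
```

The row-wise traversal moves through memory with stride 1, so it reuses each cache line it fetches; the column-wise traversal jumps by the row length, which on large arrays tends to miss the cache far more often.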

Parallelism

Pipeline
Superscalar
Explicit/Implicit
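
The benefit of pipelining can be estimated with a simple cycle count. The sketch assumes an idealized pipeline: every stage takes one cycle and there are no stalls or hazards, so k stages finish n instructions in k + n - 1 cycles instead of k * n. The function names and the 5-stage example are illustrative assumptions.

```python
def unpipelined_cycles(stages: int, instructions: int) -> int:
    """Cycles when each instruction passes through all stages before the next starts."""
    if stages < 1 or instructions < 1:
        raise ValueError("stages and instructions must be at least 1")
    return stages * instructions


def pipelined_cycles(stages: int, instructions: int) -> int:
    """Cycles for an ideal pipeline: fill it once, then retire one instruction per cycle."""
    if stages < 1 or instructions < 1:
        raise ValueError("stages and instructions must be at least 1")
    return stages + instructions - 1


def pipeline_speedup(stages: int, instructions: int) -> float:
    """Ratio of unpipelined to pipelined cycle counts."""
    return unpipelined_cycles(stages, instructions) / pipelined_cycles(stages, instructions)


if __name__ == "__main__":
    # Hypothetical 5-stage pipeline running 100 instructions.
    print(pipelined_cycles(5, 100), unpipelined_cycles(5, 100))
```

For long instruction streams the speedup approaches the number of stages; a superscalar design goes further by issuing more than one instruction per cycle, which this model does not cover.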