SHARC Processor Benchmarks
Real-time signal processing tasks are I/O and computationally intensive. In addition to high-speed math units and all instructions executing in a single-cycle, including single-cycle multiply accumulates (MACs), SHARC Processors are designed for maximum I/O and memory access bandwidth. This balance of core speed, memory integration and I/O bandwidth achieves the sustained performance critical to real-time applications.
Benchmarks are important in that they show how a particular DSP performs in the context of an application. The smaller the benchmark number, the quicker the algorithm execution. If a DSP can perform the task quicker, the processor can perform more tasks in a given amount of time. Just looking at the cycle time, clock speed or MIPS of a DSP can not give an accurate indication of the true performance of the processor. Therefore it's important to analyze algorithm benchmarks, not only clock speed and cycle time.
| Clock Cycle |
100 MHz |
150 MHz |
200 MHz |
266 MHz |
333 MHz |
400 MHz |
450 MHz |
| Instruction Cycle Time |
10 ns |
6.67 ns |
5 ns |
3.75 ns |
3 ns |
2.5 ns |
2.22 ns |
| MFLOPS Sustained |
400 MFLOPS |
600 MFLOPS |
800 MFLOPS |
1064 MFLOPS |
1332 MFLOPS |
1600 MFLOPS |
1800 MFLOPS |
| MFLOPS Peak |
600 MFLOPS |
900 MFLOPS |
1200 MFLOPS |
1596 MFLOPS |
1998 MFLOPS |
2400 MFLOPS |
2700 MFLOPS |
1024 Point Complex FFT (Radix 4, with bit reversal) |
92 µs |
61.3 µs |
46 µs |
34.5 µs |
28 µs |
23 us |
20.44 µs |
| FIR Filter (per tap) |
5 ns |
3.3 ns |
2.5 ns |
1.88 ns |
1.5 ns |
1.25 ns |
1.11 ns |
| IIR Filter (per biquad) |
20 ns |
13.3 ns |
10 ns |
7.5 ns |
6 ns |
5 ns |
4.43 ns |
Matrix Multiply (pipelined)
[3x3] * [3x1]
[4x4] * [4x1] |
45 ns
80 ns |
30 ns
53.3 ns |
22.5 ns
40 ns |
16.91 ns
30.07 ns |
13.5 ns
24 ns |
11.25 ns
20 ns |
10.00 ns
17.78 ns |
| Divide (y/x) |
30 ns |
20 ns |
15 ns |
11.27 ns |
9 ns |
7.5 ns |
6.67 ns |
| Inverse Square Root |
45 ns |
30 ns |
22.5 ns |
16.91 ns |
13.5 ns |
11.25 ns |
10.00 ns |