Processor register

In computer architecture, a processor register is a small amount of storage available as part of a digital processor, such as a central processing unit (CPU). Such registers are typically addressed by mechanisms other than main memory and can be accessed faster. Almost all computers, load-store architecture or not, load data from a larger memory into registers where it is used for arithmetic, manipulated or tested by machine instructions. Manipulated data is then often stored back into main memory, either by the same instruction or a subsequent one. Modern processors use either static or dynamic RAM as main memory, with the latter usually accessed via one or more cache levels.

Processor registers are normally at the top of the memory hierarchy, and provide the fastest way to access data. The term normally refers only to the group of registers that are directly encoded as part of an instruction, as defined by the instruction set. However, modern high-performance CPUs often have duplicates of these "architectural registers" in order to improve performance via register renaming, allowing parallel and speculative execution. Modern x86 design acquired these techniques around 1995 with the releases of Pentium Pro, Cyrix 6x86, Nx586, and AMD K5.

A common property of computer programs is locality of reference, which refers to accessing the same values repeatedly and holding frequently used values in registers to improve performance; this is what makes fast registers and caches meaningful. Allocating frequently used variables to registers can be critical to a program's performance; this register allocation is performed either by a compiler in the code generation phase, or manually by an assembly language programmer.

Categories of registers

Registers are normally measured by the number of bits they can hold, for example, an "8-bit register" or a "32-bit register". A processor often contains several kinds of registers, that can be classified according to their content or instructions that operate on them:

Hardware registers are similar, but occur outside CPUs.

In some architectures, such as SPARC and MIPS, the first or last register in the integer register file is a pseudo-register in a way that it is hardwired to always return zero when read (mostly to simplify indexing modes), and it cannot be overwritten. In Alpha this is also done for the floating-point register file. As a result of this, register files are commonly quoted as having one register more than how many of them are actually usable; for example, 32 registers are quoted when only 31 of them fit within the above definition of a register.

Examples

The table shows the number of registers of several mainstream architectures. Note that in x86-compatible processors the stack pointer (ESP) is counted as an integer register, even though there are a limited number of instructions that may be used to operate on its contents. Similar caveats apply to most architectures.

Although all of the above listed architectures are different, almost all are a basic arrangement known as the Von Neumann architecture, first proposed by mathematician John von Neumann.

Architecture GPRs/data+address registers FP
registers
Notes
4004[1] 1 accumulator, 16 others 0 Register A is for general purpose, while r0r15 registers are for the address and segment.
8008[2] 1 accumulator, 6 others 0 The A register is an accumulator to which all arithmetic is done; the H and L registers can be used in combination as an address register; all registers can be used as operands in load/store/move/increment/decrement instructions and as the other operand in arithmetic instructions. There is no FP unit available.
8080[3] 1 accumulator, 6 others 0 Plus a stack pointer. The A register is an accumulator to which all arithmetic is done; the register pairs B+C, D+E, and H+L, can be used as address registers in some instructions; all registers can be used as operands in load/store/move/increment/decrement instructions and as the other operand in arithmetic instructions. Some instructions only use H+L; another instruction swaps H+L and D+E. There is no FP unit available.
16-bit x86[4] 8 stack of 8 (if FP present) 8086/8088, 80186/80188, 80286, with 8087, 80187 or 80287 for floating-point, with an 80-bit wide, 8 deep register stack with some instructions able to use registers relative to the top of the stack as operands; without 8087/80187/80287, no floating-point registers
IA-32[5] 8 stack of 8 (if FP present), 8 (if SSE/MMX present) 80386 required 80387 for floating-point, later processors had built-in floating point, with both having an 80-bit wide, 8 deep register stack with some instructions able to use registers relative to the top of the stack as operands. The Pentium III and later had the SSE with additional 128-bit XMM registers.
x86-64[6][5] 16 16 FP registers are 128-bit XMM registers, later extended to 256-bit YMM registers with AVX.
Xeon Phi[7] 16 32 Including 32 256/512-bit ZMM registers with AVX-512
Motorola 6800[8] 2 data, 1 index 0 Plus a stack pointer
Motorola 68k[9] 8 data (d0-d7), 8 address (a0-a7) 8 (if FP present) Address register 8 (a7) is the stack pointer. 68000, 68010, 68012, 68020, and 68030 require an FPU for floating point; 68040 had FPU built in. FP registers are 80-bit.
SH 16-bit 1 6
Emotion Engine 3 32 The Emotion Engine's main core contains two entries 64-bit integer data registers and one accumulator for integer computation(able to split into eight entries 8-bit registers for sub mode or eight entries 16-bit register in compatibility mode or four entries 32-bit registers in low-half integer mode) the 64-bit accumulator register for issuing data to floating point vector unit and two 16-bit control registers(not general purpose) for connecting general floating-point computation from vector register file. The processor is built via 32 entries 128-bit vector register file in vector co-processor based on MIPS architecture. Accumulator in the this case is not general purpose but control status.
CUDA 1 8/16/32/64/128 Each CUDA core contains single 32/64-bit integer data register while floating point unit holding much larger number of register, G8x contains eight 128-bit HDR vector register. GT200 increased the count to 16. Fermi extended the register width to 256 bits and increased the register count to 32, while Kepler increased it to 64. Maxwell contains a massive amount of 128 512-bit vector registers.
IBM/360 16 4 (if FP present) This applies to S/360's successors, System/370 through System/390; FP was optional in System/360, and always present in S/370 and later. In processors with the Vector Facility, there are 16 vector registers containing a machine-dependent number of 32-bit elements.[10]
z/Architecture 16 16 64-bit version of S/360 and successors
Xelerated X10 1 32 a 32/40 bit stack machine based network processor with modified MIPS instruction and 128 bit floating point unit.
Parallax Propeller 0 2 An eight core 8/16 bit sliced stack machine controller with simple logic circus inside, have eight cog counter(core) and each contain three 8/16 bit special control registers with 32 bit x 512 stack ram however it does not carrying any general register for integer purpose. unlike most of shadow register file in modern processor and multi core system, all these stack ram in cog can be accessed in instruction level which all these cog can act as one big single general purpose core if necessary. Floating point unit is external and it contain two 80 bit vector register.
Itanium 128 128 And 64 1-bit predicate registers and 8 branch registers. The FP registers are 82-bit.
SPARC 31 32 Global register 0 is hardwired to 0. Uses register windows.
IBM POWER 32 32 And 1 link and 1 count register.
Power Architecture 32 32 And 1 link and 1 count register. Processors supporting the Vector facility also have 32 128-bit vector registers,
Blackfin 8 16 containing two external uncore 40 bit accumulator, but non are general purpose. Support 64 bit RISC architecture ISA, vector register are 256 bit.
IBM Cell SPE 128 N/A 128 GPRs, which can hold integer, address, or floating-point values[11]
Alpha 31 31 Registers R31 (integer) and F31 (floating-point) are hardwired to zero.
6502 1 0 6502's content A (Accumulator) register for main purpose data store and memory address (8-bit data/16-bit address), X,Y are condition register and SP register are specific index only.
W65C816S 1 0 65C816 is the 16-bit successor of the 6502. X,Y, D (Direct Page register) are condition register and SP register are specific index only. main accumulator extended to 16-bit (B) while keep 8-bit (A) for compatibility and main register can now address up to 24-bit (16-bit wide data instruction/24-bit memory address).
65k 1 0 Direct successor of 6502, 65002 only content A (Accumulator) register for main purpose data store and extend data wide to 32-bit and 64-bit instruction wide, support 48-bit virtual address in software mode, X,Y are still condition register and remain 8-bit and SP register are specific index but increase to 16-bit wide.
Xilinx Spartan 1 3
MeP 4 8 Media-embedded processor was a 32 bit processor developed by toshiba, a modded 8080 instruction set with only A, B, C, D register available through all mode(8/16/32 bit) and incompatible with x86, however it contain 80 bit floating point unit that is x87 compatible.
PIC microcontroller 1 0
AVR microcontroller 32 0
ARM 32-bit 14 Varies (up to 32) r15 is the program counter, and not usable as a GPR; r13 is the stack pointer; r8-r13 can be switched out for others (banked) on a processor mode switch. Older versions had 26-bit addressing,[12] and used upper bits of the program counter (r15) for status flags, making that register 32-bit.
ARM 64/32-bit[13] 31 32 Register r31 is the stack pointer or hardwired to 0, depending on the context.
MIPS 31 32 Register 0 is hardwired to 0.
Epiphany 64 (per core) Each instruction controls whether registers are interpreted as integers or single precision floating point. Architecture is scalable to 4096 cores with 16 and 64 core implementations currently available.

Register usage

The number of registers available on a processor and the operations that can be performed using those registers has a significant impact on the efficiency of code generated by optimizing compilers. The Strahler number of an expression tree gives the minimum number of registers required to evaluate that expression tree.

See also

References

This article is issued from Wikipedia - version of the Sunday, February 14, 2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.