RISController
64-bit Microprocessor, based on
RISCore4000
Features
Features
FeaturesFeatures
◆
High performance 64-bit microprocessor, based on the
RISCore4000
– Minimized branch and load delays, through streamlined
5-stage scalar pipeline.
– Single and double precision floating-point unit
– 125 peak MFLOP/s at 250 MHz
– 330 Dhrystone MIPS at 250 MHz
– Flexible RC4700-compatible MMU
– Joint TLB on-chip, for virtual-to-physical address mapping
◆
On-chip two-way set associative caches
– 16KB instruction cache (I-cache)
– 16KB data cache (D-cache)
◆
Optional I-cache and D-cache locking (per set), provides
improved real-time support
◆
Enhanced, flexible bus interface allows simple, low-cost
design
– 64-bit Bus Interface option, 1000MB/s bandwidth support
– 32-bit Bus Interface option, 500MB/s bandwidth support
– SDRAM timing protocol, through delayed data in write cycles
– RC4000/RC5000 family bus-protocol compatibility
– Bus runs at fraction of pipeline clock (1/2 to 1/8)
◆
Implements MIPS-III Instruction Set Architecture (ISA)
◆
3.3V core with 3.3V I/O
TM
Embedded
TM
RC64474
RC64475
◆
Software compatible with entire RISController Series of
Embedded Microprocessors
◆
Industrial temperature range support
◆
Active power management
– Powers down inactive units, through sleep-mode feature
◆
100% pin compatibility between RC64574, RC64474 and
RC4640
◆
100% pin compatibility between RC64575, RC64475 and
RC4650
◆
RC64474 available in 128-pin QFP package, for 32-bit only
systems
◆
RC64475 available in 208-pin QFP package, for full 64/32 bit
systems
◆
Simplified board-level testing, throu gh full Joint T est Action
Group (JTAG) boundary scan
◆
Windows® CE compliant
™
™
Block diagram
Block diagram
Block diagramBlock diagram
330 MIPS
64-bit
RISCore4000
CPU Core
Control Bus
Instruction Bus
16KB
Instruction Cache
(Lockable)
The IDT logo is a registered trademark and RC32134, RC32364, RC6414 5, RC64474, RC64475, RC4650, RC4640, RC4600,RC4700 RC3081, RC3052, RC3051, RC3041, RISController, and RISCore are trademarks of Integrated Device Technology, Inc.
System Control
Coprocessor
(CP O )
32-/64-bit
Synchronized
System
Interface
125 MFLO PS
Single/Double
Precision
FPA
D a ta B u s
16KB
Data Cache
(Lockable)
2001 Integrated Device Technology, Inc .
1 of 25April 10, 2001
DSC 4952
RC64474™ RC64475™
Device Overview
Device Overview
Device OverviewDevice Overview
1111
Extending Integrated Device Technology’ s (IDT) RISCore4000 based
choices (see Table 1), the RC64474 and RC64475 are high performance 64-bit microprocessors targeted towards applications that require
high bandwidth, real-time response and rapid data processing and are
ideal for products ranging from internetworking equipment (switches,
routers) to multimedia systems such as web browsers, set-top boxes,
video games, and Windows
®
CE based products. These processors are
rated at 330 Dhrystone MIPS and 125 Million floating point operations
per second, at 250 MHz. The internal cache bandwidth for these devices
is over 3GB/second. The 64-bit external bus bandwidth is at more than
1000MB/s, and the 32-bit external bus bandwidth is at 500MB/s.
The RC64474 is packaged in a 128-pin QFP footprint package and
uses a 32-bit external bus, offering the ideal combination of 64-bit
processing power and 32-bit low-cost memory systems. The RC64475
is packaged in a 208-pin QFP footprint package and uses the full 64-bit
external bus. The RC64475 is ideal for applications requiring 64-bit
performance and 64-bit external bandwidth.
IDT’s RISCore4000 is a 250MHz 64-bit execution core that uses a
5-stage pipeline, eliminating the “issue restrictions” associated with
other more complex pipelines. The RISCore4000 implements the
MIPS-III Instruction Set Architecture (ISA) and is upwardly compatible
with applications that run on earlier generation parts.
Implementation of the MIPS-III architecture results in 64-bit operations, improved performance for commonly used code sequences in
1.
Detailed system operation information is provided in the RC64474/RC64475
user’s manual.
operating system kernels, and faster execution of floating-point intensive
applications.
The RISCore4000integer unit implements a load/store architecture
with single cycle ALU operations (logical, shift, add, subtract) and an
autonomous multiply/divide unit. The ALU consists of the integer adder
and logic unit. The adder performs address calculations in addition to
arithmetic operations, and the logic unit performs all of the processor’s
logical and shift operations. Each unit is highly optimized and can
perform an operation in a single pipeline cycle. Both 32- and 64-bit data
operations are performed by the RISCore4000, utilizing 32 general
purpose 64-bit registers (GPR) that are used for integer operations and
address calculation. A complete on-chip floating-point co-processor
(CP1), which includes a floating-point register file and execution units,
forms a “seamless” interface, decoding and executing instructions in
parallel with the integer unit.
CP1’s floating-point execution units support both single and
double precision arithmetic—as specified in the IEEE Standard 754—
and are separated into a multiply unit and a combined add/convert/
divide/square root unit. Overlap of multiplies and add/subtract is
supported, and the multiplier is partially pipelined, allowing the initiation
of a new multiply instruction every fourth pipeline cycle.
The floating-point register file is made up of thirty-two 64-bit registers. The floating-point unit can take advantage of the 64-bit wide data
cache and issue a co-processor load or store doubleword instruction in
every cycle. The RISCore4000’s system control coprocessor (CP0)registers are also incorporated on-chip and provide the path through
which the virtual memory system’s page mapping is examined and
changed, exceptions are handled, and any operating mode selections
are controlled.
RISCore4000/RISCore5000 Family of Socket Compatible Processors
RISCore4000/RISCore5000 Family of Socket Compatible Processors
RISCore4000/RISCore5000 Family of Socket Compatible ProcessorsRISCore4000/RISCore5000 Family of Socket Compatible Processors
32-bit Processors64-bit Processors
RC4640RC64474RC64574RC4650RC64475RC64575
CPU
Performance
FPA
Caches
External Bus
Voltage
Frequencies
Packages
MMU
Key Features
64-bit RISCore4000
w/ DSP extensions
>350MIPS>330MIPS>440MIPS>350MIPS>330MIPS>440MIPS
89 mflops, single pre-
Cache locking, JTAG,
syncDRAM mode, 32bit external bus
DSP extensions
666 mflops, single and
double precision
32kB/32kB, 2-way,
lockable by line
32-bit, Superset pin
compatible w/RC4640,
RC64474
Cache locking, JTAG,
syncDRAM mode, 32bit external bus
64-bit RISCore4000
w/ DSP extensions
89 mflops, single precision only
8kB/8kB, 2-way, lockable by set
32- or 64-bit32-or 64-bit, Super-
Cache locking, onchip MAC, 32-bit & 64
bit bus option
Table 1 RISCore4000/RISCore5000 Processor Family
64-bit RISCore4000 64-bit RISCore5000
125 mflops, single
and double pr ecision
16kB/16kB, 2-way,
lockable by set
set pin compatible w/
RC4650
Cache locking, JTAG,
syncDRAM mode, 3264- bit bus option
w/ DSP extensions
666 mflops, single
and double precision
32kB/32kB, 2-way,
lockable by line
32-or 64-bit, Superset pin compati ble w/
RC4650, RC64475
Cache locking, JTAG,
syncDRAM mode, 3 264- bit bus option
2 of 25April 10, 2001
RC64474™ RC64475™
A secure user processing environmentis provided through the user,
supervisor, and kernel operating modes of virtual addressing to
system software. Bits in a status register determine which of these
modes is used.
If configured for 64-bit virtual addressing, the virtual address space
layout becomes an upwardly compatible extension of the 32-bit virtual
address space layout. Figure 1 is an illustration of the address space
layout for the 32-bit virtual address operation.
Kernel virtual address space
(kseg3)
Mapped, 0.5GB
(sseg)
Mapped, 0.5GB
Uncached kernel physical address space
(kseg1)
Unmapped, 0.5GB
Cached kernel physical address space
(kseg0)
Unmapped, 0.5GB
User virtual address space
(useg)
Mapped, 2.0GB
The RC64474/RC64475’s Memory Management Unit (MMU)
controls the virtual memory system’s page mapping and consists of a
translation lookaside buffer (TLB) used for the virtual memory-mapping
subsystem.
This large, fully associative TLB maps 96 virtual pages to their
corresponding physical addresses. The TLB is organized as 48 pairs of
even-odd entries and maps a virtual address and address space identifier into the large, 64GB physical address space. To assist in controlling
the amount of mapped space and the replacement characteristics of
various memory regions, two mechanisms are provided. First, the page
size can be configured on a per-entry basis, to map a page size of 4KB
to 16MB (in increments of 4x).
The second mechanism controls the replacement algorithm, when a
TLB miss occurs. A random replacement algorithm is provided to select
a TLB entry to be written with a new mapping; however, the processor
provides a mechanism whereby a system specific number of mappings
can be locked into the TLB and avoid being randomly replaced, which
facilitates the design of real-time systems, by allowing deterministic
access to critical software.
The TLB also contains information to control the cache coherency
protocol, and cache management algorithm for each page. However,
hardware-based cache coherency is not supported.
The RC64474 and RC64475 enhance IDT’s entire RISCore4000
series through the implementation of features such as boundary scan, to
facilitate board level testing; enhanced support for SyncDRAM, to
simplify system implementation and improve performance.
The RC64474/475 processors offer a direct migration path for
designs based on IDT’s RC4640/RC4650 processors
2
, through full pin
and socket compatibility. Also, full 64-bit-family software and busprotocol compatibility ensures the RC64474/475 access to a robust
development tools infrastructure, allowing quicker time to market.
Deve lop m ent Tools
Deve lop m ent Tools
Deve lop m ent ToolsDeve lop m ent Tools
An array of hardware and software tools is available to assist system
designers in the rapid development of RC64474/475 based systems.
This accessibility allows a wide variety of customers to take full advantage of the device’s high-performance features while addressing today’s
aggressive time-to-market demands.
Cache Memory
Cache Memory
Cache MemoryCache Memory
To keep the RC64474 and RC64475’s high-performance pipeline full
and operating efficiently, on-chip instruction and data caches have been
incorporated. Each cache has its own data path and can be accessed in
the same single pipeline clock cycle.
The 16KB two-way set associative instruction cache(I-cache) is
virtually indexed, physically tagged, and word parity protected. Because
this cache is virtually indexed, the virtual-to-physical address translation
occurs in parallel with the cache access, further increasing performance
by allowing both operations to occur simultaneously. The instruction
cache provides a peak instruction bandwidth of 1000MB/sec at 250MHz.
The 16KB two-way set associative data cache(D-cache) is byte
parity protected and has a fixed 32-byte (eight words) line size. Its tag is
protected with a single parity bit. To allow simultaneous address translation and data cache access, the D-cache is virtually indexed and physically tagged. The data cache can provide 8 bytes each clock cycle, for a
peak bandwidth of 2GB/sec.
To lock critical sections of code and/or data into the caches for quick
access, a “cache locking” feature has been implemented. Once
enabled, a cache is said to be locked when a particular piece of code or
data is loaded into the cache and that cache location will not be selected
later for refill by other data. This feature locks a set (8KB) of Instructions
and/or Data.
Table 2 lists the RC64474/475 Instruction and data cache attributes.
2.
To ensure socket compatibility, refer to Table 8 and Table 9 at back of data
sheet.
3 of 25April 10, 2001
RC64474™ RC64475™
CharacteristicsInstructionData
Size16KB16KB
Organization2-way set
associative
Line size32B32B
read unit32-bits64-bits
write policynawrite-back, write-through
Line transfer ordersub-block order,
for refill
Miss restart
after transfer of:
Parityper-wordper-byte
Cache lockingper setper set
with or without write-allocate
sub-block order, for load
sequential order, for store
The RC64475 supports a 64-bit system interface that is bus compatible with the RC4650 and RC64575 system interface. The system interface consists of a 64-bit Address/Data bus with 8 check bits and a 9-bit
command bus that is parity protected.
During 64-bit operation, RC64475 system address/data (SysAD)
transfers are protected with an 8-bit parity check bus, SysADC. When
initialized for 32-bit operation, the RC64475’s SysAD can be viewed as a
32-bit multiplexed bus that is protected by 4 parity check bits.
The RC64474 supports a 32-bit system interface that is bus compatible with the RC4640. During 32-bit operation, SysAD transfers are
performed on a 32-bit multiplexed bus (SysAD 31:0) that is protected by
4 parity check bits (SysADC 6:0).
Writes to external memory—whether they are cache miss writebacks, stores to uncached or write-through addresses—use the on-chip
write buffer. The write buffer holds a maximum of four 64-bit addresses
and 64-bit data pairs. The entire buffer is used for a data cache writeback and allows the processor to proceed in parallel with memory
updates.
A boot-time mode control interface initializes fundamental
processor modes. The boot-time mode control interface is a serial interface that operates at a very low frequency (MasterClock divided by
256). This low-frequency operation allows the initialization information to
be kept in a low-cost EPROM; alternatively, the twenty-or-so bits could
be generated by the system interface ASIC or a simple PAL. The boottime serial stream and configuration options are listed in Table 3.
The clocking interface allows the CPU to be easily mated with
external reference clocks. The CPU input clock is the bus reference
clock and can be between 25 and 125MHz. An on-chip phase-locked-loop (PLL) generates the pipeline clock (PClock) through multiplication
of the system interface clock by values of 2,3,4,5,6,7 or 8, as defined at
system reset. This allows the pipeline clock to be implemented at a
significantly higher frequency than the system interface clock. The
RC64474/475 support single data (one to eight bytes) and 8-word block
transfers on the SysAD bus.
The RC64474/475 implement additional write protocols that
double the effective write bandwidth. The write re-issue has a repeat
rate of 2 cycles per write. Pipelined writes have the same 2-cycle per
write repeat rate, but can issue an additional write after WrRdy* deasserts.
Choosing a 32- or 64-bit wide system interface dictates whether a
cache line block transaction requires 4 double word data cycles or 8
single word cycles as well as whether a single data transfer—larger than
4 bytes—must be divided into two smaller transfers.
Board-level testing during Run-Time mode is facilitated through the
full JTAG boundary scan facility . Six pins—TDI, TDO, TMS, TCK, TRST*
and JTAG32*—have been incorporated to support the standard JTAG
interface.
System Enhancement
System Enhancement
System EnhancementSystem Enhancement
To facilitate discrete interface to SDRAM, the RC64474/475 bus
interface is enhanced during write cycles with a programmable delay
that is inserted between the write address and the write data (for both
block and non-block writes).
The bus delay can be defined as 0 to 7 MasterClock cycles and is
activated and controlled through mode bit (17:15) settings selected
during the reset initialization sequence. The ‘000’ setting provides the
same write operations timing protocol as the RC4640, RC4650, and
RC5000 processors.
Included in the system interface are six handshake signals:
RdRdy*, WrRdy*, ExtRqst*, Release*, ValidOut*, and ValidIn*; six inter-rupt inputs, and a simple timing specification that is capable of transferring data between the processor and memory at a peak rate of
1000MB/sec. A boot-time selectable option to run the system interface
as 32-bits wide—using basically the same protocols as the 64-bit
system—is also supported.
4 of 25April 10, 2001
RC64474™ RC64475™
Serial
Bit
255:18Reserved Must be 0
17:15WAdrWData_Del
Write address to write data delay in MasterClock cycles.®
14:13Drv_Out
output dri ver sl ew ra te con trol . Bit 14 is MSB.
Affects only non-clock outputs.
12System interface bus width0 → 64-bit system interface
Executing the WAIT instruction enables the processor to enter
Standby mode. The internal clocks will shut down, thus freezing the
pipeline. The PLL, internal timer, and some of the input pins (Int[5:0]*,
NMI*, ExtReq*, Reset*, and ColdReset*) will continue to run. Once the
CPU is in Standby Mode, any interrupt, including the internally generated timer interrupt, will cause the CPU to exit Standby Mode.
Thermal Considerations
Thermal Considerations
Thermal ConsiderationsThermal Considerations
The RC64474/475 come in a QFP with a drop-in heat spreader and
are guaranteed in a case temperature range of 0° to +85° C, for
commercial temperature devices; - 40° to +85° for industrial temperature devices. The type of package, speed (power) of the device, and
airflow conditions affect the equivalent ambient temperature conditions
that will meet this specification.
The equivalent allowable ambient temperature, T
using the thermal resistance from case to ambient (∅
, can be calculated
A
) of the given
CA
package. The following equation relates ambient and case temperatures:
T
A = TC
- P * ∅CA
where P is the maximum power consumption at hot temperature,
calculated by using the maximum I
Typical values for ∅
at various airflows are shown in Table 4. Note
CA
specification for the device.
CC
that the RC64474/475 processors implement advanced power management, which substantially reduces the typical power dissipation of the
device.
January 17, 2000: Added “with DSP extensions” in the CPU row
under RC64574 and RC64575 columns in Table 1. Added “lockable by
line” in the Caches row under RC64574 and RC64575 columns in Table
1. Revised Data Output and Data Output Hold rows in System Interface
Parameters table.
February 10, 2000: Revised values in Table 4, Thermal Resistance.
Old values were:
∅∅∅∅
CA
Airflow (ft/min)0200 4006008001000
128 QFP20129876
208 QFP 20129876
March 13, 2000: Replaced existing figure in Mode Configuration
Interface Reset Sequence section with 3 reset figures.
March 28, 2000: Removed the symbol t
April 17, 2000: Changed V
.
0.7V
CC
value in 200MHz column from 2.0V to
IH
from Figure 3.
DZ
April 10, 2001: In the Data Output and Data Output Hold categories
of the System Interface Parameters table, changed values in the Min
column for all speeds from 1.0 to 0. Deleted Output for Loading AC
Testing diagram and added Output Loading for AC Timing diagram
(Figure 8).
∅∅∅∅
CA
Airflow (ft/min)0200 400 6008001000
128 QFP16109765
208 QFP 201310987
Table 4 Thermal Resistance (∅∅∅∅CA) at Various Airflows
Data Sheet Revision History
Data Sheet Revision History
Data Sheet Revision HistoryData Sheet Revision History
December 1998: Changed ordering code on 128-pin package from
DQ / DQI (Industrial) to DZ / DZI (Industrial).
January 1999: Removed 5V tolerance capability and deleted 5V
tolerant pin.
February 1999: Changed the package drawings to reflect the new
208-pin DP (DPI) and 128-pin DZ (DZI) packages.
May 1999: Removed “Preliminary” status from data sheet.
Changes in DC Electrical Characteristics table. Changes in Pin
Description table. Changes in Clock Parameters table. Changes in
System Interface Parameters table.
September 1999: Updated Revision History section.
6 of 25April 10, 2001
RC64474™ RC64475™
Pin Description Table
Pin Description Table
Pin Description TablePin Description Table
The following is a list of system interface pins available on the RC64474/475. Pin names ending with an asterisk (*) are active when low.
Pin NameTypeDescription
System Interface
ExtRqst*IExternal request
An external agent asserts ExtRqst* to request use of the System interface. The processor grants the request by asserting
Release*.
Release*ORelease interface
In response to the assertion of ExtRqst* or a CPU read request, the processor asserts Release* and signals to the requesting device that the system interface is available.
RdRdy*IRead Ready
The external agent asserts RdRdy* to indicate that it can accept a processor read request.
WrRdy*IWrite Ready
An external agent asserts WrRdy* when it can now accept a processor write request.
ValidIn*IValid Input
Signals that an external agent is now driving a valid address or data on the SysAD bus and a valid command or data identifier on the SysCmd bus.
ValidOut*OValid output
Signals that the processor is now driving a valid address or data on the SysAD bus and a valid command or data identifier
on the SysCmd bus.
SysAD(63:0)I/OSystem address/data bus
A 64-bit address and data bus for communication between the processor and an external agent. During address phases
only, SysAd(35:0) contains valid address information. The remaining SysAD(63:36) pins are not used. The whole 64-bit
SysAD(63:0) may be used during the data transfer phase.
In 32-bit mode and in the RC64474, SysAD(63:32) is not used, regardless of Endianness. A 32-bit address and data communication between processor and external agent is performed via SysAD(31:0).
SysADC(7:0)I/OSystem address/data check bus
An 8-bit bus containing parity check bits for the SysAD bus during data bus cycles.
In 32-bit mode and in the RC64474, SysADC(7:4) is not used. The SysADC(3:0) contains check bits for SysAD(31:0).
SysCmd(8:0)I/OSystem command/data identifier bus
A 9-bit bus for command and data identifier transmission between the processor and an external agent.
SysCmdPI/OSystem Command Parity
A single, even-parity bit for the Syscmd bus. This signal is always driven low.
Clock/Control Interface
MasterClockIMaster Clock
Master clock input establishes the processor and bus operating frequency. It is multiplied internally by 2,3,4,5,6,7,8 to gen-
erate the pipeline clock (PClock). This clock must be driven by 3.3V (Vcc) clock signals, regardless of the 5V tolerant pin
setting.
PIQuiet VCC for PLL
CC
V
for the internal phase locked loop.
CC
Quiet V
V
PIQuiet VSS for PLL
SS
Quiet V
Interrupt Interface
Int*(5:0)IInterrupt
Six general processor interrupts, bit-wise ORed with bits 5:0 of the interrupt register.
for the internal phase locked loop.
SS
Table 5 Pin Descriptions (Page 1 of 2)
7 of 25April 10, 2001
RC64474™ RC64475™
Pin NameTypeDescription
NMI*INon-maskable interrupt
Non-maskable interrupt, ORed with bit 6 of the interrupt register.
Initialization Interface
kIV
O
V
CC
CC
is OK
When asserted, this signal indicates to the processor that the power supply has been above the Vcc minimum for more
than 100 milliseconds and will remain stable. The assertion of V
ColdReset*ICold reset
This signal must be asserted for a power on reset or a cold reset. ColdReset must be de-asserted synchronously with Mas-
terClock.
Reset*IReset
This signal must be asserted for any reset sequence. It can be asserted synchronously or asynchronously for a cold reset,
or synchronously to initiate a warm reset. Reset must be de-asserted synchronously with MasterClock.
ModeClockOBoot-mode clock
Serial boot-mode data clock output at the system clock frequency divided by two hundred fifty-six.
ModeInIBoot-mode data in
Serial boot-mode data input.
JTAG Interface
TDIIJTAG Data In
On the rising edge of TCK, serial input data are shifted into either the Instruction register or Data register, depending on the
TAP controller state.
k initiates the initialization sequence.
CCO
TDOOJTAG Data Out
On the falling edge of TCK, the TDO is serial data shifted out from either the instruction or data register. When no data is
shifted out, the TDO is tri-stated (high impedance).
TCKIJTAG Clock Input
An input test clock used to shift into or out of the boundary-scan register cells. TCK is independent of the system and pro-
cessor clock with nominal 40-60% duty cycle.
TMSIJTAG Command Select
The logic signal received at the TMS input is decoded by the TAP controller to control test operation. TMS is sampled on
the rising edge of TCK.
TRST*IJTAG Reset
The TRST* pin is an active-low signal used for asynchronous reset of the debug unit, independent of the processor logic.
During normal CPU operation, the JTAG controller will be held in the reset mode, asserting this active low pin.
When asserted low, this pin will also tristate the TDO pin.
JTAG32*IJTAG 32-bit scan
This pin is used to control length of the scan chain for SYsAD (32-bit or 64-bit) for the JTAG mode. When set to Vss, 32-bit
bus mode is selected. In this mode, only SysAD(31:0) are part of the scan chain. When set to Vcc, 64-bit bus mode is
selected. In this mode, SysAD(63:0) are part of the scan chain. This pin has a built-in pull-down device to guarantee 32-bit
scan, if it is left uncovered.
JR_VccIJTAG VCC
This pin has an internal pull-down to continuously reset the JTAG controller (if left unconnected) bypassing the TRst* pin.
When supplied with Vcc, the TRst* pin will be the primary control for the JTAG reset.
Table 5 Pin Descriptions (Page 2 of 2)
8 of 25April 10, 2001
Loading...
+ 17 hidden pages
You need points to download manuals.
1 point = 1 manual.
You can buy points or you can get point for every manual you upload.