IDT79R3081 RISControllerMILITARY AND COMMERCIAL TEMPERATURE RANGES
IDT79R3081
RISController
IDT 79R3081, 79R3081E
IDT 79RV3081, 79RV3081E
with FPA
Integrated Device Technology, Inc.
FEATURES
• Instruction set compatible with IDT79R3000A, R3041,
R3051, and R3071 RISC CPUs
• High level of integration minimizes system cost
— R3000A Compatible CPU
— R3010A Compatible Floating Point Accelerator
— Optional R3000A compatible MMU
— Large Instruction Cache
— Large Data Cache
— Read/Write Buffers
• 43VUPS at 50MHz
— 13MFlops
• Flexible bus interface allows simple, low cost designs
• Optional 1x or 2x clock input
• 20 through 50MHz operation
• "V" version operates at 3.3V
• 50MHz at 1x clock input and 1/2 bus frequency only
R3081 BLOCK DIAGRAM
ClkIn
Int(5:0)
Clock Generator
Unit/Clock Doubler
System Control
Coprocessor
(CP0)
Exception/Control
Registers
Memory Management
Registers
Translation
Lookaside Buffer
(64 entries)
Master Pipeline Control
Virtual Address
• Large on-chip caches with user configurability
— 16kB Instruction Cache, 4kB Data Cache
— Dynamically configurable to 8kB Instruction Cache,
8kB Data Cache
— Parity protection over data and tag fields
• Low cost 84-pin packaging
• Superset pin- and software-compatible with R3051, R3071
• Multiplexed bus interface with support for low-cost, lowspeed memory systems with a high-speed CPU
• Bus Interface can operate at half-processor frequency
BrCond(3:2,0)
Floating Point
Integer
CPU Core
General Registers
(32 x 32)
ALU
Shifter
Mult/Div Unit
Address Adder
PC Control
FP Interrupt
Coprocessor
(CP1)
Register Unit
(16 x 64)
Exponent Unit
Add Unit
Divide Unit
Multiply Unit
Exception/Control
Physical Address Bus
32
Parity
Generator
4-deep
The IDT logo is a registered trademark, and RISController, R3041, R3051, R3052, R3071, R3081, R3720, R4400, R4600, IDT/kit, and IDT/sim are trademarks of Integrated Device Technology, Inc.
Read
Buffer
Configurable
Instruction
Cache
(16kB/8kB)
4-deep
Write
Buffer
Address/
Data
Data Bus
R3051 Superset Bus Interface Unit
DMA
DMA
Arbiter
Ctrl
Configurable
Data
Cache
(4kB/8kB)
BIU
Control
Rd/Wr
Ctrl
SysClk
36
Coherency
Logic
Invalidate
Control
Data Bus
2889 drw 01
MILITARY AND COMMERCIAL TEMPERATURE RANGES OCTOBER 2001
2001 Integrated Device Technology, Inc. 5.5 DSC-9064/5
5.51
IDT79R3081 RISControllerMILITARY AND COMMERCIAL TEMPERATURE RANGES
INTRODUCTION
The IDT R3051 family is a series of high-performance 32bit microprocessors featuring a high-level of integration, and
targeted to high-performance but cost sensitive processing
applications. The R3051 family is designed to bring the highperformance inherent in the MIPS RISC architecture into
low-cost, simplified, power sensitive applications.
Thus, functional units have been integrated onto the CPU
core in order to reduce the total system cost, rather than to
increase the inherent performance of the integer engine.
Nevertheless, the R3051 family is able to offer 43VUPS
performance at 50MHz without requiring external SRAM or
caches.
The R3081 extends the capabilities of the R3051 family, by
integrating additional resources into the same pin-out. The
R3081 thus extends the range of applications addressed by
the R3051 family, and allows designers to implement a single,
base system and software set capable of accepting a wide
variety of CPUs, according to the price/performance goals of
the end system.
In addition to the embedded applications served by the
R3051 family, the R3081 allows low-cost, entry level computer
systems to be constructed. These systems will offer many
times the performance of traditional PC systems, yet cost
approximately the same. The R3081 is able to run any
standard R3000A operation system, including ACE UNIX.
Thus, the R3081 can be used to build a low-cost ARC
compliant system, further widening the range of performance
solutions of the ACE Initiative.
An overview of this device, and quantitative electrical
parameters and mechanical data, is found in this data sheet;
consult the
complete description of this processor.
"R3081 Family Hardware User's Guide"
for a
DEVICE OVERVIEW
As part of the R3051 family, the R3081 extends the offering
of a wide range of functionality in a compatible interface. The
R3051 family allows the system designer to implement a
single base system, and utilize interface-compatible processors
of various complexity to achieve the price-performance goals
of the particular end system.
Differences among the various family members pertain to
the on-chip resources of the processor. Current family members
include:
• The R3052E, which incorporates an 8kB instruction cache,
a 2kB data cache, and full function memory management
unit (MMU) including 64-entry fully associative Translation
Lookaside Buffer (TLB).
• The R3052, which also incorporates an 8kB instruction
cache and 2kB data cache, but does not include the TLB,
and instead uses a simpler virtual to physical address
mapping.
• The R3051E, which incorporates 4kB of instruction cache
and 2kB of data cache, along with the full function MMU/
TLB of the R3000A.
• The R3051, which incorporates 4kB of instruction cache
and 2kB of data cache, but omits the TLB, and instead uses
a simpler virtual to physical address mapping.
• The R3081E, which incorporates a 16kB instruction cache,
a 4kB data cache, and full function memory management
unit (MMU) including 64-entry fully associative Translation
Lookaside Buffer (TLB). The cache on the R3081E is user
configurable to an 8kB Instruction Cache and 8kB Data
Cache.
• The R3081, which incorporates a 16kB instruction cache,
a 4kB data cache, but uses the simpler memory mapping
of the R3051/52, and thus omits the TLB. The cache on the
R3081 is user configurable to an 8kB Instruction Cache and
8kB Data Cache.
Figure 1 shows a block level representation of the functional
units within the R3081E. The R3081E could be viewed as the
embodiment of a discrete solution built around the R3000A
and R3010A. However, by integrating this functionality on a
single chip, dramatic cost and power reductions are achieved.
CPU Core
The CPU core is a full 32-bit RISC integer execution
engine, capable of sustaining close to single cycle execution.
The CPU core contains a five stage pipeline, and 32 orthogonal
32-bit registers. The R3081 uses the same basic integer
execution core as the entire R3051 family, which is the
R3000A implementation of the MIPS instruction set. Thus, the
R3081 family is binary compatible with the R3051, R3052,
R3000A, R3001, and R3500 CPUs. In addition, the R4000
represents an upwardly software compatible migration path to
still higher levels of performance.
The execution engine in the R3081 uses a five-stage
pipeline to achieve near single-cycle instruction execution
rates. A new instruction can be initiated in each clock cycle;
the execution engine actually processes five instructions
concurrently (in various pipeline stages). Figure 2 shows the
concurrency achieved in the R3081 execution pipeline.
System Control Co-Processor
The R3081 family also integrates on-chip the System
Control Co-processor, CP0. CP0 manages both the exception
handling capability of the R3081, as well as the virtual to
physical address mapping.
As with the R3051 and R3052, the R3081 offers two
versions of memory management and virtual to physical
address mapping: the extended architecture versions, the
R3051E, R3052E, and R3081E, incorporate the same MMU
as the R3000A. These versions contain a fully associative 64entry TLB which maps 4kB virtual pages into the physical
address space. The virtual to physical mapping thus includes
kernel segments which are hard-mapped to physical
addresses, and kernel and user segments which are mapped
page by page by the TLB into anywhere in the 4GB physical
address space. In this TLB, 8 pages can be “locked” by the
kernel to insure deterministic response in real-time applications.
Figure 3 illustrates the virtual to physical mapping found in the
R3081E.
5.52
IDT79R3081 RISControllerMILITARY AND COMMERCIAL TEMPERATURE RANGES
The extended architecture versions of the R3051 family
I#1
IF
ALURDMEM WB
(the R3051E, R3052E, and R3081E) allow the system designer
to implement kernel software which dynamically manages
I#2
IF
ALURDMEM WB
user task utilization of system resources, and also allows the
Kernel to protect certain resources from user tasks. These
capabilities are important in general computing applications
I#3
IF
ALURDMEM WB
such as ARC computers, and are also important in a variety of
embedded applications, from process control (where protection
I#4
IF
ALURDMEM WB
may be important) to X-Window display systems (where
virtual memory management can be used). The MMU can
I#5
IF
ALURDMEM WB
also be used to simplify system debug.
R3051 family base versions (the R3051, R3052, and R3081)
remove the TLB and institute a fixed address mapping for the
Current
CPU
Cycle
Figure 2. R3081 5-Stage Pipeline
2889 drw 02
various segments of the virtual address space. These devices
still support distinct kernel and user mode operation, but do
not require page management software, leading to a simpler
software model. The memory mapping used by these devices
is shown in Figure 4. Note that the reserved spaces are for
compatiblity with future family members, which may map onchip resources to these addresses. References to these
addresses in the R3081 will be translated in the same fashion
as the rest of their respective segments, with no traps or
exceptions signalled.
When using the base versions of the architecture, the
system designer can implement a distinction between the
user tasks and the kernel tasks, without having to implement
page management software. This distinction can be
implemented by decoding the output physical address. In
systems which do not need memory protection, and wish to
have the kernel and user tasks operate out of the same
memory space, high-order address lines can be ignored by
0xffffffff
0xc0000000
0xa0000000
0x80000000
VIRTUALPHYSICAL
Kernel Mapped
(kseg2)
Kernel Uncached
(kseg1)
Kernel Cached
(kseg0)
Any
Physical
Memory
3548MB
the address decoder, and thus all references will be seen in
User Mapped
Cacheable
(kuseg)
0x00000000
Figure 3. Virtual to Physical Mapping of Extended Architecture
Any
Memory
Versions
512 MB
2889 drw 03
the lower gigabyte of the physical address space.
Floating Point Co-Processor
The R3081 also integrates an R3010A compatible floating
point accelerator on-chip. The FPA is a high-performance coprocessor (co-processor 1 to the CPU) providing separate
add, multiply, and divide functional units for single and double
precision floating point arithmetic. The floating point accelerator
0xffffffff
VIRTUAL
1MB Kernel Rsvd
PHYSICAL
features low latency operations, and autonomous functional
units which allow differing types of floating point operations to
function concurrently with integer operations. The R3010A
Kernel Cached
(kseg2)
0xc0000000
Kernel Uncached
(kseg1)
0xa0000000
Kernel Cached
(kseg0)
0x80000000
1MB User Rsvd
User
Cached
(kuseg)
0x00000000
Figure 4. Virtual to Physical Mapping of Base Architecture Versions
Kernel Cacheable
Tasks
Kernel/User
Cacheable
Tasks
Inaccessible
Kernel Boot
and I/O
1024 MB
2048 MB
512 MB
512 MB
2889 drw 04
appears to the software programmer as a simple extension of
the integer execution unit, with 16 dedicated 64-bit floating
point registers (software references these as 32 32-bit registers
when performing loads or stores). Figure 5 illustrates the
functional block diagram of the on-chip FPA.
Clock Generator Unit
The R3081 is driven from a single input clock which can be
either at the processor rated speed, or at twice that speed. Onchip, the clock generator unit is responsible for managing the
interaction of the CPU core, caches, and bus interface. The
R3081 includes an on-chip clock doubler to provide higher
frequency signals to the internal execution core; if 1x clock
mode is selected, the clock doubler will internally convert it to
5.53
IDT79R3081 RISControllerMILITARY AND COMMERCIAL TEMPERATURE RANGES
a double frequency clock. The 2x clock mode is provided for
compatiblity with the R3051. The clock generator unit replaces
the external delay line required in R3000A based applications.
Instruction Cache
The R3081 implements a 16kB Instruction Cache. The
system may choose to repartition the on-chip caches, so that
the instruction cache is reduced to 8kB but the data cache is
increased to 8kB. The instruction cache is organized with a
line size of 16bytes (four entries). This large cache achieves
hit rates in excess of 98% in most applications, and substantially
contributes to the performance inherent in the R3081. The
cache is implemented as a direct mapped cache, and is
capable of caching instructions from anywhere within the 4GB
physical address space. The cache is implemented using
physical addresses (rather than virtual addresses), and thus
does not require flushing on context switch.
The instruction cache is parity protected over the instruction
word and tag fields. Parity is generated by the read buffer
during cache refill; during cache references, the parity is
checked, and in the case of a parity error, a cache miss is
processed.
Data Cache
The R3081 incorporates an on-chip data cache of 4kB,
organized as a line size of 4 bytes (one word). The R3081
allows the system to reconfigure the on-chip cache from the
default 16kB I-Cache/4kB D-Cache to 8kB of Instruction and
8kB of Data caches.
The relatively large data cache achieves hit rates in excess
of 95% in most applications, and contributes substantially to
the performance inherent in the R3081. As with the instruction
cache, the data cache is implemented as a direct mapped
physical address cache. The cache is capable of mapping any
word within the 4GB physical address space.
The data cache is implemented as a write-through cache,
to insure that main memory is always consistent with the
internal cache. In order to minimize processor stalls due to
data write operations, the bus interface unit incorporates a 4deep write buffer which captures address and data at the
processor execution rate, allowing it to be retired to main
memory at a much slower rate without impacting system
performance. Further, support has been provided to allow
hardware based data cache coherency in a multi-master
environment, such as one utilizing DMA from I/O to memory.
The data cache is parity protected over the data and tag
fields. Parity is generated by the read buffer during cache refill;
during cache references, the parity is checked, and in the case
of a parity error, a cache miss is processed.
Bus Interface Unit
The R3081 uses its large internal caches to provide the
majority of the bandwidth requirements of the execution
engine, and thus can utilize a simple bus interface connected
to slower memory devices. Alternately, a high-performance,
low-cost secondary cache can be implemented, allowing the
processor to increase performance in systems where bus
bandwidth is a performance limitation.
As part of the R3051 family, the R3081 bus interface utilizes
a 32-bit address and data bus multiplexed onto a single set of
pins. The bus interface unit also provides an ALE (Address
Latch Enable) output signal to de-multiplex the A/D bus, and
Cache
Data
Data Bus
(32)(32)
InstructionsOperands
Register Unit (16 X 64)
Exponent PartFraction
Condition
Codes
Control Unit
and Clocks
(11)(11)(11)(53)(53)(53)
ABResultResult
Exponent
Unit
Divide Unit
Multiply Unit
Figure 5. FPA Functional Block Diagram
AB
(53)(53)(56)
(53)(53)(56)
Add Unit
Round
ResultAB
ResultAB
2889 drw 05
5.54
IDT79R3081 RISControllerMILITARY AND COMMERCIAL TEMPERATURE RANGES
simple handshake signals to process CPU read and write
requests. In addition to the read and write interface, the R3051
family incorporates a DMA arbiter, to allow an external master
to control the external bus.
The R3081 also supports hardware based cache coherency
during DMA writes. The R3081 can invalidate a specified line
of data cache, or in fact can perform burst invalidations during
burst DMA writes.
The R3081 incorporates a 4-deep write buffer to decouple
the speed of the execution engine from the speed of the
memory system. The write buffers capture and FIFO processor
address and data information in store operations, and present
it to the bus interface as write transactions at the rate the
memory system can accommodate.
The R3081 read interface performs both single datum
reads and quad word reads. Single reads work with a simple
handshake, and quad word reads can either utilize the simple
handshake (in lower performance, simple systems) or utilize
a tighter timing mode when the memory system can burst data
at the processor clock rate. Thus, the system designer can
choose to utilize page or nibble mode DRAMs (and possibly
use interleaving, if desired, in high-performance systems), or
use simpler techniques to reduce complexity.
In order to accommodate slower quad word reads, the
R3081 incorporates a 4-deep read buffer FIFO, so that the
external interface can queue up data within the processor
before releasing it to perform a burst fill of the internal caches.
The R3081 is R3051 superset compatible in its bus interface.
Specifically, the R3081 has additional support to simplify the
design of very high frequency systems. This support includes
the ability to run the bus interface at one-half the processor
execution rate, as well as the ability to slow the transitions
between reads and writes to provide extra buffer disable time
for the memory interface. However, it is still possible to design
a system which, with no modification to the PC Board or
software, can accept either an R3041, R3051, R3052, R3071,
or R3081.
SYSTEM USAGE
The IDT R3051 family has been specifically designed to
allow a wide variety of memory systems. Low-cost systems
can use slow speed memories and simple controllers, while
other designers may choose to incorporate higher frequencies,
faster memories, and techniques such as DMA to achieve
maximum performance. The R3081 includes specific support
for high perfromance systems, including signals necessary to
implement external secondary caches, and the ability to
perform hardware based cache coherency in multi-master
systems.
Figure 6 shows a typical system implementation.
Transparent latches are used to de-multiplex the R3081
address and data busses from the A/D bus. The data paths
between the memory system elements and the A/D bus is
managed by simple octal devices. A small set of simple PALs
is used to control the various data path elements, and to
control the handshake between the memory devices and the
CPU.
Depending on the cost vs. performance tradeoffs appropriate
to a given application, the system design engineer could
include true burst support from the DRAM to provide for highperformance cache miss processing, or utilize a simpler,
lower performance memory system to reduce cost and simplify
the design. Similarly, the system designer could choose to
implement techniques such as external secondary cache, or
DMA, to further improve system performance.
DEVELOPMENT SUPPORT
The IDT R3051 family is supported by a rich set of
development tools, ranging from system simulation tools
through PROM monitor and debug support, applications
software and utility libraries, logic analysis tools, sub-system
modules, and shrink wrap operating systems. The R3081,
which is pin and software compatible with the R3051, can
directly utilize these existing tools to reduce time to market.
Figure 7 is an overview of the system development process
typically used when developing R3051 family applications.
The R3051 family is supported in all phases of project
development. These tools allow timely, parallel development
of hardware and software for R3051 family applications, and
include tools such as:
• Optimizing compilers from MIPS, the acknowledged leader
in optimizing compiler technology.
• Cross development tools, available in a variety of
development environments.
• The IDT Evaluation Board, which includes RAM, EPROM,
• IDT/kit, which implements a run-time support package for
R3051 family systems.
PERFORMANCE OVERVIEW
The R3081 achieves a very high-level of performance. This
performance is based on:
• An efficient execution engine. The CPU performs ALU
operations and store operations in a single cycle, and has
an effective load time of 1.3 cycles, and branch execution
rate of 1.5 cycles (based on the ability of the compilers to
avoid software interlocks). Thus, the execution engine
achieves over 35 VUPS performance when operating out
of cache.
• A full featured floating point accelerator/co-processor.
The R3081 incorporates an R3010A compatible floating
point accelerator on-chip, with independent ALUs for floating
point add, multiply, and divide. The floating point unit is fully
hardware interlocked, and features overlapped operation
and precise exceptions. The FPA allows floating point
adds, multiplies, and divides to occur concurrently with
each other, as well as concurrently with integer operations.
• Large on-chip caches. The R3051 family contains caches
which are substantially larger than those on the majority of
today’s microprocessors. These large caches minimize the
number of bus transactions required, and allow the R3051
family to achieve actual sustained performance very close
to its peak execution rate. The R3081 doubles the cache
available on the R3052, making it a suitable engine for
5.55
IDT79R3081 RISControllerMILITARY AND COMMERCIAL TEMPERATURE RANGES
ClkIn
IDT R3081
RISController
I/O Controller
Address/Data
R3051
Local Bus
DRAMDRAMPROMI/OI/O
Figure 6. R3081 RISChipset Based System
Control
DRAM
Controller
IDT73720
Bus Exchanger
2889 drw 06
System
Architecture
Evaluation
Cache-3051
SPP
Benchmarks
Evaluation Board
Laser Printer System
X-Terminal System
System
Development
Phase
Software
DBG Debugger
PIXIE Profiler
MIPS Compiler Suite
Stand-Alone Libraries
Floating Point Library
Cross Development Tools
Adobe PostScript PDL
MicroSoft TrueImage PDL
PeerlessPage Printer OS
X-Server
Hardware
Hardware Models
General CAD Tools
Evaluation Board
Laser Printer System
Support Chips
Figure 7. R3051 Family Development Toolchain
System
Integration
and Verfification
Logic Analysis
Diagnostics
IDT/sim
IDT/kit
In-Circuit Emulation
Remote Debug
Real-Time OS
2889 drw 07
5.56
IDT79R3081 RISControllerMILITARY AND COMMERICAL TEMPERATURE RANGES
many general purpose computing applications, such as ARC
compliant systems.
• Autonomous multiply and divide operations. The R3051
family features an on-chip integer multiplier/divide unit which is
separate from the other ALU. This allows the CPU to perform
multiply or divide operations in parallel with other integer operations, using a single multiply or divide instruction rather than
“step” operations.
• Integrated write buffer. The R3081 features a four deep
write buffer, which captures store target addresses and data at
the processor execution rate and retires it to main memory at
the slower main memory access rate. Use of on-chip write
buffers eliminates the need for the processor to stall when performing store operations.
• Burst read support. The R3051 family enables the system
designer to utilize page mode or nibble mode RAMs when performing read operations to minimize the main memory read
penalty and increase the effective cache hit rates.
These techniques combine to allow the processor to
achieve over 43 VUPS integer performance, 13MFlops of Linpack performance, and 70,000 dhrystones without the use of
external caches or zero wait-state memory devices.
The performance differences between the various family
members depends on the application software and the design
of the memory system. The impact of the various cache sizes,
and the hardware floating point, can be accurately modeled
using Cache-3051. Since the R3041, R3051, R3052, R3071,
and R3081 are all pin and software compatible, the system
designer has maximum freedom in trading between performance and cost. A system can be designed, and later the
appropriate CPU inserted into the board, depending on the
desired system performance.
SELECTABLE FEATURES
The R3081 allows the system designer to configure certain
aspects of operation. Some of these options are established
when the device is reset, while others are enabled via the
Config registers:
• BigEndian vs. LittleEndian Byte Ordering. The part can be
configured to operate with either byte ordering. ACE/ARC systems typically use Little Endian byte ordering. However, various embedded applications, written originally for a Big Endian
processor such as the MC680x0, are easier to port to a Big
Endian system.
• Data Cache Refill of one or four words. The memory system must be capable of performing four word refills of instruction cache misses. The R3081 allows the system designer to
enable D-Cache refill of one or four words dynamically. Thus,
specialized algorithms can choose one refill size, while the
rest of the system can operate with the other.
• Half-frequency bus mode. The processor can be configured such that the external bus interface is at one-half the frequency of the processor core. This simplifies system design;
however, the large on-chip caches mitigate the performance
impact of using a slower system bus clock.
• Slow bus turn-around. The R3081 allows the system
designer to space processor operations, so that more time
is allowed for transitions between memory and the processor
on the multiplexed address/data bus.
• Configurable cache. The R3081 allows the system designer
to use software to select either a 16kB Instruction Cache/4kB
Data Cache organization, or an 8kB Instruction/8kB Data
Cache organization.
• Cache Coherent Interface. The R3081 has an optional
hardware based cache coherency interface intended to support multi-master systems such as those utilizing DMA
between memory and I/O.
• Optional 1x or 2x clock input. The R3081 can be driven
with an R3051 compatible 2x clock input, or a lower frequency
1x clock input.
THERMAL CONSIDERATIONS
The R3081 utilizes special packaging techniques to
improve the thermal properties of high-speed processors.
Thus, the R3081 is packaged using cavity down packaging,
with an embedded thermal slug to improve thermal transfer to
the surrounding air.
The R3081 utilizes the 84-pin plastic package, with the die
being attached to a heat slug. The DL84 package allows for an
efficient thermal transfer between the die and the heat slug.
The heat slug offers a greater area for convection at any given
temperature. Even nominal amounts of airflow will dramatically
reduce the junction temperature of the die, resulting in cooler
operation. The DL84 package is available at all frequencies,
and is pin and form compatible with the PLCC used for the
R3051. Thus, designers can interchange R3081s and R3051s
in a particular design, without changing their PC Board.
The R3081 is guaranteed in a case temperature range of
0°C to +85°C. The type of package, speed (power) of the
device, and airflow conditions, affect the equivalent ambient
temperature conditions which will meet this specification.
The equivalent allowable ambient temperature, TA, can be
calculated using the thermal resistance from case to ambient
(ØCA) of the given package. The following equation relates
ambient and case temperature:
TA = T C - P * ØCA
where P is the maximum power consumption at hot temperature, calculated by using the maximum Icc specification
for the device.
Typical values for ØCA at various airflows are shown in
Table 1.
Note that the R3081 allows the operational frequency to be
turned down during idle periods to reduce power consumption.
This operation is described in the R3081 Hardware User'sGuide. Reducing the operation frequency dramatically
reduces power consumption.
5.57
IDT79R3081 RISControllerMILITARY AND COMMERICAL TEMPERATURE RANGES
minimize skew due to slow rise or fall times. A typical part will
Airflow (m/s)012345
Ø
DL84
21.012.812.09.99.28.5
JA
have less than 2ns rise or fall (10% to 90% signal times) when
driving the test load.
Therefore, the system designer should use care when
designing for direct SysClk use. Total loading (due to devices
Ø
DL84
Ø
PL84
Ø
PL84
Table 1 Thermal Resistance at Various Airflows
18.310.19.37.26.65.8
CA
36.629.627.026.225.424.8
JA
18.310.19.37.26.65.8
CA
connected on the signal net and the routing of the net itself)
should be minimized to ensure the SysClk output has a
smooth and rapid transition. Long rise and/or fall times may
cause a degradation in the speed capability of an individual
device.
Similarly, the R3081 employs feedback on its ALE output to
ensure adequate address hold time to ALE. The system
designer should be careful when designing the ALE net to
NOTES ON SYSTEM DESIGN
The R3081 has been designed to simplify the task of highspeed system design. Thus, set-up and hold-time requirements have been kept to a minimum, allowing a wide variety of
system interface strategies.
To minimize these AC parameters, the R3081 employs
minimize total loading and to minimize skew between ALE and
the A/D bus, which will ensure adequate address access latch
time.
IDT's field and factory applications groups can provide the
system designer with assistance for these and other design
issues.
feedback from its SysClk output to the internal bus interface
unit. This allows the R3081 to reference input signals to the
reference clock seen by the external system. The SysClk output is designed to provide relatively large AC drive to
5.58
IDT79R3081 RISControllerMILITARY AND COMMERCIAL TEMPERATURE RANGES
PIN DESCRIPTION
PIN NAMEI/ODESCRIPTION
A/D(31:0)I/OAddress/Data: A 32-bit time multiplexed bus which indicates the desired address for a bus transaction
in one phase, and which is used to transmit data between the CPU and external memory resources during
the rest of the transfer.
Bus transactions on this bus are logically separated into two phases: during the first phase, information
about the transfer is presented to the memory system to be captured using the ALE output. This
information consists of:
Address(31:4):The high-order address for the transfer is presented on A/D(31:4).BEBE(3:0):These strobes indicate which bytes of the 32-bit bus will be involved in
the transfer, and are presented on A/D(3:0).
During write cycles, the bus contains the data to be stored and is driven from the internal write buffer.
On read cycles, the bus receives the data from the external resource, in either a single data transaction
or in a burst of four words, and places it into the on-chip read buffer.
During cache coherency operations, the R3081 monitors the A/D bus at the start of a DMA write to capture
the write target address for potential data cache invalidates.
Addr(3:2)OLow Address (3:2) A 2-bit bus which indicates which word is currently expected by the processor.
Specifically, this two bit bus presents either the address bits for the single word to be transferred (writes
or single datum reads) or functions as a two bit counter starting at ‘00’ for burst read operations.
During cache coherency operations, the R3081 monitors the Addr bus at the start of a DMA write to
capture the write target address for potential data cache invalidates.
Diag(1)ODiagnostic Pin 1. This output indicates whether the current bus read transaction is due to an on-chip
cache miss, and also presents part of the miss address. The value output on this pin is time multiplexed:
Cached:During the phase in which the A/D bus presents address information, this
pin is an active HIGH output which indicates whether the current read is
a result of a cache miss.
Miss Address (3):During the remainder of the read operation, this output presents address
bit (3) of the address the processor was attempting to reference when the
cache miss occurred. Regardless of whether a cache miss is being
processed, this pin reports the transfer address during this time.
On write cycles, this output signals whether the data being written as retained in the on-chip data cache.
The value of this pin is time multiplexed during writes:
Cached:During the address phase of write transactions, this signal is an active
high output which indicates that the store data was retained in the on-chip
data cache.
Reserved:The value of this pin during the data phase of writes is reserved.
Diag(0)ODiagnostic Pin 0. This output distinguishes cache misses due to instruction references from those
due to data references, and presents the remaining bit of the miss address. The value output on this
pin is also time multiplexed:
I/
DD:If the “Cached” Pin indicates a cache miss, then a high on this pin at this
time indicates an instruction reference, and a low indicates a data
reference. If the read is not due to a cache miss but rather an uncached
reference, then this pin is undefined during this phase.
Miss Address (2):During the remainder of the read operation, this output presents
address bit (2) of the address the processor was attempting to
reference when the cache miss occurred. Regardless of whether a
cache miss is being processed, this pin reports the transfer address
during this time.
During write cycles, the value of this pin during both the address and data phases is reserved.
2889 tbl 02
5.59
IDT79R3081 RISControllerMILITARY AND COMMERCIAL TEMPERATURE RANGES
PIN DESCRIPTION (Continued):
PIN NAMEI/ODESCRIPTION
ALEI/OAddress Latch Enable: Used to indicate that the A/D bus contains valid address information for the bus
transaction. This signal is used by external logic to capture the address for the transfer, typically using
transparent latches.
During cache coherency operations, the R3081 monitors ALE at the start of a DMA write, to capture the write
target address for potential data cache invalidates.
RdWr
DataEn
Burst/WrNear
Ack
RdCEn
SysClk
BusReq
BusGnt
IvdReq
CohReq
SBrCond(3:2)IBranch Condition Port: These external signals are internally connected to the CPU signals CpCond(3:0).
BrCond(0)These signals can be used by the branch on co-processor condition instructions as input ports. There are two
BusError
ORead: An output which indicates that the current bus transaction is a read.
I/OWrite: An output which indicates that the current bus transaction is a write. During coherent DMA, this input
indicates that the current transfer is a write.
OExternal Data Enable: This signal indicates that the A/D bus is no longer being driven by the processor during
read cycles, and thus the external memory system may enable the drivers of the memory system onto this bus
without having a bus conflict occur. During write cycles, or when no bus transaction is occurring, this signal is
negated, thus disabling the external memory drivers
OBurst Transfer/Write Near: On read transactions, the
Burst
signal indicates that the current bus read is
requesting a block of four contiguous words from memory. This signal is asserted only in read cycles due to
cache misses; it is asserted for all I-Cache miss read cycles, and for D-Cache miss read cycles if quad word refill
is currently selected.
On write transactions, the
WrNear
output tells the external memory system that the bus interface unit is
performing back-to-back write transactions to an address within the same 512 word page as the prior write
transaction. This signal is useful in memory systems which employ page mode or static column DRAMs, and
allows near writes to be retired quickly.
IAcknowledge: An input which indicates to the device that the memory system has sufficiently processed the
bus transaction, and that the CPU may either terminate the write cycle or process the read data from this read
transfer.
During Coherent DMA, this input indicates that the current write transfer is completed, and that the internal
invalidation address counter should be incremented.
IRead Buffer Clock Enable: An input which indicates to the device that the memory system has placed valid
data on the A/D bus, and that the processor may move the data into the on-chip Read Buffer.
OSystem Reference Clock: An output from the CPU which reflects the timing of the internal processor "Sys"
clock. This clock is used to control state transitions in the read buffer, write buffer, memory controller, and bus
interface unit. This clock will either be at the same frequency as the CPU execution rate clock, or at one-half
that frequency, as selected during reset.
IDMA Arbiter Bus Request: An input to the device which requests that the CPU tri-state its bus interface signals
so that they may be driven by an external master.
ODMA Arbiter Bus Grant. An output from the CPU used to acknowledge that a
BusReq
has been detected, and
that the bus is relinquished to the external master.
IInvalidate Request. An input provided by an external DMA controller to request that the CPU invalidate the
Data Cache line corresponding to the current DMA write target address. This signal is the same pin as Diag(0)
ICoherent DMA Request. An input used by the external DMA controller to indicate that the requested DMA
operations could involve hardware cache coherency. This signal is the Rsvd(0) of the R3051.
types of Branch Condition inputs: the SBrCond inputs have special internal logic to synchronize the inputs, and
thus may be driven by asynchronous agents. The direct Branch Condition inputs must be driven synchronously.
Note that BrCond(1) is used by the internal FPA, and thus is not available on an external pin.
IBus Error: Input to the bus interface unit to terminate a bus transaction due to an external bus error. This signal
is only sampled during read and write operations. If the bus transaction is a read operation, then the CPU will
take a bus error exception.
2889 tbl 03
5.510
IDT79R3081 RISControllerMILITARY AND COMMERCIAL TEMPERATURE RANGES
PIN DESCRIPTION (Continued):
PIN NAMEI/ODESCRIPTION
Int
(5:3)IProcessor Interrupt: During normal operation, these signals are logically the same as the
signals of the R3000. During processor reset, these signals perform mode initialization of the CPU, but in a
different (simpler) fashion than the interrupt signals of the R3000.
There are two types of interrupt inputs: the
SInt
inputs are internally synchronized by the processor, and may
be driven by an asynchronous external agent. The direct interrupt inputs are not internally synchronized, and
thus must be externally synchronized to the CPU. The direct interrupt inputs have one cycle lower latency than
the synchronized interrupts. Note that the interrupt used by the on-chip FPA will not be monitored externally.
ClkInIMaster Clock Input: This input clock can be provided at the execution frequency of the CPU (1x clock mode)
or at twice that frequency (2x clock mode), as selected at reset.
Reset
IMaster Processor Reset: This signal initializes the CPU. Mode selection is performed during the last cycle
of
Reset
.
Rsvd(4:1)I/OReserved: These four signal pins are reserved for testing and for future revisions of this device. Users must not
connect these pins. Note that Rsvd(0) of the R3051 is now used for the
CohReq
input pin.
Int
(5:0)
SInt
2889 tbl 04
(2:0)
ABSOLUTE MAXIMUM RATINGS
(1, 3)
SymbolRatingCommercialMilitaryUnit
TERMTerminal Voltage –0.5 to +7.0 –0.5 to +7.0V
V
with Respect
to GND
T
COperating Case 0 to +85 –55 to +125°C
Temperature
T
BIASCase Temperature –55 to +125 –65 to +135°C
Under Bias
T
STGStorage –55 to +125 –65 to +155°C
Temperature
V
INInput Voltage –0.5 to +7.0 –0.5 to +7.0V
NOTES:
1. Stresses greater than those listed under ABSOLUTE MAXIMUM RATINGS
may cause permanent damage to the device. This is a stress rating only
and functional operation of the device at these or any other conditions
above those indicated in the operational sections of this specification is not
implied. Exposure to absolute maximum rating conditions for extended
periods may affect reliability.
IN minimum = –3.0V for pulse width less than 15ns.
2. V
V
IN should not exceed VCC +0.5V.
3. Not more than one output should be shorted at a time. Duration of the short
should not exceed 30 seconds.
2889 tbl 05
AC TEST CONDITIONS—R3081
SymbolParameterMin.Max.Unit
IHInput HIGH Voltage3.0—V
V
V
ILInput LOW Voltage—0V
IHSInput HIGH Voltage3.5—V
V
ILSInput LOW Voltage—0V
V
AC TEST CONDITIONS—RV3081
SymbolParameterMin.Max.Unit
V
IHInput HIGH Voltage3.0—V
ILInput LOW Voltage—0V
V
IHSInput HIGH Voltage3.0—V
V
V
ILSInput LOW Voltage—0V
OUTPUT LOADING FOR AC TESTING
+4mA
2889 tbl 06
2889 tbl 06
RECOMMENDED OPERATING
TEMPERATURE AND SUPPLY VOLTAGE
GradeTemperature(Case)GNDVCC
Military–55°C to +125°C0V5.0 ±10%
Commercial0°C to +85°C0V5.0 ±5%
Commercial0°C to +85°C0V3.3 ±5%
V
REF
+1.5V
–
+
–4mA
SignalCLD
2889 tbl 07
SysClk
50 pf
All Others25 pf
2889 tbl 08
5.511
To Device
Under Test
LD
C
2889 drw 08
IDT79R3081 RISControllerMILITARY AND COMMERCIAL TEMPERATURE RANGES
tderateAll outputsTiming deration for loading—1—1ns/
NOTES:
1. All timings referenced to 1.5V. All timings measured with respect to a 2.5ns rise and fall time.
2. The AC values listed here reference timing diagrams contained in the
3. Guaranteed by design.
4. This parameter is used to derate the AC timings according to the loading of the system. This parameter provides a deration for loads over the specified
test condition; that is, the deration factor is applied for each 25pF over the specified test load condition.
5. In 1x clock mode, t22 is replaced by t44/2.
6. In 1x clock mode, the design guarantees that the input clock rise and fall times can be as long as 5ns.
SysClk
(full frequency mode)Pulse Width
t32
SysClk
(full frequency mode)Clock High Time
t33
SysClk
(full frequency mode)Clock LOW Time
SysClk
(half frequency mode)Pulse Width
t34
SysClk
(half frequency mode)Clock HIGH Time
t35
SysClk
(half frequency mode)Clock LOW Time
t36ALESet-up to
t37ALEHold from
t38A/DSet-up to ALE falling10—9—ns
t39A/DHold from ALE falling2—2—ns
t40
Wr
Set-up to
t41
Wr
Hold from
t42ClkIn (1x clock mode)Pulse Width HIGH
t43ClkIn (1x clock mode)Pulse Width LOW
t44ClkIn (1x clock mode)Clock Period
SysClk
falling9—8—ns
SysClk
falling2—2—ns
SysClk
rising10—9—ns
SysClk
rising3—3—ns
SysClk
Pulse Width from Vcc valid200—200—µs
Minimum Pulse Width32—32—tsys
Set-up to
Mode set-up to
Mode hold from