Datasheet Stratix II Datasheet (ALTERA)

Page 1

SII51002-4.3

2. Stratix II Architecture

Functional Description

Stratix®II devices contain a two-dimensional row- and column-based architecture to implement custom logic. A series of column and row interconnects of varying length and speed provides signal interconnects between logic array blocks (LABs), memory block structures (M512 RAM, M4K RAM, and M-RAM blocks), and digital signal processing (DSP) blocks.

Each LAB consists of eight adaptive logic modules (ALMs). An ALM is the Stratix II device family’s basic building block of logic providing efficient implementation of user logic functions. LABs are grouped into rows and columns across the device.

M512 RAM blocks are simple dual-port memory blocks with 512 bits plus parity (576 bits). These blocks provide dedicated simple dual-port or single-port memory up to 18-bits wide at up to 500 MHz. M512 blocks are grouped into columns across the device in between certain LABs.

M4K RAM blocks are true dual-port memory blocks with 4K bits plus parity (4,608 bits). These blocks provide dedicated true dual-port, simple dual-port, or single-port memory up to 36-bits wide at up to 550 MHz. These blocks are grouped into columns across the device in between certain LABs.

M-RAM blocks are true dual-port memory blocks with 512K bits plus parity (589,824 bits). These blocks provide dedicated true dual-port, simple dual-port, or single-port memory up to 144-bits wide at up to 420 MHz. Several M-RAM blocks are located individually in the device's logic array.

DSP blocks can implement up to either eight full-precision 9 × 9-bit multipliers, four full-precision 18 × 18-bit multipliers, or one full-precision 36 × 36-bit multiplier with add or subtract features. The DSP blocks support Q1.15 format rounding and saturation in the multiplier and accumulator stages. These blocks also contain shift registers for digital signal processing applications, including finite impulse response (FIR) and infinite impulse response (IIR) filters. DSP blocks are grouped into columns across the device and operate at up to 450 MHz.

Altera Corporation 2–1 May 2007

Page 2

Functional Description

Each Stratix II device I/O pin is fed by an I/O element (IOE) located at the end of LAB rows and columns around the periphery of the device. I/O pins support numerous single-ended and differential I/O standards. Each IOE contains a bidirectional I/O buffer and six registers for registering input, output, and output-enable signals. When used with dedicated clocks, these registers provide exceptional performance and interface support with external memory devices such as DDR and DDR2 SDRAM, RLDRAM II, and QDR II SRAM devices. High-speed serial interface channels with dynamic phase alignment (DPA) support data transfer at up to 1 Gbps using LVDS or HyperTransport standards.

Figure 2–1 shows an overview of the Stratix II device.

Figure 2–1. Stratix II Block Diagram

M512 RAM Blocks for Dual-Port Memory, Shift Registers, & FIFO Buffers

DSP Blocks for Multiplication and Full Implementation of FIR Filters

M4K RAM Blocks for True Dual-Port Memory & Other Embedded Memory Functions

IOEs Support DDR, PCI, PCI-X, SSTL-3, SSTL-2, HSTL-1, HSTL-2, LVDS, HyperTransport & other I/O Standards

technology I/O

IOEs

LABs

IOEs

LABs

DSP Block

IOEs IOEs

LABs LABs

LABs

LABs LABs

LABs

LABs LABs

LABs

LABsLABs

M-RAM Block

LABsLABs

2–2 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 3

The number of M512 RAM, M4K RAM, and DSP blocks varies by device along with row and column numbers and M-RAM blocks. Table 2–1 lists the resources available in Stratix II devices.

Table 2–1. Stratix II Device Resources

Stratix II Architecture

Device

EP2S15 4 / 104 3 / 78 0 2 / 12 30 26

EP2S30 6 / 202 4 / 144 1 2 / 16 49 36

EP2S60 7 / 329 5 / 255 2 3 / 36 62 51

EP2S90 8 / 488 6 / 408 4 3 / 48 71 68

EP2S130 9 / 699 7 / 609 6 3 / 63 81 87

EP2S180 11 / 930 8 / 768 9 4 / 96 100 96

Logic Array Blocks

M512 RAM

Columns/Blocks

Each LAB consists of eight ALMs, carry chains, shared arithmetic chains, LAB control signals, local interconnect, and register chain connection lines. The local interconnect transfers signals between ALMs in the same

M4K RAM

Columns/Blocks

M-RAM

Blocks

DSP Block

Columns/Blocks

LAB

Columns

LAB Rows

LAB. Register chain connections transfer the output of an ALM register to

the adjacent ALM register in an LAB. The Quartus

II Compiler places associated logic in an LAB or adjacent LABs, allowing the use of local, shared arithmetic chain, and register chain connections for performance and area efficiency. Figure 2–2 shows the Stratix II LAB structure.

Altera Corporation 2–3 May 2007 Stratix II Device Handbook, Volume 1

Page 4

Logic Array Blocks

Figure 2–2. Stratix II LAB Structure

Direct link interconnect from adjacent block

Row Interconnects of Variable Speed & Length

ALMs

Direct link interconnect from adjacent block

Direct link interconnect to adjacent block

Local Interconnect

LAB

Local Interconnect is Driven

from Either Side by Columns & LABs,

& from Above by Rows

Column Interconnects of Variable Speed & Length

LAB Interconnects

The LAB local interconnect can drive ALMs in the same LAB. It is driven by column and row interconnects and ALM outputs in the same LAB. Neighboring LABs, M512 RAM blocks, M4K RAM blocks, M-RAM blocks, or DSP blocks from the left and right can also drive an LAB's local interconnect through the direct link connection. The direct link connection feature minimizes the use of row and column interconnects, providing higher performance and flexibility. Each ALM can drive 24 ALMs through fast local and direct link interconnects. Figure 2–3 shows the direct link connection.

2–4 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 5

Figure 2–3. Direct Link Connection

Direct link interconnect from

left LAB, TriMatrix memory

block, DSP block, or IOE output

Stratix II Architecture

Direct link interconnect from right LAB, TriMatrix memory block, DSP block, or IOE output

ALMs

Direct link

interconnect

to left

Direct link interconnect to right

Local

Interconnect

LAB Control Signals

Each LAB contains dedicated logic for driving control signals to its ALMs. The control signals include three clocks, three clock enables, two asynchronous clears, synchronous clear, asynchronous preset/load, and synchronous load control signals. This gives a maximum of 11 control signals at a time. Although synchronous load and clear signals are generally used when implementing counters, they can also be used with other functions.

Each LAB can use three clocks and three clock enable signals. However, there can only be up to two unique clocks per LAB, as shown in the LAB control signal generation circuit in Figure 2–4. Each LAB's clock and clock enable signals are linked. For example, any ALM in a particular LAB using the labclk1 signal also uses labclkena1. If the LAB uses both the rising and falling edges of a clock, it also uses two LAB-wide clock signals. De-asserting the clock enable signal turns off the corresponding LAB-wide clock.

Each LAB can use two asynchronous clear signals and an asynchronous load/preset signal. By default, the Quartus II software uses a NOT gate push-back technique to achieve preset. If you disable the NOT gate push-up option or assign a given register to power up high using the Quartus II software, the preset is achieved using the asynchronous load

Altera Corporation 2–5 May 2007 Stratix II Device Handbook, Volume 1

Page 6

Adaptive Logic Modules

signal with asynchronous load data input tied high. When the asynchronous load/preset signal is used, the labclkena0 signal is no longer available.

The LAB row clocks [5..0] and LAB local interconnect generate the LAB-wide control signals. The MultiTrack skew allows clock and control signal distribution in addition to data.

Figure 2–4 shows the LAB control signal generation circuit.

Figure 2–4. LAB-Wide Control Signals

Dedicated Row LAB Clocks

Local Interconnect

There are two unique

clock signals per LAB.

interconnect's inherent low

labclr1

Adaptive Logic Modules

labclk0

labclkena0

or asyncload

or labpreset

labclk1

labclkena1 labclkena2 labclr0 synclr

labclk2

syncload

The basic building block of logic in the Stratix II architecture, the adaptive logic module (ALM), provides advanced features with efficient logic utilization. Each ALM contains a variety of look-up table (LUT)-based resources that can be divided between two adaptive LUTs (ALUTs). With up to eight inputs to the two ALUTs, one ALM can implement various combinations of two functions. This adaptability allows the ALM to be

2–6 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 7

completely backward-compatible with four-input LUT architectures. One

ALM can also implement any function of up to six inputs and certain seven-input functions.

In addition to the adaptive LUT-based resources, each ALM contains two programmable registers, two dedicated full adders, a carry chain, a shared arithmetic chain, and a register chain. Through these dedicated resources, the ALM can efficiently implement various arithmetic functions and shift registers. Each ALM drives all types of interconnects: local, row, column, carry chain, shared arithmetic chain, register chain, and direct link interconnects. Figure 2–5 shows a high-level block diagram of the Stratix II ALM while Figure 2–6 shows a detailed view of all the connections in the ALM.

Figure 2–5. High-Level Block Diagram of the Stratix II ALM

carry_in

shared_arith_in

dataf0

datae0

dataa

datab

datac

datad

datae1

dataf1

Combinational

Logic

shared_arith_out

adder0

adder1

carry_out

reg_chain_in

reg_chain_out

reg0

reg1

Stratix II Architecture

To general o

local routing

To general o

local routing

To general o

local routing

To general o

local routing

Altera Corporation 2–7 May 2007 Stratix II Device Handbook, Volume 1

Page 8

Adaptive Logic Modules

Figure 2–6. Stratix II ALM Details

Row, column &

direct link routing

Local

Interconnect

direct link routing

Row, column &

direct link routing

Local

Interconnect

direct link routing

sclr asyncload

syncload ena[2..0]

reg_chain_in

carry_in

shared_arith_in

4-Input

LUT

ADATA

PRN/ALD

ENA

3-Input

CLRN

LUT

3-Input

LUT

4-Input

LUT

ADATA

PRN/ALD

ENA

3-Input

CLRN

LUT

3-Input

LUT

reg_chain_out

carry_out clk[2..0]

shared_arith_out aclr[1..0]

dataf0

Local

Interconnect

datae0

Local

Interconnect

datac

Local

Interconnect

dataa

Local

Interconnect

datab

Local

Interconnect

datad

Interconnect

Local

datae1

Local

Interconnect

dataf1

Local

Interconnect

2–8 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 9

Stratix II Architecture

One ALM contains two programmable registers. Each register has data, clock, clock enable, synchronous and asynchronous clear, asynchronous load data, and synchronous and asynchronous load/preset inputs. Global signals, general-purpose I/O pins, or any internal logic can drive the register's clock and clear control signals. Either general-purpose I/O pins or internal logic can drive the clock enable, preset, asynchronous load, and asynchronous load data. The asynchronous load data input comes from the datae or dataf input of the ALM, which are the same inputs that can be used for register packing. For combinational functions, the register is bypassed and the output of the LUT drives directly to the outputs of the ALM.

Each ALM has two sets of outputs that drive the local, row, and column routing resources. The LUT, adder, or register output can drive these output drivers independently (see Figure 2–6). For each set of output drivers, two ALM outputs can drive column, row, or direct link routing connections, and one of these ALM outputs can also drive local interconnect resources. This allows the LUT or adder to drive one output while the register drives another output. This feature, called register packing, improves device utilization because the device can use the register and the combinational logic for unrelated functions. Another special packing mode allows the register output to feed back into the LUT of the same ALM so that the register is packed with its own fan-out LUT. This provides another mechanism for improved fitting. The ALM can also drive out registered and unregistered versions of the LUT or adder output.

f See the Performance & Logic Efficiency Analysis of Stratix II Devices White

Paper for more information on the efficiencies of the Stratix II ALM and

comparisons with previous architectures.

ALM Operating Modes

The Stratix II ALM can operate in one of the following modes:

■ Normal mode

■ Extended LUT mode

■ Arithmetic mode

■ Shared arithmetic mode

Each mode uses ALM resources differently. In each mode, eleven available inputs to the ALM--the eight data inputs from the LAB local interconnect; carry-in from the previous ALM or LAB; the shared arithmetic chain connection from the previous ALM or LAB; and the register chain connection--are directed to different destinations to implement the desired logic function. LAB-wide signals provide clock, asynchronous clear, asynchronous preset/load, synchronous clear,

Altera Corporation 2–9 May 2007 Stratix II Device Handbook, Volume 1

Page 10

Adaptive Logic Modules

synchronous load, and clock enable control for the register. These LABwide signals are available in all ALM modes. See the “LAB Control

Signals” section for more information on the LAB-wide control signals.

The Quartus II software and supported third-party synthesis tools, in conjunction with parameterized functions such as library of parameterized modules (LPM) functions, automatically choose the appropriate mode for common functions such as counters, adders, subtractors, and arithmetic functions. If required, you can also create special-purpose functions that specify which ALM operating mode to use for optimal performance.

Normal Mode

The normal mode is suitable for general logic applications and combinational functions. In this mode, up to eight data inputs from the LAB local interconnect are inputs to the combinational logic. The normal mode allows two functions to be implemented in one Stratix II ALM, or an ALM to implement a single function of up to six inputs. The ALM can support certain combinations of completely independent functions and various combinations of functions which have common inputs.

Figure 2–7 shows the supported LUT combinations in normal mode.

2–10 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 11

Figure 2–7. ALM in Normal Mode Note (1)

Stratix II Architecture

dataf0

datae0

datac dataa

datab datad

datae1

dataf1

dataf0

datae0

datac dataa datab

datad

datae1

dataf1

dataf0

datae0

datac dataa datab

4-Input

LUT

4-Input

LUT

5-Input

LUT

3-Input

LUT

5-Input

LUT

combout0

combout1

combout0

combout1

combout0

dataf0

datae0

datac dataa datab

datad

datae1

dataf1

dataf0

datae0

dataa datab datac datad

dataf0

datae0

dataa datab datac datad

5-Input

LUT

5-Input

LUT

6-Input

LUT

6-Input

LUT

combout0

combout1

combout0

datad

datae1

dataf1

4-Input

LUT

combout1

datae1

dataf1

6-Input

LUT

combout1

Note to Figure 2–7:

(1) Combinations of functions with fewer inputs than those shown are also supported. For example, combinations of

functions with the following number of inputs are supported: 4 and 3, 3 and 3, 3 and 2, 5 and 2, etc.

The normal mode provides complete backward compatibility with fourinput LUT architectures. Two independent functions of four inputs or less can be implemented in one Stratix II ALM. In addition, a five-input function and an independent three-input function can be implemented without sharing inputs.

Altera Corporation 2–11 May 2007 Stratix II Device Handbook, Volume 1

Page 12

Adaptive Logic Modules

For the packing of two five-input functions into one ALM, the functions must have at least two common inputs. The common inputs are dataa and datab. The combination of a four-input function with a five-input function requires one common input (either dataa or datab).

In the case of implementing two six-input functions in one ALM, four inputs must be shared and the combinational function must be the same. For example, a 4 × 2 crossbar switch (two 4-to-1 multiplexers with common inputs and unique select lines) can be implemented in one ALM, as shown in Figure 2–8. The shared inputs are dataa, datab, datac, and

datad, while the unique select lines are datae0 and dataf0 for function0, and datae1 and dataf1 for function1. This crossbar

switch consumes four LUTs in a four-input LUT-based architecture.

Figure 2–8. 4 × 2 Crossbar Switch Example

4 × 2 Crossbar Switch Implementation in 1 ALM

sel0[1..0]

inputa inputb

inputc inputd

out0

out1

dataf0

datae0

dataa datab datac datad

Six-Input

LUT

(Function0)

combout0

sel1[1..0]

datae1

dataf1

Six-Input

LUT

(Function1)

combout1

In a sparsely used device, functions that could be placed into one ALM may be implemented in separate ALMs. The Quartus II Compiler spreads a design out to achieve the best possible performance. As a device begins to fill up, the Quartus II software automatically utilizes the full potential of the Stratix II ALM. The Quartus II Compiler automatically searches for functions of common inputs or completely independent functions to be placed into one ALM and to make efficient use of the device resources. In addition, you can manually control resource usage by setting location assignments.

Any six-input function can be implemented utilizing inputs dataa,

datab, datac, datad, and either datae0 and dataf0 or datae1 and dataf1. If datae0 and dataf0 are utilized, the output is driven to register0, and/or register0 is bypassed and the data drives out to

the interconnect using the top set of output drivers (see Figure 2–9). If

2–12 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 13

Stratix II Architecture

datae1 and dataf1 are utilized, the output drives to register1 and/or bypasses register1 and drives to the interconnect using the bottom set of output drivers. The Quartus II Compiler automatically selects the inputs to the LUT. Asynchronous load data for the register comes from the datae or dataf input of the ALM. ALMs in normal mode support register packing.

Figure 2–9. 6-Input Function in Normal Mode Notes (1), (2)

dataf0

datae0

dataa datab datac datad

datae1

dataf1

(2)

These inputs are available for register packing.

6-Input

LUT

reg0

reg1

To general or local routing

Notes to Figure 2–9:

(1) If datae1 and dataf1 are used as inputs to the six-input function, then datae0

and dataf0 are available for register packing.

(2) The dataf1 input is available for register packing only if the six-input function is

un-registered.

Extended LUT Mode

The extended LUT mode is used to implement a specific set of seven-input functions. The set must be a 2-to-1 multiplexer fed by two arbitrary five-input functions sharing four inputs. Figure 2–10 shows the template of supported seven-input functions utilizing extended LUT mode. In this mode, if the seven-input function is unregistered, the unused eighth input is available for register packing.

Functions that fit into the template shown in Figure 2–10 occur naturally in designs. These functions often appear in designs as “if-else” statements in Verilog HDL or VHDL code.

Altera Corporation 2–13 May 2007 Stratix II Device Handbook, Volume 1

Page 14

Adaptive Logic Modules

Figure 2–10. Template for Supported Seven-Input Functions in Extended LUT Mode

datae0

datac dataa datab datad

dataf0

datae1

dataf1

(1)

5-Input

LUT

5-Input

LUT

This input is available for register packing.

combout0

reg0

To general o

local routing

To general o

local routing

Note to Figure 2–10:

(1) If the seven-input function is unregistered, the unused eighth input is available for register packing. The second

Arithmetic Mode

The arithmetic mode is ideal for implementing adders, counters, accumulators, wide parity functions, and comparators. An ALM in arithmetic mode uses two sets of two four-input LUTs along with two dedicated full adders. The dedicated adders allow the LUTs to be available to perform pre-adder logic; therefore, each adder can add the output of two four-input functions. The four LUTs share the dataa and

datab inputs. As shown in Figure 2–11, the carry-in signal feeds to adder0, and the carry-out from adder0 feeds to carry-in of adder1. The

carry-out from adder1 drives to adder0 of the next ALM in the LAB. ALMs in arithmetic mode can drive out registered and/or unregistered versions of the adder outputs.

2–14 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 15

Figure 2–11. ALM in Arithmetic Mode

carry_in

Stratix II Architecture

datae0

dataf0

datac datab dataa

datad

datae1

dataf1

4-Input

LUT

4-Input

LUT

4-Input

LUT

4-Input

LUT

carry_out

adder0

adder1

reg0

reg1

To general or

local routing

To general or

local routing

To general or

local routing

To general o

local routing

While operating in arithmetic mode, the ALM can support simultaneous use of the adder's carry output along with combinational logic outputs. In this operation, the adder output is ignored. This usage of the adder with the combinational logic output provides resource savings of up to 50% for functions that can use this ability. An example of such functionality is a conditional operation, such as the one shown in Figure 2–12. The equation for this example is:

R = (X < Y) ? Y : X

To implement this function, the adder is used to subtract ‘Y’ from ‘X.’ If ‘X’ is less than ‘Y,’ the carry_out signal is ‘1.’ The carry_out signal is fed to an adder where it drives out to the LAB local interconnect. It then feeds to the LAB-wide syncload signal. When asserted, syncload selects the syncdata input. In this case, the data ‘Y’ drives the

syncdata inputs to the registers. If ‘X’ is greater than or equal to ‘Y,’ the syncload signal is de-asserted and ‘X’ drives the data port of the

registers.

Altera Corporation 2–15 May 2007 Stratix II Device Handbook, Volume 1

Page 16

Adaptive Logic Modules

Figure 2–12. Conditional Operation Example

Adder output is not used.

ALM 1

X[0]

Y[0]

syncdata

X[1]

Y[1]

Carry Chain

X[2]

Y[2]

ALM 2

Comb &

Adder

Logic

Comb &

Adder

Logic

Comb &

Adder

Logic

Comb &

Adder

Logic

X[0]

X[1]

X[2]

syncload

reg0

reg1

reg0

carry_out

R[0]

R[1]

R[2]

To general or local routing

To local routing & then to LAB-wide syncload

The arithmetic mode also offers clock enable, counter enable, synchronous up/down control, add/subtract control, synchronous clear, synchronous load. The LAB local interconnect data inputs generate the clock enable, counter enable, synchronous up/down and add/subtract control signals. These control signals are good candidates for the inputs that are shared between the four LUTs in the ALM. The synchronous clear and synchronous load options are LAB-wide signals that affect all registers in the LAB. The Quartus II software automatically places any registers that are not used by the counter into other LABs.

Carry Chain

The carry chain provides a fast carry function between the dedicated adders in arithmetic or shared arithmetic mode. Carry chains can begin in either the first ALM or the fifth ALM in an LAB. The final carry-out signal is routed t o an ALM , wh ere it i s fe d to loc al, row, or col umn int erc onn ect s.

2–16 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 17

Stratix II Architecture

The Quartus II Compiler automatically creates carry chain logic during design processing, or you can create it manually during design entry. Parameterized functions such as LPM functions automatically take advantage of carry chains for the appropriate functions.

The Quartus II Compiler creates carry chains longer than 16 (8 ALMs in arithmetic or shared arithmetic mode) by linking LABs together automatically. For enhanced fitting, a long carry chain runs vertically allowing fast horizontal connections to TriMatrix memory and DSP blocks. A carry chain can continue as far as a full column.

To avoid routing congestion in one small area of the device when a high fan-in arithmetic function is implemented, the LAB can support carry chains that only utilize either the top half or the bottom half of the LAB before connecting to the next LAB. This leaves the other half of the ALMs in the LAB available for implementing narrower fan-in functions in normal mode. Carry chains that use the top four ALMs in the first LAB carry into the top half of the ALMs in the next LAB within the column. Carry chains that use the bottom four ALMs in the first LAB carry into the bottom half of the ALMs in the next LAB within the column. Every other column of LABs is top-half bypassable, while the other LAB columns are bottom-half bypassable.

See the “MultiTrack Interconnect” on page 2–22 section for more information on carry chain interconnect.

Shared Arithmetic Mode

In shared arithmetic mode, the ALM can implement a three-input add. In this mode, the ALM is configured with four 4-input LUTs. Each LUT either computes the sum of three inputs or the carry of three inputs. The output of the carry computation is fed to the next adder (either to adder1 in the same ALM or to adder0 of the next ALM in the LAB) via a dedicated connection called the shared arithmetic chain. This shared arithmetic chain can significantly improve the performance of an adder tree by reducing the number of summation stages required to implement an adder tree. Figure 2–13 shows the ALM in shared arithmetic mode.

Altera Corporation 2–17 May 2007 Stratix II Device Handbook, Volume 1

Page 18

Adaptive Logic Modules

Figure 2–13. ALM in Shared Arithmetic Mode

shared_arith_in

carry_in

4-Input

LUT

datae0

datac datab dataa

datad

datae1

4-Input

LUT

4-Input

LUT

4-Input

LUT

carry_out

shared_arith_out

reg0

reg1

Note to Figure 2–13:

(1) Inputs dataf0 and dataf1 are available for register packing in shared arithmetic mode.

Adder trees can be found in many different applications. For example, the summation of the partial products in a logic-based multiplier can be implemented in a tree structure. Another example is a correlator function that can use a large adder tree to sum filtered data samples in a given time frame to recover or to de-spread data which was transmitted utilizing spread spectrum technology.

To general or

local routing

To general or

local routing

To general or

local routing

To general or

local routing

An example of a three-bit add operation utilizing the shared arithmetic mode is shown in Figure 2–14. The partial sum (S[2..0]) and the partial carry (C[2..0]) is obtained using the LUTs, while the result (R[2..0]) is computed using the dedicated adders.

2–18 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 19

Figure 2–14. Example of a 3-bit Add Utilizing Shared Arithmetic Mode

3-Bit Add Example ALM Implementation

ALM 1

1st stage add is

implemented in LUTs.

2nd stage add is

implemented in adders.

X2 X1 X0

Y2 Y1 Y0 Z2 Z1 Z0

S2 S1 S0

C2 C1 C0

R3 R2 R1 R0

X0 Y0

3-Input

LUT

3-Input

LUT

Stratix II Architecture

shared_arith_in = '0'

carry_in = '0'

Binary Add

1 1 0

1 1 0 1

1 1 0

1 0 1 0 1 0

0 0 1

Decimal

Equivalents

5 2

2 x 6

X1 Y1 Z1

X2 Y2 Z2

3-Input

ALM 2

3-Input

LUT

'0'

LUT

Shared Arithmetic Chain

In addition to the dedicated carry chain routing, the shared arithmetic chain available in shared arithmetic mode allows the ALM to implement a three-input add. This significantly reduces the resources necessary to implement large adder trees or correlator functions.

The shared arithmetic chains can begin in either the first or fifth ALM in an LAB. The Quartus II Compiler creates shared arithmetic chains longer than 16 (8 ALMs in arithmetic or shared arithmetic mode) by linking LABs together automatically. For enhanced fitting, a long shared

Altera Corporation 2–19 May 2007 Stratix II Device Handbook, Volume 1

Page 20

Adaptive Logic Modules

arithmetic chain runs vertically allowing fast horizontal connections to TriMatrix memory and DSP blocks. A shared arithmetic chain can continue as far as a full column.

Similar to the carry chains, the shared arithmetic chains are also top- or bottom-half bypassable. This capability allows the shared arithmetic chain to cascade through half of the ALMs in a LAB while leaving the other half available for narrower fan-in functionality. Every other LAB column is top-half bypassable, while the other LAB columns are bottomhalf bypassable.

See the “MultiTrack Interconnect” on page 2–22 section for more information on shared arithmetic chain interconnect.

Register Chain

In addition to the general routing outputs, the ALMs in an LAB have register chain outputs. The register chain routing allows registers in the same LAB to be cascaded together. The register chain interconnect allows an LAB to use LUTs for a single combinational function and the registers to be used for an unrelated shift register implementation. These resources speed up connections between ALMs while saving local interconnect resources (see Figure 2–15). The Quartus II Compiler automatically takes advantage of these resources to improve utilization and performance.

2–20 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 21

Figure 2–15. Register Chain within an LAB Note (1)

adder0

Combinational

Logic

reg_chain_in

From Previous ALM Within The LAB

reg0

Stratix II Architecture

To general or

local routing

To general or

local routing

To general or

local routing

To general or

local routing

To general or

local routing

To general or

local routing

To general or

local routing

To general or

local routing

Combinational

Logic

adder1

adder0

adder1

reg1

reg0

reg1

reg_chain_out

To Next ALM within the LAB

Note to Figure 2–15:

(1) The combinational or adder logic can be utilized to implement an unrelated, un-registered function.

See the “MultiTrack Interconnect” on page 2–22 section for more information on register chain interconnect.

Altera Corporation 2–21 May 2007 Stratix II Device Handbook, Volume 1

Page 22

MultiTrack Interconnect

Clear & Preset Logic Control

LAB-wide signals control the logic for the register's clear and load/preset signals. The ALM directly supports an asynchronous clear and preset function. The register preset is achieved through the asynchronous load of a logic high. The direct asynchronous preset does not require a NOTgate push-back technique. Stratix II devices support simultaneous asynchronous load/preset, and clear signals. An asynchronous clear signal takes precedence if both signals are asserted simultaneously. Each LAB supports up to two clears and one load/preset signal.

In addition to the clear and load/preset ports, Stratix II devices provide a device-wide reset pin (DEV_CLRn) that resets all registers in the device. An option set before compilation in the Quartus II software controls this pin. This device-wide reset overrides all other control signals.

MultiTrack Interconnect

In the Stratix II architecture, connections between ALMs, TriMatrix memory, DSP blocks, and device I/O pins are provided by the MultiTrack interconnect structure with DirectDrive interconnect consists of continuous, performance-optimized routing lines of different lengths and speeds used for inter- and intra-design block connectivity. The Quartus II Compiler automatically places critical design paths on faster interconnects to improve design performance.

DirectDrive technology is a deterministic routing technology that ensures identical routing resource usage for any function regardless of placement in the device. The MultiTrack interconnect and DirectDrive technology simplify the integration stage of block-based designing by eliminating the re-optimization cycles that typically follow design changes and additions.

The MultiTrack interconnect consists of row and column interconnects that span fixed distances. A routing structure with fixed length resources for all devices allows predictable and repeatable performance when migrating through different device densities. Dedicated row interconnects route signals to and from LABs, DSP blocks, and TriMatrix memory in the same row. These row resources include:

■ Direct link interconnects between LABs and adjacent blocks

■ R4 interconnects traversing four blocks to the right or left

■ R24 row interconnects for high-speed access across the length of the

device

technology. The MultiTrack

2–22 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 23

Stratix II Architecture

The direct link interconnect allows an LAB, DSP block, or TriMatrix memory block to drive into the local interconnect of its left and right neighbors and then back into itself. This provides fast communication between adjacent LABs and/or blocks without using row interconnect resources.

The R4 interconnects span four LABs, three LABs and one M512 RAM block, two LABs and one M4K RAM block, or two LABs and one DSP block to the right or left of a source LAB. These resources are used for fast row connections in a four-LAB region. Every LAB has its own set of R4 interconnects to drive either left or right. Figure 2–16 shows R4 interconnect connections from an LAB. R4 interconnects can drive and be driven by DSP blocks and RAM blocks and row IOEs. For LAB interfacing, a primary LAB or LAB neighbor can drive a given R4 interconnect. For R4 interconnects that drive to the right, the primary LAB and right neighbor can drive on to the interconnect. For R4 interconnects that drive to the left, the primary LAB and its left neighbor can drive on to the interconnect. R4 interconnects can drive other R4 interconnects to extend the range of LABs they can drive. R4 interconnects can also drive C4 and C16 interconnects for connections from one row to another. Additionally, R4 interconnects can drive R24 interconnects.

Figure 2–16. R4 Interconnect Connections Notes (1), (2), (3)

Adjacent LAB can Drive onto Another LAB's R4 Interconnect

R4 Interconnect

Driving Left

LAB

Neighbor

Primary LAB (2)

Notes to Figure 2–16:

(1) C4 and C16 interconnects can drive R4 interconnects. (2) This pattern is repeated for every LAB in the LAB row. (3) The LABs in Figure 2–16 show the 16 possible logical outputs per LAB.

Altera Corporation 2–23 May 2007 Stratix II Device Handbook, Volume 1

C4 and C16 Column Interconnects (1)

LAB

Neighbor

R4 Interconnect Driving Right

Page 24

MultiTrack Interconnect

R24 row interconnects span 24 LABs and provide the fastest resource for long row connections between LABs, TriMatrix memory, DSP blocks, and Row IOEs. The R24 row interconnects can cross M-RAM blocks. R24 row interconnects drive to other row or column interconnects at every fourth LAB and do not drive directly to LAB local interconnects. R24 row interconnects drive LAB local interconnects via R4 and C4 interconnects. R24 interconnects can drive R24, R4, C16, and C4 interconnects.

The column interconnect operates similarly to the row interconnect and vertically routes signals to and from LABs, TriMatrix memory, DSP blocks, and IOEs. Each column of LABs is served by a dedicated column interconnect. These column resources include:

■ Shared arithmetic chain interconnects in an LAB

■ Carry chain interconnects in an LAB and from LAB to LAB

■ Register chain interconnects in an LAB

■ C4 interconnects traversing a distance of four block s in up and down

direction

■ C16 column interconnects for high-speed vertical routing through

the device

Stratix II devices include an enhanced interconnect structure in LABs for routing shared arithmetic chains and carry chains for efficient arithmetic functions. The register chain connection allows the register output of one ALM to connect directly to the register input of the next ALM in the LAB for fast shift registers. These ALM to ALM connections bypass the local interconnect. The Quartus II Compiler automatically takes advantage of these resources to improve utilization and performance. Figure 2–17 shows the shared arithmetic chain, carry chain and register chain interconnects.

2–24 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 25

Stratix II Architecture

Figure 2–17. Shared Arithmetic Chain, Carry Chain & Register Chain Interconnects

Local Interconnect Routing Among ALMs in the LAB

Carry Chain & Shared

Arithmetic Chain

outing to Adjacent ALM

Local

Interconnect

ALM 1

ALM 2

ALM 3

ALM 4

ALM 5

ALM 6

ALM 7

ALM 8

The C4 interconnects span four LABs, M512, or M4K blocks up or down from a source LAB. Every LAB has its own set of C4 interconnects to drive either up or down. Figure 2–18 shows the C4 interconnect connections from an LAB in a column. The C4 interconnects can drive and be driven by all types of architecture blocks, including DSP blocks, TriMatrix memory blocks, and column and row IOEs. For LAB interconnection, a primary LAB or its LAB neighbor can drive a given C4 interconnect. C4 interconnects can drive each other to extend their range as well as drive row interconnects for column-to-column connections.

Altera Corporation 2–25 May 2007 Stratix II Device Handbook, Volume 1

Page 26

MultiTrack Interconnect

Figure 2–18. C4 Interconnect Connections Note (1)

C4 Interconnect Drives Local and R Interconnects up to Four Rows

C4 Interconnect Driving Up

LAB

Row Interconnect

Adjacent LAB can drive onto neighboring LAB's C4 interconnect

Local

Interconnect

C4 Interconnect Driving Down

Note to Figure 2–18:

(1) Each C4 interconnect can drive either up or down four rows.

2–26 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 27

C16 column interconnects span a length of 16 LABs and provide the fastest resource for long column connections between LABs, TriMatrix memory blocks, DSP blocks, and IOEs. C16 interconnects can cross M-RAM blocks and also drive to row and column interconnects at every fourth LAB. C16 interconnects drive LAB local interconnects via C4 and R4 interconnects and do not drive LAB local interconnects directly.

All embedded blocks communicate with the logic array similar to LABto-LAB interfaces. Each block (that is, TriMatrix memory and DSP blocks) connects to row and column interconnects and has local interconnect regions driven by row and column interconnects. These blocks also have direct link interconnects for fast connections to and from a neighboring LAB. All blocks are fed by the row LAB clocks, labclk[5..0].

Table 2–2 shows the Stratix II device’s routing scheme.

Table 2–2. Stratix II Device Routing Scheme (Part 1 of 2)

Source

Carry Chain

Local Interconnect

Direct Link Interconnect

v v vvvv

vvvv

vvv

vvvv

vvv v vvv v

vvvv vv v

Shared arithmetic chain

Carry chain

Local interconnect

Direct link interconnect

R4 interconnect

R24 interconnect

C4 interconnect

C16 interconnect

ALM

M512 RAM block

M4K RAM block

M-RAM block

DSP blocks

Shared Arithmetic Chain

vvvvvv v

Destination

R24 Interconnect

C4 Interconnect

R4 Interconnect

Stratix II Architecture

ALM

C16 Interconnect

M4K RAM Block

M512 RAM Block

DSP Blocks

M-RAM Block

v v v vvvvvvv

Column IOE

Row IOE

Altera Corporation 2–27 May 2007 Stratix II Device Handbook, Volume 1

Page 28

Tri M a t r ix Memor y

Table 2–2. Stratix II Device Routing Scheme (Part 2 of 2)

Source

Carry Chain

Local Interconnect

Shared Arithmetic Chain

Column IOE

Row IOE

Direct Link Interconnect

vvv vvvv

R4 Interconnect

Destination

C4 Interconnect

R24 Interconnect

ALM

C16 Interconnect

M512 RAM Block

DSP Blocks

M-RAM Block

M4K RAM Block

Row IOE

Column IOE

TriMatrix Memory

TriMatrix memory consists of three types of RAM blocks: M512, M4K, and M-RAM. Although these memory blocks are different, they can all implement various types of memory with or without parity, including true dual-port, simple dual-port, and single-port RAM, ROM, and FIFO buffers. Table 2–3 shows the size and features of the different RAM blocks.

Table 2–3. TriMatrix Memory Features (Part 1 of 2)

Memory Feature

Maximum performance 500 MHz 550 MHz 420 MHz

True dual-port memory

Simple dual-port memory

Single-port memory

Shift register

ROM

FIFO buffer

Pack mode

Byte enable

Address clock enable

Parity bits

Mixed clock mode

Memory initialization (.mif)

M512 RAM Block

(32 × 18 Bits)

vvv vvv vv vv vvv

vvv

vvv vvv vv

M4K RAM Block

(128 × 36 Bits)

M-RAM Block

(4K × 144 Bits)

(1)

2–28 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 29

Table 2–3. TriMatrix Memory Features (Part 2 of 2)

Stratix II Architecture

Memory Feature

Simple dual-port memory mixed width support

True dual-port memory mixed width support

Power-up conditions Outputs cleared Outputs cleared Outputs unknown

Mixed-port read-during-write Unknown output/old data Unknown output/old data Unknown output

Configurations 512 × 1

Notes to Ta b l e 2– 3 :

(1) The M-RAM block does not support memory initializations. However, the M-RAM block can emulate a ROM

function using a dual-port RAM bock. The Stratix II device must write to the dual-port memory once and then disable the write-enable ports afterwards.

M512 RAM Block

(32 × 18 Bits)

vvv

256 × 2 128 × 4 64 × 8 64 × 9 32 × 16 32 × 18

M4K RAM Block

(128 × 36 Bits)

4K × 1 2K × 2 1K × 4 512 × 8 512 × 9 256 × 16 256 × 18 128 × 32 128 × 36

M-RAM Block

(4K × 144 Bits)

64K × 8 64K × 9 32K × 16 32K × 18 16K × 32 16K × 36 8K × 64 8K × 72 4K × 128 4K × 144

Memory Block Size

TriMatrix memory provides three different memory sizes for efficient application support. The Quartus II software automatically partitions the user-defined memory into the embedded memory blocks using the most efficient size combinations. You can also manually assign the memory to a specific block size or a mixture of block sizes.

When applied to input registers, the asynchronous clear signal for the TriMatrix embedded memory immediately clears the input registers. However, the output of the memory block does not show the effects until the next clock edge. When applied to output registers, the asynchronous clear signal clears the output registers and the effects are seen immediately.

Altera Corporation 2–29 May 2007 Stratix II Device Handbook, Volume 1

Page 30

Tri M a t r ix Memor y

M512 RAM Block

The M512 RAM block is a simple dual-port memory block and is useful for implementing small FIFO buffers, DSP, and clock domain transfer applications. Each block contains 576 RAM bits (including parity bits). M512 RAM blocks can be configured in the following modes:

■ Simple dual-port RAM

■ Single-port RAM

■ FIFO

■ ROM

■ Shift register

1 Violating the setup or hold time on the memory block address

registers could corrupt memory contents. This applies to both read and write operations.

When configured as RAM or ROM, you can use an initialization file to pre-load the memory contents.

M512 RAM blocks can have different clocks on its inputs and outputs. The wren, datain, and write address registers are all clocked together from one of the two clocks feeding the block. The read address, rden, and output registers can be clocked by either of the two clocks driving the block. This allows the RAM block to operate in read/write or input/output clock modes. Only the output register can be bypassed. The six labclk signals or local interconnect can drive the inclock, outclock, wren, rden, and outclr signals. Because of the advanced interconnect between the LAB and M512 RAM blocks, ALMs can also control the wren and rden signals and the RAM clock, clock enable, and asynchronous clear signals. Figure 2–19 shows the M512 RAM block control signal generation logic.

The RAM blocks in Stratix II devices have local interconnects to allow ALMs and interconnects to drive into RAM blocks. The M512 RAM block local interconnect is driven by the R4, C4, and direct link interconnects from adjacent LABs. The M512 RAM blocks can communicate with LABs on either the left or right side through these row interconnects or with LAB columns on the left or right side with the column interconnects. The M512 RAM block has up to 16 direct link input connections from the left adjacent LABs and another 16 from the right adjacent LAB. M512 RAM outputs can also connect to left and right LABs through direct link interconnect. The M512 RAM block has equal opportunity for access and performance to and from LABs on either its left or right side. Figure 2–20 shows the M512 RAM block to logic array interface.

2–30 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 31

Figure 2–19. M512 RAM Block Control Signals

Stratix II Architecture

Dedicated Row LAB Clocks

Local Interconnect

inclocken

outclocken

outclockinclock

rden

wren

outclr

Altera Corporation 2–31 May 2007 Stratix II Device Handbook, Volume 1

Page 32

Tri M a t r ix Memor y

Figure 2–20. M512 RAM Block LAB Row Interface

C4 Interconnect

R4 Interconnect

Direct link interconnect to adjacent LAB

Direct link interconnect from adjacent LAB

M512 RAM Block Local Interconnect Region

clocks

datain

dataout

M512 RAM

Block

control

signals

address

LAB Row Clocks

Direct link interconnect to adjacent LAB

Direct link interconnect from adjacent LAB

M4K RAM Blocks

The M4K RAM block includes support for true dual-port RAM. The M4K RAM block is used to implement buffers for a wide variety of applications such as storing processor code, implementing lookup schemes, and implementing larger memory applications. Each block contains 4,608 RAM bits (including parity bits). M4K RAM blocks can be configured in the following modes:

■ True dual-port RAM

■ Simple dual-port RAM

■ Single-port RAM

■ FIFO

■ ROM

■ Shift register

When configured as RAM or ROM, you can use an initialization file to pre-load the memory contents.

2–32 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 33

The M4K RAM blocks allow for different clocks on their inputs and outputs. Either of the two clocks feeding the block can clock M4K RAM block registers (renwe, address, byte enable, datain, and output registers). Only the output register can be bypassed. The six labclk signals or local interconnects can drive the control signals for the A and B ports of the M4K RAM block. ALMs can also control the clock_a, clock_b, renwe_a, renwe_b, clr_a, clr_b, clocken_a, and clocken_b signals, as shown in Figure 2–21.

The R4, C4, and direct link interconnects from adjacent LABs drive the M4K RAM block local interconnect. The M4K RAM blocks can communicate with LABs on either the left or right side through these row resources or with LAB columns on either the right or left with the column resources. Up to 16 direct link input connections to the M4K RAM Block are possible from the left adjacent LABs and another 16 possible from the right adjacent LAB. M4K RAM block outputs can also connect to left and right LABs through direct link interconnect. Figure 2–22 shows the M4K RAM block to logic array interface.

Figure 2–21. M4K RAM Block Control Signals

Dedicated Row LAB Clocks

Local Interconnect

Stratix II Architecture

Local Interconnect

clock_b

clocken_aclock_a

clocken_b

renwe_a

renwe_b

aclr_b

aclr_a

Altera Corporation 2–33 May 2007 Stratix II Device Handbook, Volume 1

Page 34

Tri M a t r ix Memor y

Figure 2–22. M4K RAM Block LAB Row Interface

C4 Interconnect

R4 Interconnect

Direct link interconnect to adjacent LAB

Direct link interconnect from adjacent LAB

M4K RAM Block Local Interconnect Region

M4K RAM

datain

control signals

clocks

address

LAB Row Clocks

dataout

Block

byte enable

Direct link interconnect to adjacent LAB

Direct link interconnect from adjacent LAB

M-RAM Block

The largest TriMatrix memory block, the M-RAM block, is useful for applications where a large volume of data must be stored on-chip. Each block contains 589,824 RAM bits (including parity bits). The M-RAM block can be configured in the following modes:

■ True dual-port RAM

■ Simple dual-port RAM

■ Single-port RAM

■ FIFO

You cannot use an initialization file to initialize the contents of an M-RAM block. All M-RAM block contents power up to an undefined value. Only synchronous operation is supported in the M-RAM block, so all inputs are registered. Output registers can be bypassed.

2–34 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 35

Similar to all RAM blocks, M-RAM blocks can have different clocks on their inputs and outputs. Either of the two clocks feeding the block can clock M-RAM block registers (renwe, address, byte enable, datain, and output registers). The output register can be bypassed. The six labclk signals or local interconnect can drive the control signals for the A and B ports of the M-RAM block. ALMs can also control the clock_a,

clock_b, renwe_a, renwe_b, clr_a, clr_b, clocken_a, and clocken_b signals as shown in Figure 2–23.

Figure 2–23. M-RAM Block Control Signals

Dedicated Row LAB Clocks

Local Interconnect

Stratix II Architecture

Local Interconnect

clocken_a

clock_a

aclr_a

renwe_a

renwe_b

aclr_b

clocken_b

clock_b

Local Interconnect

The R4, R24, C4, and direct link interconnects from adjacent LABs on either the right or left side drive the M-RAM block local interconnect. Up to 16 direct link input connections to the M-RAM block are possible from the left adjacent LABs and another 16 possible from the right adjacent LAB. M-RAM block outputs can also connect to left and right LABs through direct link interconnect. Figure 2–24 shows an example floorplan for the EP2S130 device and the location of the M-RAM interfaces.

Figures 2–25 and 2–26 show the interface between the M-RAM block and

the logic array.

Altera Corporation 2–35 May 2007 Stratix II Device Handbook, Volume 1

Page 36

Tri M a t r ix Memor y

Figure 2–24. EP2S130 Device with M-RAM Interface Locations Note (1)

M-RAM blocks interface to

LABs on right and left sides for

easy access to horizontal I/O pins

M4K

Blocks

M-RAM

Block

M-RAM

Block

M-RAM

Block

M512

Blocks

DSP

Blocks

LABs

M-RAM

Block

M-RAM

Block

M-RAM

Block

DSP

Blocks

Note to Figure 2–24:

(1) The device shown is an EP2S130 device. The number and position of M-RAM blocks varies in other devices.

2–36 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 37

Figure 2–25. M-RAM Block LAB Row Interface Note (1)

Row Unit Interface Allows LAB Rows to Drive Port A Datain, Dataout, Address and Control Signals to and from M-RAM Block

Row Unit Interface Allows LAB Rows to Drive Port B Datain, Dataout, Address and Control Signals to and from M-RAM Block

Stratix II Architecture

M-RAM Block

LAB Interface Blocks

LABs in Row M-RAM Boundary

Note to Figure 2–25:

(1) Only R24 and C16 interconnects cross the M-RAM block boundaries.

Port BPort A

LABs in Row M-RAM Boundary

Altera Corporation 2–37 May 2007 Stratix II Device Handbook, Volume 1

Page 38

Tri M a t r ix Memor y

Figure 2–26. M-RAM Row Unit Interface to Interconnect

R4 and R24 InterconnectsC4 Interconnect

LAB

M-RAM Block

Direct Link Interconnects

Up to 16

Up to 28

Row Interface Block

M-RAM Block to LAB Row Interface Block Interconnect Region

dataout_a[ ]

datain_a[ ] addressa[ ] addr_ena_a renwe_a byteenaA[ ] clocken_a clock_a aclr_a

Table 2–4 shows the input and output data signal connections along with

the address and control signal input connections to the row unit interfaces (L0 to L5 and R0 to R5).

2–38 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 39

Stratix II Architecture

Table 2–4. M-RAM Row Interface Unit Signals

Unit Interface Block Input Signals Output Signals

L0 datain_a[14..0]

byteena_a[1..0]

L1 datain_a[29..15]

byteena_a[3..2]

L2 datain_a[35..30]

addressa[4..0] addr_ena_a clock_a clocken_a renwe_a aclr_a

L3 addressa[15..5]

datain_a[41..36]

L4 datain_a[56..42]

byteena_a[5..4]

L5 datain_a[71..57]

byteena_a[7..6]

R0 datain_b[14..0]

byteena_b[1..0]

R1 datain_b[29..15]

byteena_b[3..2]

R2 datain_b[35..30]

addressb[4..0] addr_ena_b clock_b clocken_b renwe_b aclr_b

R3 addressb[15..5]

datain_b[41..36]

R4 datain_b[56..42]

byteena_b[5..4]

R5 datain_b[71..57]

byteena_b[7..6]

dataout_a[11..0]

dataout_a[23..12]

dataout_a[35..24]

dataout_a[47..36]

dataout_a[59..48]

dataout_a[71..60]

dataout_b[11..0]

dataout_b[23..12]

dataout_b[35..24]

dataout_b[47..36]

dataout_b[59..48]

dataout_b[71..60]

f See the TriMatrix Embedded Memory Blocks in Stratix II & Stratix II GX

Devices chapter in volume 2 of the Stratix II Device Handbook or the Stratix II GX Device Handbook for more information on TriMatrix

memory.

Altera Corporation 2–39 May 2007 Stratix II Device Handbook, Volume 1

Page 40

Digital Signal Processing Block

The most commonly used DSP functions are FIR filters, complex FIR filters, IIR filters, fast Fourier transform (FFT) functions, direct cosine transform (DCT) functions, and correlators. All of these use the multiplier as the fundamental building block. Additionally, some applications need specialized operations such as multiply-add and multiply-accumulate operations. Stratix II devices provide DSP blocks to meet the arithmetic requirements of these functions.

Each Stratix II device has from two to four columns of DSP blocks to efficiently implement DSP functions faster than ALM-based implementations. Stratix II devices have up to 24 DSP blocks per column (see Table 2–5). Each DSP block can be configured to support up to:

■ Eight 9 × 9-bit multipliers

■ Four 18 × 18-bit multipliers

■ One 36 × 36-bit multiplier

As indicated, the Stratix II DSP block can support one 36 × 36-bit multiplier in a single DSP block. This is true for any combination of signed, unsigned, or mixed sign multiplications.

1 This list only shows functions that can fit into a single DSP block.

Multiple DSP blocks can support larger multiplication functions.

2–40 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 41

Stratix II Architecture

Figure 2–27 shows one of the columns with surrounding LAB rows.

Figure 2–27. DSP Blocks Arranged in Columns

DSP Block

Column

4 LAB Rows

DSP Block

Altera Corporation 2–41 May 2007 Stratix II Device Handbook, Volume 1

Page 42

Digital Signal Processing Block

Table 2–5 shows the number of DSP blocks in each Stratix II device.

Table 2–5. DSP Blocks in Stratix II Devices Note (1)

Device DSP Blocks

EP2S15 12 96 48 12

EP2S30 16 128 64 16

EP2S60 36 288 144 36

EP2S90 48 384 192 48

EP2S130 63 504 252 63

EP2S180 96 768 384 96

Note to Ta b l e 2– 5 :

(1) Each device has either the numbers of 9 × 9-, 18 × 18-, or 36 × 36-bit multipliers

shown. The total number of multipliers for each device is not the sum of all the multipliers.

Total 9 × 9

Multipliers

Total 18 × 18

Multipliers

Total 36 × 36

Multipliers

DSP block multipliers can optionally feed an adder/subtractor or accumulator in the block depending on the configuration. This makes routing to ALMs easier, saves ALM routing resources, and increases performance, because all connections and blocks are in the DSP block. Additionally, the DSP block input registers can efficiently implement shift registers for FIR filter applications, and DSP blocks support Q1.15 format rounding and saturation.

Figure 2–28 shows the top-level diagram of the DSP block configured for

18 × 18-bit multiplier mode.

2–42 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 43

Figure 2–28. DSP Block Diagram for 18 × 18-Bit Configuration

Optional Serial Shift

Previous DSP Block

Stratix II Architecture

From the row

interface block

Optional Serial Shift Register Outputs to

Next DSP Block

in the Column

PRN DQ

ENA

CLRN

PRN

ENA

CLRN

PRN

ENA

CLRN

PRN

ENA

CLRN

PRN

ENA

CLRN

PRN

ENA

CLRN

PRN

ENA

CLRN

PRN

ENA

CLRN

Multiplier Block

Q1.15

Round/

Saturate

Q1.15

Round/

Saturate

Q1.15

Round/

Saturate

Q1.15

Round/

Saturate

Optional Input Register Stage with Parallel Input or Shift Register Configuration

PRN

ENA

CLRN

PRN

ENA

CLRN

PRN

ENA

CLRN

PRN

ENA

CLRN

Adder Output Block

Optional Stage Configurable as Accumulator or Dynamic

Adder/Subtractor

Adder/

Subtractor/

Accumulator

Adder/

Subtractor/

Accumulator

Optional Pipline Register Stage

Q1.15

Round/

Saturate

Summation

Block

Adder

Summation Stage

for Adding Four

Multipliers Together

Q1.15

Round/

Saturate

Output

Selection

Multiplexer

to MultiTrack

Interconnect

ENA

CLRN

Altera Corporation 2–43 May 2007 Stratix II Device Handbook, Volume 1

Page 44

Digital Signal Processing Block

Modes of Operation

The adder, subtractor, and accumulate functions of a DSP block have four modes of operation:

■ Simple multiplier

■ Multiply-accumulator

■ Two-multipliers adder

■ Four-multipliers adder

Table 2–6 shows the different number of multipliers possible in each DSP

block mode according to size. These modes allow the DSP blocks to implement numerous applications for DSP including FFTs, complex FIR, FIR, and 2D FIR filters, equalizers, IIR, correlators, matrix multiplication and many other functions. The DSP blocks also support mixed modes and mixed multiplier sizes in the same block. For example, half of one DSP block can implement one 18 × 18-bit multiplier in multiplyaccumulator mode, while the other half of the DSP block implements four 9 × 9-bit multipliers in simple multiplier mode.

Table 2–6. Multiplier Size & Configurations per DSP Block

DSP Block Mode 9 × 9 18 × 18 36 × 36

Multiplier Eight multipliers with

eight product outputs

Multiply-accumulator - Two 52-bit multiply-

Two-multipliers adder Four two-multiplier adder

(two 9 × 9 complex multiply)

Four-multipliers adder Two four-multiplier adder One four-multiplier adder -

Four multipliers with four product outputs

accumulate blocks

Two two-multiplier adder (one 18 × 18 complex multiply)

One multiplier with one product output

DSP Block Interface

Stratix II device DSP block input registers can generate a shift register that can cascade down in the same DSP block column. Dedicated connections between DSP blocks provide fast connections between the shift register inputs to cascade the shift register chains. You can cascade registers within multiple DSP blocks for 9 × 9- or 18 × 18-bit FIR filters larger than four taps, with additional adder stages implemented in ALMs. If the DSP block is configured as 36 × 36 bits, the adder, subtractor, or accumulator stages are implemented in ALMs. Each DSP block can route the shift register chain out of the block to cascade multiple columns of DSP blocks.

2–44 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 45

Stratix II Architecture

The DSP block is divided into four block units that interface with four LAB rows on the left and right. Each block unit can be considered one complete 18 × 18-bit multiplier with 36 inputs and 36 outputs. A local interconnect region is associated with each DSP block. Like an LAB, this interconnect region can be fed with 16 direct link interconnects from the LAB to the left or right of the DSP block in the same row. R4 and C4 routing resources can access the DSP block's local interconnect region. The outputs also work similarly to LAB outputs as well. Eighteen outputs from the DSP block can drive to the left LAB through direct link interconnects and eighteen can drive to the right LAB though direct link interconnects. All 36 outputs can drive to R4 and C4 routing interconnects. Outputs can drive right- or left-column routing.

Figures 2–29 and 2–30 show the DSP block interfaces to LAB rows.

Figure 2–29. DSP Block Interconnect Interface

DSP Block

R4, C4 & Direct

Link Interconnects

OA[17..0] OB[17..0]

A1[17..0] B1[17..0]

OC[17..0] OD[17..0]

A2[17..0] B2[17..0]

OE[17..0] OF[17..0]

A3[17..0] B3[17..0]

OG[17..0] OH[17..0]

A4[17..0] B4[17..0]

R4, C4 & Direct Link Interconnect

Altera Corporation 2–45 May 2007 Stratix II Device Handbook, Volume 1

Page 46

Digital Signal Processing Block

Figure 2–30. DSP Block Interface to Interconnect

C4 Interconnect

LAB LAB

Direct Link Interconnect from Adjacent LAB

R4 Interconnect

DSP Block Row Structure

Direct Link Outputs to Adjacent LABs

Direct Link Interconnect from Adjacent LAB

Row Interface

DSP Block to LAB Row Interface Block Interconnect Region

Control

A[17..0] B[17..0]

Block

36 Inputs per Row 36 Outputs per Row

OA[17..0] OB[17..0]

A bus of 44 control signals feeds the entire DSP block. These signals include clocks, asynchronous clears, clock enables, signed/unsigned control signals, addition and subtraction control signals, rounding and saturation control signals, and accumulator synchronous loads. The clock signals are routed from LAB row clocks and are generated from specific LAB rows at the DSP block interface.

2–46 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 47

Stratix II Architecture

The LAB row source for control signals, data inputs, and outputs is shown in Ta bl e 2 –7 .

Table 2–7. DSP Block Signal Sources & Destinations

LAB Row at

Interface

0clock0

aclr0 ena0 mult01_saturate addnsub1_round/ accum_round addnsub1 signa sourcea sourceb

1clock1

aclr1 ena1 accum_saturate mult01_round accum_sload sourcea sourceb mode0

2clock2

aclr2 ena2 mult23_saturate addnsub3_round/ accum_round addnsub3 sign_b sourcea sourceb

3clock3

aclr3 ena3 accum_saturate mult23_round accum_sload sourcea sourceb mode1

Control Signals Generated Data Inputs Data Outputs

A1[17..0] B1[17..0]

A2[17..0] B2[17..0]

A3[17..0] B3[17..0]

A4[17..0] B4[17..0]

OA[17..0] OB[17..0]

OC[17..0] OD[17..0]

OE[17..0] OF[17..0]

OG[17..0] OH[17..0]

f See the DSP Blocks in Stratix II & Stratix II GX Devices chapter in

volume 2 of the Stratix II Device Handbook or the Stratix II GX Device Handbook, for more information on DSP blocks.

Altera Corporation 2–47 May 2007 Stratix II Device Handbook, Volume 1

Page 48

PLLs & Clock Networks

Stratix II devices provide a hierarchical clock structure and multiple PLLs with advanced features. The large number of clocking resources in combination with the clock synthesis precision provided by enhanced and fast PLLs provides a complete clock management solution.

Global & Hierarchical Clocking

Stratix II devices provide 16 dedicated global clock networks and 32 regional clock networks (eight per device quadrant). These clocks are organized into a hierarchical clock structure that allows for up to 24 clocks per device region with low skew and delay. This hierarchical clocking scheme provides up to 48 unique clock domains in Stratix II devices.

There are 16 dedicated clock pins (CLK[15..0]) to drive either the global or regional clock networks. Four clock pins drive each side of the device, as shown in Figures 2–31 and 2–32. Internal logic and enhanced and fast PLL outputs can also drive the global and regional clock networks. Each global and regional clock has a clock control block, which controls the selection of the clock source and dynamically enables/disables the clock to reduce power consumption. Table 2–8 shows global and regional clock features.

Table 2–8. Global & Regional Clock Features

Feature Global Clocks Regional Clocks

Number per device 16 32

Number available per quadrant

Sources CLK pins, PLL outputs,

or internal logic

Dynamic clock source selection

Dynamic enable/disable

16 8

CLK pins, PLL outputs, or internal logic

v (1)

Note to Ta b l e 2– 8 :

(1) Dynamic source clock selection is supported for selecting between CLKp pins and

PLL outputs only.

Global Clock Network

These clocks drive throughout the entire device, feeding all device quadrants. The global clock networks can be used as clock sources for all resources in the device-IOEs, ALMs, DSP blocks, and all memory blocks. These resources can also be used for control signals, such as clock enables and synchronous or asynchronous clears fed from the external pin. The

2–48 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 49

Stratix II Architecture

global clock networks can also be driven by internal logic for internally generated global clocks and asynchronous clears, clock enables, or other control signals with large fanout. Figure 2–31 shows the 16 dedicated CLK pins driving global clock networks.

Figure 2–31. Global Clocking

CLK[15..12]

Global Clock [15..0]

CLK[3..0]

Global Clock [15..0]

CLK[7..4]

CLK[11..8]

Regional Clock Network

There are eight regional clock networks RCLK[7..0] in each quadrant of the Stratix II device that are driven by the dedicated CLK[15..0] input pins, by PLL outputs, or by internal logic. The regional clock networks provide the lowest clock delay and skew for logic contained in a single quadrant. The CLK clock pins symmetrically drive the RCLK networks in a particular quadrant, as shown in Figure 2–32.

Altera Corporation 2–49 May 2007 Stratix II Device Handbook, Volume 1

Page 50

PLLs & Clock Networks

Figure 2–32. Regional Clocks

RCLK[31..28] RCLK[27..24]

CLK[15..12]

RCLK[3..0]

CLK[3..0]

RCLK[7..4]

RCLK[23..20]

CLK[11..8]

RCLK[19..16]

CLK[7..4]

Regional Clocks Only Drive a Device Quadrant from Specified CLK Pins, PLLs or Core Logic within that Quadrant

RCLK[11..8] RCLK[15..12]

Dual-Regional Clock Network

A single source (CLK pin or PLL output) can generate a dual-regional clock by driving two regional clock network lines in adjacent quadrants (one from each quadrant). This allows logic that spans multiple quadrants to utilize the same low skew clock. The routing of this clock signal on an entire side has approximately the same speed but slightly higher clock skew when compared with a clock signal that drives a single quadrant. Internal logic-array routing can also drive a dual-regional clock. Clock pins and enhanced PLL outputs on the top and bottom can drive horizontal dual-regional clocks. Clock pins and fast PLL outputs on the left and right can drive vertical dual-regional clocks, as shown in

Figure 2–33. Corner PLLs cannot drive dual-regional clocks.

2–50 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 51

Figure 2–33. Dual-Regional Clocks

Clock Pins or PLL Clock Outputs Can Drive Dual-Regional Network

CLK[15..12]

Clock Pins or PLL Clock Outputs Can Drive Dual-Regional Network

Stratix II Architecture

CLK[15..12]

CLK[3..0]

CLK[7..4]

CLK[11..8]

Combined Resources

Within each quadrant, there are 24 distinct dedicated clocking resources consisting of 16 global clock lines and eight regional clock lines. Multiplexers are used with these clocks to form busses to drive LAB row clocks, column IOE clocks, or row IOE clocks. Another multiplexer is used at the LAB level to select three of the six row clocks to feed the ALM registers in the LAB (see Figure 2–34).

Figure 2–34. Hierarchical Clock Networks Per Quadrant

Clocks Available

to a Quadrant

or Half-Quadrant

CLK[3..0]

PLLsPLLs

CLK[11..8]

CLK[7..4]

Column I/O Cell

IO_CLK[7..0]

Global Clock Network [15..0]

Clock [23..0]

Regional Clock Network [7..0]

Lab Row Clock [5..0]

Row I/O Cell

IO_CLK[7..0]

Altera Corporation 2–51 May 2007 Stratix II Device Handbook, Volume 1

Page 52

PLLs & Clock Networks

IOE clocks have row and column block regions that are clocked by eight I/O clock signals chosen from the 24 quadrant clock resources.

Figures 2–35 and 2–36 show the quadrant relationship to the I/O clock

regions.

Figure 2–35. EP2S15 & EP2S30 Device I/O Clock Groups

IO_CLKA[7:0] IO_CLKB[7:0]

IO_CLKH[7:0]

IO_CLKG[7:0]

8 8

24 Clocks in

the Quadrant

24 Clocks in

the Quadrant

24 Clocks in

the Quadrant

24 Clocks in

the Quadrant

I/O Clock Region

IO_CLKC[7:0]

IO_CLKD[7:0]

IO_CLKF[7:0] IO_CLKE[7:0]

2–52 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 53

Figure 2–36. EP2S60, EP2S90, EP2S130 & EP2S180 Device I/O Clock Groups

IO_CLKA[7:0] IO_CLKB[7:0]

IO_CLKC[7:0] IO_CLKD[7:0]

Stratix II Architecture

IO_CLKP[7:0]

IO_CLKO[7:0]

IO_CLKN[7:0]

IO_CLKM[7:0]

888

24 Clocks in the

Quadrant

24 Clocks in the

Quadrant

IO_CLKL[7:0] IO_CLKK[7:0]

8 8 8

24 Clocks in the

Quadrant

24 Clocks in the

Quadrant

IO_CLKJ[7:0] IO_CLKI[7:0]

I/O Clock Region

IO_CLKE[7:0]

IO_CLKF[7:0]

IO_CLKG[7:0]

IO_CLKH[7:0]

You can use the Quartus II software to control whether a clock input pin drives either a global, regional, or dual-regional clock network. The Quartus II software automatically selects the clocking resources if not specified.

Clock Control Block

Each global clock, regional clock, and PLL external clock output has its own clock control block. The control block has two functions:

■ Clock source selection (dynamic selection for global clocks)

■ Clock power-down (dynamic clock enable/disable)

Altera Corporation 2–53 May 2007 Stratix II Device Handbook, Volume 1

Page 54

PLLs & Clock Networks

)

1 When using the global or regional clock control blocks in

Stratix II devices to select between multiple clocks or to enable and disable clock networks, be aware of possible narrow pulses or glitches when switching from one clock signal to another. A glitch or runt pulse has a width that is less than the width of the highest frequency input clock signal. To prevent logic errors within the FPGA, Altera recommends that you build circuits that filter out glitches and runt pulses.

Figures 2–37 through 2–39 show the clock control block for the global

clock, regional clock, and PLL external clock output, respectively.

Figure 2–37. Global Clock Control Blocks

CLKp Pins

PLL Counter

Outputs

CLKSELECT[1..0]

(1)

CLKn

Internal Logic

This multiplexer supports User-Controllable Dynamic Switching

Static Clock Select

Enable/ Disable

Internal

Logic

GCLK

Notes to Figure 2–37:

(1) These clock select signals can be dynamically controlled through internal logic

when the device is operating in user mode.

(2) These clock select signals can only be set through a configuration file (.sof or .pof)

and cannot be dynamically controlled during user mode operation.

2–54 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 55

Stratix II Architecture

)

Figure 2–38. Regional Clock Control Blocks

CLKp

CLKn

(2)

PLL Counter

Outputs

Notes to Figure 2–38:

(1) These clock select signals can only be set through a configuration file (.sof or .pof)

and cannot be dynamically controlled during user mode operation.

(2) Only th e CLKn pins on the top and bottom of the device feed to regional clock select

blocks.The clock outputs from corner PLLs cannot be dynamically selected through the global clock control block.

(3) The clock outputs from corner PLLs cannot be dynamically selected through the

global clock control block.

(3)

Enable/

Disable

RCLK

Internal Logic

Static Clock Select

Internal

Logic

Altera Corporation 2–55 May 2007 Stratix II Device Handbook, Volume 1

Page 56

PLLs & Clock Networks

)

Figure 2–39. External PLL Output Clock Control Blocks

PLL Counter

Outputs (c[5..0])

IOE

Internal

(2)

Logic

Enable/ Disable

PLL_OUT

Static Clock Select

Internal

Logic

Static Clock Select

(1)

Notes to Figure 2–39:

(1) These clock select signals can only be set through a configuration file (.sof or .pof)

and cannot be dynamically controlled during user mode operation.

(2) The clock control block feeds to a multiplexer within the PLL_OUT pin’s IOE. The

PLL_OUT pin is a dual-purpose pin. Therefore, this multiplexer selects either an internal signal or the output of the clock control block.

For the global clock control block, the clock source selection can be controlled either statically or dynamically. The user has the option of statically selecting the clock source by using the Quartus II software to set specific configuration bits in the configuration file (.sof or .pof) or the user can control the selection dynamically by using internal logic to drive the multiplexor select inputs. When selecting statically, the clock source can be set to any of the inputs to the select multiplexor. When selecting the clock source dynamically, you can either select between two PLL outputs (such as the C0 or C1 outputs from one PLL), between two PLLs (such as the C0/C1 clock output of one PLL or the C0/C1 c1ock output of the other PLL), between two clock pins (such as CLK0 or CLK1), or between a combination of clock pins or PLL outputs. The clock outputs from corner PLLs cannot be dynamically selected through the global control block.

For the regional and PLL_OUT clock control block, the clock source selection can only be controlled statically using configuration bits. Any of the inputs to the clock select multiplexor can be set as the clock source.

2–56 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 57

Stratix II Architecture

The Stratix II clock networks can be disabled (powered down) by both static and dynamic approaches. When a clock net is powered down, all the logic fed by the clock net is in an off-state thereby reducing the overall power consumption of the device.

The global and regional clock networks can be powered down statically through a setting in the configuration (.sof or .pof) file. Clock networks that are not used are automatically powered down through configuration bit settings in the configuration file generated by the Quartus II software.

The dynamic clock enable/disable feature allows the internal logic to control power up/down synchronously on GCLK and RCLK nets and PLL_OUT pins. This function is independent of the PLL and is applied directly on the clock network or PLL_OUT pin, as shown in Figures 2–37 through 2–39.

1 The following restrictions for the input clock pins apply:

• CLK0 pin -> inclk[0] of CLKCTRL

• CLK1 pin -> inclk[1] of CLKCTRL

• CLK2 pin -> inclk[0] of CLKCTRL

• CLK3 pin -> inclk[1] of CLKCTRL

In general, even CLK numbers connect to the inclk[0] port of CLKCTRL, and odd CLK numbers connect to the inclk[1] port of CLKCTRL.

Failure to comply with these restrictions will result in a no-fit error.

Enhanced & Fast PLLs

Stratix II devices provide robust clock management and synthesis using up to four enhanced PLLs and eight fast PLLs. These PLLs increase performance and provide advanced clock interfacing and clockfrequency synthesis. With features such as clock switchover, spread-spectrum clocking, reconfigurable bandwidth, phase control, and reconfigurable phase shifting, the Stratix II device’s enhanced PLLs provide you with complete control of clocks and system timing. The fast PLLs provide general purpose clocking with multiplication and phase shifting as well as high-speed outputs for high-speed differential I/O support. Enhanced and fast PLLs work together with the Stratix II high-speed I/O and advanced clock architecture to provide significant improvements in system performance and bandwidth.

Altera Corporation 2–57 May 2007 Stratix II Device Handbook, Volume 1

Page 58

PLLs & Clock Networks

The Quartus II software enables the PLLs and their features without requiring any external devices. Table 2–9 shows the PLLs available for each Stratix II device and their type.

Table 2–9. Stratix II Device PLL Availability

Device

Fast PLLs Enhanced PLLs

123478910561112

EP2S15

EP2S30

EP2S60 (1)

EP2S90 (2)

EP2S130 (3)

EP2S180

Notes to Ta b l e 2– 9 :

(1) EP2S60 devices in the 1020-pin package contain 12 PLLs. EP2S60 devices in the 484-pin and 672-pin packages

contain fast PLLs 1–4 and enhanced PLLs 5 and 6.

(2) EP2S90 devices in the 1020-pin and 1508-pin packages contain 12 PLLs. EP2S90 devices in the 484-pin and 780-pin

packages contain fast PLLS 1–4 and enhanced PLLs 5 and 6.

(3) EP2S130 devices in the 1020-pin and 1508-pin packages contain 12PLLs. The EP2S130 device in the 780-pin package

contains fast PLLs 1–4 and enhanced PLLs 5 and 6.

vvvv vv vvvv vv vvvvvvvvvvvv vvvvvvvvvvvv vvvvvvvvvvvv vvvvvvvvvvvv

2–58 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 59

Stratix II Architecture

Table 2–10 shows the enhanced PLL and fast PLL features in Stratix II

devices.

Table 2–10. Stratix II PLL Features

Feature Enhanced PLL Fast PLL

Clock multiplication and division m/(n × post-scale counter) (1) m/(n × post-scale counter) (2)

Phase shift Down to 125-ps increments (3), (4) Down to 125-ps increments (3), (4)

Clock switchover

PLL reconfiguration

Reconfigurable bandwidth

Spread spectrum clocking

Programmable duty cycle

Number of internal clock outputs 6 4

Number of external clock outputs Three differential/six single-ended (6)

Number of feedback clock inputs One single-ended or differential

Notes to Table 2–10:

(1) For enhanced PLLs, m ranges from 1 to 256, while n and post-scale counters range from 1 to 512 with 50% duty

cycle.

(2) For fast PLLs, m, and post-scale counters range from 1 to 32. The n counter ranges from 1 to 4. (3) The smallest phase shift is determined by the voltage controlled oscillator (VCO) period divided by 8. (4) For degree increments, Stratix II devices can shift all output frequencies in increments of at least 45. Smaller degree

increments are possible depending on the frequency and divide parameters. (5) Stratix II fast PLLs only support manual clock switchover. (6) Fast PLLs can drive to any I/O pin as an external clock. For high-speed differential I/O pins, the device uses a data

channel to generate txclkout. (7) If the feedback input is used, you lose one (or two, if FBIN is differential) external clock output pin. (8) Every Stratix II device has at least two enhanced PLLs with one single-ended or differential external feedback input

per PLL.

vv (5) vv vv v vv

(7), (8)

Altera Corporation 2–59 May 2007 Stratix II Device Handbook, Volume 1

Page 60

PLLs & Clock Networks

Figure 2–40. PLL Locations

Figure 2–40 shows a top-level diagram of the Stratix II device and PLL

floorplan.

CLK[15..12]

511

FPLL7CLK FPLL10CLK

CLK[3..0]

PLLs

FPLL8CLK

612

CLK[7..4]

CLK[8..11]

FPLL9CLK

Figures 2–41 and 2–42 shows the global and regional clocking from the

fast PLL outputs and the side clock pins.

2–60 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 61

Stratix II Architecture

Figure 2–41. Global & Regional Clock Connections from Center Clock Pins & Fast PLL Outputs Note (1)

CLK11

CLK10

CLK9

CLK8

Fast

C0C1C2

Fast

PLL 4

Network

To Clock

Logic Array

Signal Input

PLL 1

Fast

C0C1C2

Fast

PLL 3

RCK20 RCK22

RCK19 RCK21 RCK23

RCK17

RCK16 RCK18

GCK1 GCK3 GCK8 GCK10

GCK0 GCK2 GCK9 GCK11

RCK5 RCK7

RCK4 RCK6

RCK1 RCK3

RCK0 RCK2

PLL 2

CLK0

CLK1

CLK2

CLK3

Notes to Figure 2–41:

(1) EP2S15 and EP2S30 devices only have four fast PLLs (1, 2, 3, and 4), but the

connectivity from these four PLLs to the global and regional clock networks remains the same as shown.

(2) The global or regional clocks in a fast PLL's quadrant can drive the fast PLL input.

The global or regional clock input can be driven by an output from another PLL, a pin-driven dedicated global or regional clock, or through a clock control block, provided the clock control block is fed by an output from another PLL or a pin-driven dedicated global or regional clock. An internally generated global signal cannot drive the PLL.

Altera Corporation 2–61 May 2007 Stratix II Device Handbook, Volume 1

Page 62

PLLs & Clock Networks

Figure 2–42. Global & Regional Clock Connections from Corner Clock Pins & Fast PLL Outputs Note (1)

9CL

FPLL10CL

FPLL

Fast

PLL 10

C0C1C2

RCK21 RCK23

RCK20 RCK22

RCK1 RCK3

RCK0 RCK2

C0C1C2

Fast

PLL 7

Fast

PLL 9

C0C1C2

Fast

PLL 8

RCK19

RCK17

RCK16 RCK18

GCK1 GCK3 GCK8 GCK10

GCK0 GCK2 GCK9 GCK11

RCK5 RCK7

RCK4 RCK6

8CLK

FPLL7CLK

FPLL

Note to Figure 2–42:

(1) The corner fast PLLs can also be driven through the global or regional clock

networks. The global or regional clock input can be driven by an output from another PLL, a pin-driven dedicated global or regional clock, or through a clock control block, provided the clock control block is fed by an output from another PLL or a pin-driven dedicated global or regional clock. An internally generated global signal cannot drive the PLL.

2–62 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 63

Stratix II Architecture

Figure 2–43 shows the global and regional clocking from enhanced PLL

outputs and top and bottom CLK pins. The connections to the global and regional clocks from the top clock pins and enhanced PLL outputs is shown in Table 2–11. The connections to the clocks from the bottom clock pins is shown in Table 2–12.

Altera Corporation 2–63 May 2007 Stratix II Device Handbook, Volume 1

Page 64

PLLs & Clock Networks

Figure 2–43. Global & Regional Clock Connections from Top & Bottom Clock Pins & Enhanced PLL Outputs

Notes (1), (2), and (3)

CLK15

CLK13

CLK12

PLL11_FB

CLK14

PLL5_FB

PLL11_OUT[2..0]p PLL11_OUT[2..0]n

RCLK27

Regional

RCLK26

Clocks

RCLK25 RCLK24

Global

Clocks

Regional

RCLK10

Clocks

RCLK11

PLL12_OUT[2..0]p PLL12_OUT[2..0]n

RCLK8 RCLK9

PLL 11

c0 c1 c2 c3 c4 c5 c0 c1 c2 c3 c4 c5

PLL 12

PLL12_FB

CLK4

CLK5

CLK6

CLK7

PLL 5

PLL 6

PLL6_FB

PLL5_OUT[2..0]p

PLL5_OUT[2..0]n RCLK31 RCLK30 RCLK29 RCLK28

G15 G14 G13 G12

G4 G5 G6 G7

RCLK12 RCLK13 RCLK14 RCLK15

PLL6_OUT[2..0]p PLL6_OUT[2..0]n

Notes to Figure 2–43:

(1) EP2S15 and EP2S30 devices only have two enhanced PLLs (5 and 6), but the connectivity from these two PLLs to

the global and regional clock networks remains the same as shown. (2) If the design uses the feedback input, you lose one (or two, if FBIN is differential) external clock output pin. (3) The enhanced PLLs can also be driven through the global or regional clock netowrks. The global or regional clock

input can be driven by an output from another PLL, a pin-driven dedicated global or regional clock, or through a

clock control block provided the clock control block is fed by an output from another PLL or a pin-driven dedicated

global or regional clock. An internally generated global signal cannot drive the PLL.

2–64 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 65

Stratix II Architecture

Table 2–11. Global & Regional Clock Connections from Top Clock Pins & Enhanced PLL Outputs (Part 1 of 2)

Top Side Global & Regional

Clock Network Connectivity

Clock pins

CLK12p

CLK13p

CLK14p

CLK15p

CLK12n

CLK13n

CLK14n

CLK15n

Drivers from internal logic

GCLKDRV0

GCLKDRV1

GCLKDRV2

GCLKDRV3

RCLKDRV0

RCLKDRV1

RCLKDRV2

RCLKDRV3

RCLKDRV4

RCLKDRV5

RCLKDRV6

RCLKDRV7

Enhanced PLL 5 outputs

vvv v v

vvv v v vvvv v

vvv vv

vvv v v

vvv v v vvvv v

vvvvv

CLK12

CLK13

CLK14

DLLCLK

CLK15

RCLK24

RCLK25

RCLK26

RCLK27

RCLK28

RCLK29

vvv

vv v

vvv

RCLK30

RCLK31

Altera Corporation 2–65 May 2007 Stratix II Device Handbook, Volume 1

Page 66

PLLs & Clock Networks

Table 2–11. Global & Regional Clock Connections from Top Clock Pins & Enhanced PLL Outputs (Part 2 of 2)

Top Side Global & Regional

Clock Network Connectivity

Enhanced PLL 11 outputs

vvvvv

v vvvv

CLK12

CLK13

CLK14

DLLCLK

CLK15

RCLK24

RCLK25

RCLK26

RCLK27

RCLK28

vv v v vv v v

vv v v

vvvv

RCLK29

RCLK30

Table 2–12. Global & Regional Clock Connections from Bottom Clock Pins & Enhanced PLL Outputs (Part 1 of 2)

RCLK31

Bottom Side Global &

Regional Clock Network

Connectivity

Clock pins

CLK4p

CLK5p

CLK6p

CLK7p

CLK4n

CLK5n

CLK6n

CLK7n

Drivers from internal logic

GCLKDRV0

GCLKDRV1

GCLKDRV2

2–66 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

vvv v v vvv v v

vvvv v

vvvvv

CLK4

CLK5

CLK6

CLK7

RCLK8

DLLCLK

RCLK9

RCLK10

RCLK11

RCLK12

RCLK13

vvv

RCLK14

RCLK15

Page 67

Stratix II Architecture

Table 2–12. Global & Regional Clock Connections from Bottom Clock Pins & Enhanced PLL Outputs (Part 2 of 2)

Bottom Side Global &

Regional Clock Network

Connectivity

GCLKDRV3

RCLKDRV0

RCLKDRV1

RCLKDRV2

RCLKDRV3

RCLKDRV4

RCLKDRV5

RCLKDRV6

RCLKDRV7

Enhanced PLL 6 outputs

Enhanced PLL 12 outputs

vvv v v vvv v v

vvvv v

vvvvv vvvvv

v vvvv

CLK4

CLK5

CLK6

CLK7

RCLK8

DLLCLK

RCLK9

RCLK10

RCLK11

RCLK12

vv v v

vvvv

RCLK13

RCLK14

RCLK15

Altera Corporation 2–67 May 2007 Stratix II Device Handbook, Volume 1

Page 68

PLLs & Clock Networks

Enhanced PLLs

Stratix II devices contain up to four enhanced PLLs with advanced clock management features. Figure 2–44 shows a diagram of the enhanced PLL.

Figure 2–44. Stratix II Enhanced PLL Note (1)

Clock

INCLK[3..0]

Global or Regional Clock

(4)

Switchover

Circuitry

FBIN

Phase Frequency Detector

(2)

PFD

Charge

Pump

Spread

Spectrum

Lock Detect

& Filter

VCO Phase Selection Selectable at Each PLL Output Port

Loop Filter

VCO

From Adjacent PLL

Post-Scale Counters

/c0

/c1

/c2

/c3

/c4

/c5

Global Clocks

Regional Clocks

I/O Buffers

to I/O or general routing

(3)

Shaded Portions of the PLL are Reconfigurable

VCO Phase Selection Affecting All Outputs

Notes to Figure 2–44:

(1) Each clock source can come from any of the four clock pins that are physically located on the same side of the device

as the PLL. (2) If the feedback input is used, you lose one (or two, if FBIN is differential) external clock output pin. (3) Each enhanced PLL has three differential external clock outputs or six single-ended external clock outputs. (4) The global or regional clock input can be driven by an output from another PLL, a pin-driven dedicated global or

regional clock, or through a clock control block, provided the clock control block is fed by an output from another

PLL or a pin-driven dedicated global or regional clock. An internally generated global signal cannot drive the PLL.

2–68 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 69

Stratix II Architecture

Fast PLLs

Stratix II devices contain up to eight fast PLLs with high-speed serial interfacing ability. Figure 2–45 shows a diagram of the fast PLL.

Figure 2–45. Stratix II Device Fast PLL Notes (1), (2), (3)

Post-Scale

VCO Phase Selection Selectable at each PLL Output Port

VCO

÷m

(5)

÷k

Clock

Input

Global or regional clock

Switchover

Circuitry

(1)

Shaded Portions of the PLL are Reconfigurable

Clock

Phase

Frequency

(4)

Detector

PFD

Charge

÷n

Pump

Loop Filter

Notes to Figure 2–45:

(1) The global or regional clock input can be driven by an output from another PLL, a pin-driven dedicated global or

regional clock, or through a clock control block, provided the clock control block is fed by an output from another

PLL or a pin-driven dedicated global or regional clock. An internally generated global signal cannot drive the PLL. (2) In high-speed differential I/O support mode, this high-speed PLL clock feeds the SERDES circuitry. Stratix II

devices only support one rate of data transfer per fast PLL in high-speed differential I/O support mode. (3) This signal is a differential I/O SERDES control signal. (4) Stratix II fast PLLs only support manual clock switchover. (5) If the design enables this ÷2 counter, then the device can use a VCO frequency range of 150 to 520 MHz.

Counters

÷c0

÷c1

÷c2

÷c3

diffioclk0

load_en0

load_en1

diffioclk1

Global clocks

Regional clocks

to DPA block

(2)

(3)

(2)

f See the PLLs in Stratix II & Stratix II GX Devices chapter in volume 2 of

the Stratix II Device Handbook or the Stratix II GX Device Handbook for more information on enhanced and fast PLLs. See “High-Speed

Differential I/O with DPA Support” on page 2–96 for more information

on high-speed differential I/O support.

I/O Structure

Altera Corporation 2–69 May 2007 Stratix II Device Handbook, Volume 1

The Stratix II IOEs provide many features, including:

■ Dedicated differential and single-ended I/O buffers

■ 3.3-V, 64-bit, 66-MHz PCI compliance

■ 3.3-V, 64-bit, 133-MHz PCI-X 1.0 compliance

■ Joint Test Action Group (JTAG) boundary-scan test (BST) support

■ On-chip driver series termination

■ On-chip parallel termination

■ On-chip termination for differential standards

■ Programmable pull-up during configuration

Page 70

I/O Structure

■ Output drive strength control

■ Tri-state buffers

■ Bus-hold circuitry

■ Programmable pull-up resistors

■ Programmable input and output delays

■ Open-drain outputs

■ DQ and DQS I/O pins

■ Double data rate (DDR) registers

The IOE in Stratix II devices contains a bidirectional I/O buffer, six registers, and a latch for a complete embedded bidirectional single data rate or DDR transfer. Figure 2–46 shows the Stratix II IOE structure. The IOE contains two input registers (plus a latch), two output registers, and two output enable registers. The design can use both input registers and the latch to capture DDR input and both output registers to drive DDR outputs. Additionally, the design can use the output enable (OE) register for fast clock-to-output enable timing. The negative edge-clocked OE register is used for DDR SDRAM interfacing. The Quartus II software automatically duplicates a single OE register that controls multiple output or bidirectional pins.

2–70 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 71

Figure 2–46. Stratix II IOE Structure

Logic Array

Stratix II Architecture

OE Register

Output A

Output B

Input A

Input B

Output Register

OE Register

CLK

Input Register

Input Latch

ENA

The IOEs are located in I/O blocks around the periphery of the Stratix II device. There are up to four IOEs per row I/O block and four IOEs per column I/O block. The row I/O blocks drive row, column, or direct link interconnects. The column I/O blocks drive column interconnects.

Figure 2–47 shows how a row I/O block connects to the logic array. Figure 2–48 shows how a column I/O block connects to the logic array.

Altera Corporation 2–71 May 2007 Stratix II Device Handbook, Volume 1

Page 72

I/O Structure

Figure 2–47. Row I/O Block Connection to the Interconnect Note (1)

LAB Local

Interconnect

R4 & R24

Interconnects

LAB

Direct Link

Interconnect

to Adjacent LAB

C4 Interconnect

io_dataina[3..0] io_datainb[3..0]

Direct Link

Interconnect

to Adjacent LAB

I/O Block Local

Interconnect

io_clk[7:0]

32 Data & Control Signals from Logic Array (1)

Horizontal

I/O Block

Horizontal I/O Block Contains up to Four IOEs

Note to Figure 2–47:

(1) The 32 data and control signals consist of eight data out lines: four lines each for DDR applications

io_dataouta[3..0] and io_dataoutb[3..0], four output enables io_oe[3..0], four input clock enables io_ce_in[3..0], four output clock enables io_ce_out[3..0], four clocks io_clk[3..0], four asynchronous clear and preset signals io_aclr/apreset[3..0], and four synchronous clear and preset signals io_sclr/spreset[3..0].

2–72 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 73

Figure 2–48. Column I/O Block Connection to the Interconnect Note (1)

32 Data &

Control Signals

from Logic Array (1)

Vertical I/O Block

Stratix II Architecture

Vertical I/O Block Contains up to Four IOE

I/O Block

Local Interconnect

R4 & R24

Interconnects

LAB LAB LAB

IO_dataina[3:0] IO_datainb[3:0]

io_clk[7..0]

LAB Local

Interconnect

C4 & C16

Interconnects

Note to Figure 2–48:

(1) The 32 data and control signals consist of eight data out lines: four lines each for DDR applications

Altera Corporation 2–73 May 2007 Stratix II Device Handbook, Volume 1

Page 74

I/O Structure

There are 32 control and data signals that feed each row or column I/O block. These control and data signals are driven from the logic array. The row or column IOE clocks, io_clk[7..0], provide a dedicated routing resource for low-skew, high-speed clocks. I/O clocks are generated from global or regional clocks (see the “PLLs & Clock Networks” section).

Figure 2–49 illustrates the signal paths through the I/O block.

Figure 2–49. Signal Path through the I/O Block

Row or Column

io_clk[7..0]

To Other IOEs

To Logic

Array

From Logic

Array

io_dataina

io_datainb

io_oe

io_ce_in

io_ce_out

io_aclr

io_sclr

io_clk

io_dataouta

io_dataoutb

ce_in

ce_out

Control

Signal

Selection

aclr/apreset

sclr/spreset

clk_in

clk_out

IOE

Each IOE contains its own control signal selection for the following control signals: oe, ce_in, ce_out, aclr/apreset, sclr/spreset, clk_in, and clk_out. Figure 2–50 illustrates the control signal selection.

2–74 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 75

Figure 2–50. Control Signal Selection per IOE

Dedicated I/O Clock [7..0]

Local Interconnect

io_oe

Stratix II Architecture

Local Interconnect

io_sclr

io_aclr

io_ce_out

io_ce_in

io_clk

clk_out

ce_out

ce_inclk_in

aclr/apreset

sclr/spreset

Notes to Figure 2–50:

(1) Control signals ce_in, ce_out, aclr/apreset, sclr/spreset, and oe can be global signals even though their

control selection multiplexers are not directly fed by the ioe_clk[7..0] signals. The ioe_clk signals can drive the I/O local interconnect, which then drives the control selection multiplexers.

In normal bidirectional operation, the input register can be used for input data requiring fast setup times. The input register can have its own clock input and clock enable separate from the OE and output registers. The output register can be used for data requiring fast clock-to-output performance. The OE register can be used for fast clock-to-output enable timing. The OE and output register share the same clock source and the same clock enable source from local interconnect in the associated LAB, dedicated I/O clocks, and the column and row interconnects.

Altera Corporation 2–75 May 2007 Stratix II Device Handbook, Volume 1

Page 76

I/O Structure

Figure 2–51 shows the IOE in bidirectional configuration.

Figure 2–51. Stratix II IOE in Bidirectional I/O Configuration Note (1)

ioe_clk[7..0]

Column, Row,

or Local

Interconnect

OE Register

clkout

ENA

ce_out

aclr/apreset

Chip-Wide Reset

sclr/spreset

clkin

CLRN/PRN

Output Register

ENA

CLRN/PRN

Input Register

Output

Pin Delay

Drive Strength Control

Open-Drain Output

Input Pin to

Logic Array Delay

Input Pin to

Input Register Delay

OE Register

Delay

CCIO

PCI Clamp (2)

CCIO

On-Chip

Termination

Bus-Hold

Circuit

Programmable Pull-Up Resistor

ce_in

ENA CLRN/PRN

Notes to Figure 2–51:

(1) All input signals to the IOE can be inverted at the IOE. (2) The optional PCI clamp is only available on column I/O pins.

2–76 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 77

Stratix II Architecture

The Stratix II device IOE includes programmable delays that can be activated to ensure input IOE register-to-logic array register transfers, input pin-to-logic array register transfers, or output IOE register-to-pin transfers.

A path in which a pin directly drives a register may require the delay to ensure zero hold time, whereas a path in which a pin drives a register through combinational logic may not require the delay. Programmable delays exist for decreasing input-pin-to-logic-array and IOE input register delays. The Quartus II Compiler can program these delays to automatically minimize setup time while providing a zero hold time. Programmable delays can increase the register-to-pin delays for output and/or output enable registers. Programmable delays are no longer required to ensure zero hold times for logic array register-to-IOE register transfers. The Quartus II Compiler can create the zero hold time for these transfers. Table 2–13 shows the programmable delays for Stratix II devices.

Table 2–13. Stratix II Programmable Delay Chain

Programmable Delays Quartus II Logic Option

Input pin to logic array delay Input delay from pin to internal cells

Input pin to input register delay Input delay from pin to input register

Output pin delay Delay from output register to output pin

Output enable register t

delay Delay to output enable pin

The IOE registers in Stratix II devices share the same source for clear or preset. You can program preset or clear for each individual IOE. You can also program the registers to power up high or low after configuration is complete. If programmed to power up low, an asynchronous clear can control the registers. If programmed to power up high, an asynchronous preset can control the registers. This feature prevents the inadvertent activation of another device's active-low input upon power-up. If one register in an IOE uses a preset or clear signal then all registers in the IOE must use that same signal if they require preset or clear. Additionally, a synchronous reset signal is available for the IOE registers.

Double Data Rate I/O Pins

Stratix II devices have six registers in the IOE, which support DDR interfacing by clocking data on both positive and negative clock edges. The IOEs in Stratix II devices support DDR inputs, DDR outputs, and bidirectional DDR modes.

Altera Corporation 2–77 May 2007 Stratix II Device Handbook, Volume 1

Page 78

I/O Structure

When using the IOE for DDR inputs, the two input registers clock double rate input data on alternating edges. An input latch is also used in the IOE for DDR input acquisition. The latch holds the data that is present during the clock high times. This allows both bits of data to be synchronous with the same clock edge (either rising or falling). Figure 2–52 shows an IOE configured for DDR input. Figure 2–53 shows the DDR input timing diagram.

Figure 2–52. Stratix II IOE in DDR Input I/O Configuration Notes (1), (2), (3)

ioe_clk[7..0]

Column, Row,

or Local

Interconnect

DQS Local

(2)

Bus

To DQS Logic

Block

(3)

VCCIO

PCI Clamp (4)

VCCIO

Programmable Pull-Up Resistor

nput Pin to

sclr/spreset

clkin

ce_in

aclr/apreset

Chip-Wide Reset

Input RegisterDelay

Input Register

ENA

CLRN/PRN

Input Register

ENA CLRN/PRN

Notes to Figure 2–52:

(1) All input signals to the IOE can be inverted at the IOE. (2) This signal connection is only allowed on dedicated DQ function pins. (3) This signal is for dedicated DQS function pins only. (4) The optional PCI clamp is only available on column I/O pins.

Latch

ENA CLRN/PRN

On-Chip

Termination

Bus-Hold

Circuit

2–78 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 79

Figure 2–53. Input Timing Diagram in DDR Mode

Stratix II Architecture

Data at input pin

Input To Logic Array

CLK

A0B0 B1 A1A1B2 A2 A3

B0 B2 B3

B3 B4

A2 A3

When using the IOE for DDR outputs, the two output registers are configured to clock two data paths from ALMs on rising clock edges. These output registers are multiplexed by the clock to drive the output pin at a ×2 rate. One output register clocks the first bit out on the clock high time, while the other output register clocks the second bit out on the clock low time. Figure 2–54 shows the IOE configured for DDR output.

Figure 2–55 shows the DDR output timing diagram.

Altera Corporation 2–79 May 2007 Stratix II Device Handbook, Volume 1

Page 80

I/O Structure

Figure 2–54. Stratix II IOE in DDR Output I/O Configuration Notes (1), (2)

ioe_clk[7..0]

Column, Row,

or Local

Interconnect

clkout

ce_out

aclr/apreset

sclr/spreset

Chip-Wide Reset

OE Register

ENA CLRN/PRN

OE Register

ENA CLRN/PRN

Output Register

ENA CLRN/PRN

Output Register

ENA

CLRN/PRN

Used for DDR, DDR2 SDRAM

Output

Pin Delay

clk

Drive Strength

Control

Open-Drain Output

OE Register

tCO Delay

CCIO

PCI Clamp (3)

CCIO

Programmable Pull-Up Resistor

On-Chip

Ter mination

Bus-Hold

Circuit

Notes to Figure 2–54:

(1) All input signals to the IOE can be inverted at the IOE. (2) The tri-state buffer is active low. The DDIO megafunction represents the tri-state buffer as active-high with an

inverter at the OE register data port. Similarly, the aclr and apreset signals are also active-high at the input ports of the DDIO megafunction.

(3) The optional PCI clamp is only available on column I/O pins.

2–80 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 81

Figure 2–55. Output TIming Diagram in DDR Mode

CLK

A2A1 A3 A4

From Internal

Registers

B1 B2 B3 B4

Stratix II Architecture

DDR output

B1 A1 B2 A2 B3 A3 B4 A4

The Stratix II IOE operates in bidirectional DDR mode by combining the DDR input and DDR output configurations. The negative-edge-clocked OE register holds the OE signal inactive until the falling edge of the clock. This is done to meet DDR SDRAM timing requirements.

External RAM Interfacing

In addition to the six I/O registers in each IOE, Stratix II devices also have dedicated phase-shift circuitry for interfacing with external memory interfaces. Stratix II devices support DDR and DDR2 SDRAM, QDR II SRAM, RLDRAM II, and SDR SDRAM memory interfaces. In every Stratix II device, the I/O banks at the top (banks 3 and 4) and bottom (banks 7 and 8) of the device support DQ and DQS signals with DQ bus modes of ×4, ×8/×9, ×16/×18, or ×32/×36. Table 2–14 shows the number of DQ and DQS buses that are supported per device.

Table 2–14. DQS & DQ Bus Mode Support (Part 1 of 2) Note (1)

Device Package

EP2S15 484-pin FineLine BGA 8 4 0 0

672-pin FineLine BGA 18 8 4 0

EP2S30 484-pin FineLine BGA 8 4 0 0

672-pin FineLine BGA 18 8 4 0

EP2S60 484-pin FineLine BGA 8 4 0 0

672-pin FineLine BGA 18 8 4 0

1,020-pin FineLine BGA 36 18 8 4

Number of ×4 Groups

Number of

×8/×9 Groups

Number of

×16/×18 Groups

Number of

×32/×36 Groups

Altera Corporation 2–81 May 2007 Stratix II Device Handbook, Volume 1

Page 82

I/O Structure

Table 2–14. DQS & DQ Bus Mode Support (Part 2 of 2) Note (1)

Device Package

EP2S90 484-pin Hybrid FineLine BGA 8 4 0 0

780-pin FineLine BGA 18 8 4 0

1,020-pin FineLine BGA 36 18 8 4

1,508-pin FineLine BGA 36 18 8 4

EP2S130 780-pin FineLine BGA 18 8 4 0

1,020-pin FineLine BGA 36 18 8 4

1,508-pin FineLine BGA 36 18 8 4

EP2S180 1,020-pin FineLine BGA 36 18 8 4

1,508-pin FineLine BGA 36 18 8 4

Notes to Table 2–14:

(1) Check the pin table for each DQS/DQ group in the different modes.

Number of ×4 Groups

Number of

×8/×9 Groups

Number of

×16/×18 Groups

Number of

×32/×36 Groups

A compensated delay element on each DQS pin automatically aligns input DQS synchronization signals with the data window of their corresponding DQ data signals. The DQS signals drive a local DQS bus in the top and bottom I/O banks. This DQS bus is an additional resource to the I/O clocks and is used to clock DQ input registers with the DQS signal.

The Stratix II device has two phase-shifting reference circuits, one on the top and one on the bottom of the device. The circuit on the top controls the compensated delay elements for all DQS pins on the top. The circuit on the bottom controls the compensated delay elements for all DQS pins on the bottom.

Each phase-shifting reference circuit is driven by a system reference clock, which must have the same frequency as the DQS signal. Clock pins CLK[15..12]p feed the phase circuitry on the top of the device and clock pins CLK[7..4]p feed the phase circuitry on the bottom of the device. In addition, PLL clock outputs can also feed the phase-shifting reference circuits.

Figure 2–56 illustrates the phase-shift reference circuit control of each

DQS delay shift on the top of the device. This same circuit is duplicated on the bottom of the device.

2–82 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 83

Figure 2–56. DQS Phase-Shift Circuitry Notes (1), (2), (3), (4)

From PLL 5

DQSn

DQS

DQSn

DQS

CLK[15..12]p

(2)

DQS

Stratix II Architecture

(3)

DQSn

DQS

DQSn

to IOE

DQS

Phase-Shift

Circuitry

to IOE to IOE to IOE

ΔtΔtΔtΔtΔtΔt Δt

to IOE

Δt

to IOE

DQS Logi Blocks

Notes to Figure 2–56:

(1) There are up to 18 pairs of DQS and DQSn pins available on the top or the bottom of the Stratix II device. There are

up to 10 pairs on the right side and 8 pairs on the left side of the DQS phase-shift circuitry. (2) The Δt module represents the DQS logic block. (3) Clock pins CLK[15..12]p feed the phase-shift circuitry on the top of the device and clock pins CLK[7..4]p feed

the phase circuitry on the bottom of the device. You can also use a PLL clock output as a reference clock to the phase-

shift circuitry. (4) You can only use PLL 5 to feed the DQS phase-shift circuitry on the top of the device and PLL 6 to feed the DQS

phase-shift circuitry on the bottom of the device.

These dedicated circuits combined with enhanced PLL clocking and phase-shift ability provide a complete hardware solution for interfacing to high-speed memory.

f For more information on external memory interfaces, refer to the

External Memory Interfaces in Stratix II & Stratix II GX Devices chapter in volume 2 of the Stratix II Device Handbook or the Stratix II GX Device Handbook.

Programmable Drive Strength

The output buffer for each Stratix II device I/O pin has a programmable drive strength control for certain I/O standards. The LVTTL, LVCMOS, SSTL, and HSTL standards have several levels of drive strength that the user can control. The default setting used in the Quartus II software is the maximum current strength setting that is used to achieve maximum I/O performance. For all I/O standards, the minimum setting is the lowest drive strength that guarantees the I minimum settings provides signal slew rate control to reduce system noise and signal overshoot.

Altera Corporation 2–83 May 2007 Stratix II Device Handbook, Volume 1

of the standard. Using

OH/IOL

Page 84

I/O Structure

Table 2–15 shows the possible settings for the I/O standards with drive

strength control.

Table 2–15. Programmable Drive Strength Note (1)

/ IOL Current Strength

I/O Standard

Setting (mA) for Column

I/O Pins

3.3-V LVTTL 24, 20, 16, 12, 8, 4 12, 8, 4

3.3-V LVCMOS 24, 20, 16, 12, 8, 4 8, 4

2.5-V LVTTL/LVCMOS 16, 12, 8, 4 12, 8, 4

1.8-V LVTTL/LVCMOS 12, 10, 8, 6, 4, 2 8, 6, 4, 2

1.5-V LVCMOS 8, 6, 4, 2 4, 2

SSTL-2 Class I 12, 8 12, 8

SSTL-2 Class II 24, 20, 16 16

SSTL-18 Class I 12, 10, 8, 6, 4 10, 8, 6, 4

SSTL-18 Class II 20, 18, 16, 8 -

HSTL-18 Class I 12, 10, 8, 6, 4 12, 10, 8, 6, 4

HSTL-18 Class II 20, 18, 16 -

HSTL-15 Class I 12, 10, 8, 6, 4 8, 6, 4

HSTL-15 Class II 20, 18, 16 -

Note to Table 2–15:

(1) The Quartus II software default current setting is the maximum setting for each

I/O standard.

IOH / IOL Current Strength Setting (mA) for Row I/O

Pins

Open-Drain Output

Stratix II devices provide an optional open-drain (equivalent to an opencollector) output for each I/O pin. This open-drain output enables the device to provide system-level control signals (e.g., interrupt and writeenable signals) that can be asserted by any of several devices.

Bus Hold

Each Stratix II device I/O pin provides an optional bus-hold feature. The bus-hold circuitry can weakly hold the signal on an I/O pin at its last-driven state. Since the bus-hold feature holds the last-driven state of the pin until the next input signal is present, you do not need an external pull-up or pull-down resistor to hold a signal level when the bus is tri-stated.

2–84 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 85

Stratix II Architecture

The bus-hold circuitry also pulls undriven pins away from the input threshold voltage where noise can cause unintended high-frequency switching. You can select this feature individually for each I/O pin. The bus-hold output drives no higher than V

to prevent overdriving

CCIO

signals. If the bus-hold feature is enabled, the programmable pull-up option cannot be used. Disable the bus-hold feature when the I/O pin has been configured for differential signals.

The bus-hold circuitry uses a resistor with a nominal resistance (R

) of

approximately 7 kΩ to weakly pull the signal level to the last-driven state. See the DC & Switching Characteristics chapter in the Stratix II Device Handbook, Volume 1, for the specific sustaining current driven through this resistor and overdrive current used to identify the next-driven input level. This information is provided for each V

voltage level.

CCIO

The bus-hold circuitry is active only after configuration. When going into user mode, the bus-hold circuit captures the value on the pin present at the end of configuration.

Programmable Pull-Up Resistor

Each Stratix II device I/O pin provides an optional programmable pull-up resistor during user mode. If you enable this feature for an I/O pin, the pull-up resistor (typically 25 kΩ) weakly holds the output to the

level of the output pin’s bank.

CCIO

Programmable pull-up resistors are only supported on user I/O pins, and are not supported on dedicated configuration pins, JTAG pins or dedicated clock pins.

Advanced I/O Standard Support

Stratix II device IOEs support the following I/O standards:

■ 3.3-V LVTTL/LVCMOS

■ 2.5-V LVTTL/LVCMOS

■ 1.8-V LVTTL/LVCMOS

■ 1.5-V LVCMOS

■ 3.3-V PCI

■ 3.3-V PCI-X mode 1

■ LV DS

■ LVPECL (on input and output clocks only)

■ HyperTransport technology

■ Differential 1.5-V HSTL Class I and II

■ Differential 1.8-V HSTL Class I and II

■ Differential SSTL-18 Class I and II

■ Differential SSTL-2 Class I and II

Altera Corporation 2–85 May 2007 Stratix II Device Handbook, Volume 1

Page 86

I/O Structure

■ 1.5-V HSTL Class I and II

■ 1.8-V HSTL Class I and II

■ 1.2-V HSTL

■ SSTL-2 Class I and II

■ SSTL-18 Class I and II

Table 2–16 describes the I/O standards supported by Stratix II devices.

Table 2–16. Stratix II Supported I/O Standards (Part 1 of 2)

I/O Standard Type

LVTTL Single-ended - 3.3 -

LVCMOS Single-ended - 3.3 -

2.5 V Single-ended - 2.5 -

1.8 V Single-ended - 1.8 -

1.5-V LVCMOS Single-ended - 1.5 -

3.3-V PCI Single-ended - 3.3 -

3.3-V PCI-X mode 1 Single-ended - 3.3 -

LVDS Differential - 2.5 (3) -

LVPECL (1) Differential - 3.3 -

HyperTransport technology Differential - 2.5 -

Differential 1.5-V HSTL Class I and II (2)

Differential 1.8-V HSTL Class I and II (2)

Differential SSTL-18 Class I and II (2)

Differential SSTL-2 Class I and II (2)

1.2-V HSTL(4) Voltage-referenced 0.6 1.2 0.6

1.5-V HSTL Class I and II Voltage-referenced 0.75 1.5 0.75

1.8-V HSTL Class I and II Voltage-referenced 0.9 1.8 0.9

SSTL-18 Class I and II Voltage-referenced 0.90 1.8 0.90

Differential 0.75 1.5 0.75

Differential 0.90 1.8 0.90

Differential 1.25 2.5 1.25

Input Reference

Voltage (V

REF

) (V)

Output Supply

Voltage (V

CCIO

Board Termination

) (V)

Voltage (VTT) (V)

2–86 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 87

Table 2–16. Stratix II Supported I/O Standards (Part 2 of 2)

Stratix II Architecture

I/O Standard Type

SSTL-2 Class I and II Voltage-referenced 1.25 2.5 1.25

Notes to Table 2–16:

(1) This I/O standard is only available on input and output column clock pins. (2) This I/O standard is only available on input clock pins and DQS pins in I/O banks 3, 4, 7, and 8, and output clock

pins in I/O banks 9,10, 11, and 12.

(3) V

(4) 1.2-V HSTL is only supported in I/O banks 4,7, and 8.

is 3.3 V when using this I/O standard in input and output column clock pins (in I/O banks 9, 10, 11, and 12).

CCIO

The clock input pins supporting LVDS on banks 3, 4, 7, and 8 use V dependency on the V

level of the bank.

CCIO

Input Reference

Voltage (V

REF

) (V)

Output Supply

Voltage (V

for LVDS input operations and have no

CCINT

CCIO

Board Termination

) (V)

Voltage (VTT) (V)

f For more information on I/O standards supported by Stratix II I/O

banks, refer to the Selectable I/O Standards in Stratix II & Stratix II GX Devices chapter in volume 2 of the Stratix II Device Handbook or the Stratix II GX Device Handbook.

Stratix II devices contain eight I/O banks and four enhanced PLL external clock output banks, as shown in Figure 2–57. The four I/O banks on the right and left of the device contain circuitry to support high-speed differential I/O for LVDS and HyperTransport inputs and outputs. These banks support all Stratix II I/O standards except PCI or PCI-X I/O pins, and SSTL-18 Class II and HSTL outputs. The top and bottom I/O banks support all single-ended I/O standards. Additionally, enhanced PLL external clock output banks allow clock output capabilities such as differential support for SSTL and HSTL.

Altera Corporation 2–87 May 2007 Stratix II Device Handbook, Volume 1

Page 88

I/O Structure

Figure 2–57. Stratix II I/O Banks Notes (1), (2), (3), (4)

DQS8T DQS7T DQS6T DQS5T

VREF0B3 VREF1B3 VREF2B3 VREF3B3 VREF4B3 VREF0B4 VREF1B4 VREF2B4 VREF3B4 VREF4B4

PLL7

Bank 3 Bank 4

PLL11 PLL5

Bank 11 Bank 9

DQS4T DQS3T DQS2T DQS1T DQS0T

PLL10

VREF4B2

VREF3B2VR EF2B2VREF1B2VREF0B2

VREF4B1

VREF3B1VREF2B1VREF 1B1VREF0B1

This I/O bank supports LVDS and LVPECL standards for input clock operations. Differential HSTL and differential SSTL standards are supported for both input and output operations.

Bank 2

PLL1

PLL2

This I/O bank supports LVDS

Bank 1

and LVPECL standards for input clock operations. Differential HSTL and differential SSTL standards are supported for both input and output operations.

PLL8 PLL9

VREF4B8 VREF3B8 VREF2B8 VREF1B8 VREF0B8 VREF4B7 VREF3B7 VREF2B7 VREF1B7 VREF0B7

Bank 8 Bank 7

I/O banks 3, 4, 9 & 11 support all single-ended I/O standards and differential I/O standards except for HyperTransport technology for both input and output operations.

I/O banks 1, 2, 5 & 6 support LVTTL, LVCMOS,

2.5-V, 1.8-V, 1.5-V, SSTL-2, SSTL-18 Class I, HSTL-18 Class I, HSTL-15 Class I, LVDS, and HyperTransport standards for input and output

operations. HSTL-18 Class II, HSTL-15-Class II,

SSTL-18 Class II standards are only supported

for input operations.

I/O banks 7, 8, 10 & 12 support all single-ended I/O standards and differential I/O standards except for HyperTransport technology for both input and output operations.

Bank 12 B ank 10

PLL12 PLL6

DQS4B DQS3B DQS2B DQS1B DQS0BDQS8B DQS7B DQS6B DQS5B

This I/O bank supports LVDS and LVPECL standards for input clock operations. Differential HSTL and differential SSTL standards are supported for both input and output operations.

VREF0B5

VREF1B5VR EF2B5VREF3B5VREF4B5

Bank 5

PLL4

PLL3

VREF0B6

VREF1B6VREF2B6VREF 3B6VREF4B6

Bank 6

Notes to Figure 2–57:

(1) Figure 2–57 is a top view of the silicon die that corresponds to a reverse view for flip-chip packages. It is a graphical

representation only.

(2) Depending on the size of the device, different device members have different numbers of V

groups. Refer to the

REF

pin list and the Quartus II software for exact locations.

(3) Banks 9 through 12 are enhanced PLL external clock output banks. These PLL banks utilize the adjacent V

REF

group when voltage-referenced standards are implemented. For example, if an SSTL input is implemented in PLL bank 10, the voltage level at VREFB7 is the reference voltage level for the SSTL input.

(4) Horizontal I/O banks feature SERDES and DPA circuitry for high speed differential I/O standards. See the High

Speed Differential I/O Interfaces in Stratix II & Stratix II GX Devices chapter of the Stratix II Device Handbook, Volume 2 or the Stratix II GX Device Handbook, Volume 2 for more information on differential I/O standards.

2–88 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 89

Stratix II Architecture

Each I/O bank has its own VCCIO pins. A single device can support

1.5-, 1.8-, 2.5-, and 3.3-V interfaces; each bank can support a different level independently. Each bank also has dedicated VREF pins to

CCIO

support the voltage-referenced standards (such as SSTL-2). The PLL banks utilize the adjacent VREF group when voltage-referenced standards are implemented. For example, if an SSTL input is implemented in PLL bank 10, the voltage level at VREFB7 is the reference voltage level for the SSTL input.

I/O pins that reside in PLL banks 9 through 12 are powered by the VCC_PLL<5, 6, 11, or 12>_OUT pins, respectively. The EP2S60F484, EP2S60F780, EP2S90H484, EP2S90F780, and EP2S130F780 devices do not support PLLs 11 and 12. Therefore, any I/O pins that reside in bank 11 are powered by the VCCIO3 pin, and any I/O pins that reside in bank 12 are powered by the VCCIO8 pin.

Each I/O bank can support multiple standards with the same V input and output pins. Each bank can support one V example, when V

is 3.3 V, a bank can s u p port LV T TL, LVC M OS, an d

CCIO

voltage level. For

REF

CCIO

for

3.3-V PCI for inputs and outputs.

On-Chip Termination

Stratix II devices provide differential (for the LVDS or HyperTransport technology I/O standard), series, and parallel on-chip termination to reduce reflections and maintain signal integrity. On-chip termination simplifies board design by minimizing the number of external termination resistors required. Termination can be placed inside the package, eliminating small stubs that can still lead to reflections.

Stratix II devices provide four types of termination:

■ Differential termination (R

■ Series termination (R

■ Parallel termination (R

)

) without calibration

) with calibration

Altera Corporation 2–89 May 2007 Stratix II Device Handbook, Volume 1

Page 90

I/O Structure

Table 2–17 shows the Stratix II on-chip termination support per I/O bank.

Table 2–17. On-Chip Termination Support by I/O Banks (Part 1 of 2)

On-Chip Termination Support I/O Standard Support Top & Bottom Banks Left & Right Banks

Series termination without calibration

3.3-V LVTTL

3.3-V LVCMOS

2.5-V LVTTL

2.5-V LVCMOS

1.8-V LVTTL

1.8-V LVCMOS

1.5-V LVTTL

1.5-V LVCMOS

SSTL-2 Class I and II

SSTL-18 Class I

SSTL-18 Class II

1.8-V HSTL Class I

1.8-V HSTL Class II

1.5-V HSTL Class I

1.2-V HSTL

vv vv

v v

v vv

vv v

2–90 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 91

Stratix II Architecture

Table 2–17. On-Chip Termination Support by I/O Banks (Part 2 of 2)

On-Chip Termination Support I/O Standard Support Top & Bottom Banks Left & Right Banks

Series termination with calibration

Parallel termination with calibration

Differential termination (1)

3.3-V LVTTL

3.3-V LVCMOS

2.5-V LVTTL

2.5-V LVCMOS

1.8-V LVTTL

1.8-V LVCMOS

1.5-V LVTTL

1.5-V LVCMOS

SSTL-2 Class I and II

SSTL-18 Class I and II

1.8-V HSTL Class I

1.8-V HSTL Class II

1.5-V HSTL Class I

1.2-V HSTL

SSTL-2 Class I and II

SSTL-18 Class I and II

1.8-V HSTL Class I

1.8-V HSTL Class II

1.5-V HSTL Class I and II

1.2-V HSTL

LV DS

HyperTransport technology

v v

Note to Table 2–17:

(1) Clock pins CLK1, CLK3, CLK9, CLK11, and pins FPLL[7..10]CLK do not support differential on-chip

termination. Clock pins CLK0, CLK2, CLK8, and CLK10 do support differential on-chip termination. Clock pins in the top and bottom banks (CLK[4..7, 12..15]) do not support differential on-chip termination.

Altera Corporation 2–91 May 2007 Stratix II Device Handbook, Volume 1

Page 92

I/O Structure

Differential On-Chip Termination

Stratix II devices support internal differential termination with a nominal resistance value of 100 Ω for LVDS or HyperTransport technology input receiver buffers. LVPECL input signals (supported on clock pins only) require an external termination resistor. Differential on-chip termination is supported across the full range of supported differential data rates as shown in the DC & Switching Characteristics chapter in volume 1 of the Stratix II Device Handbook.

f For more information on differential on-chip termination, refer to the

High-Speed Differential I/O Interfaces with DPA in Stratix II & Stratix II GX Devices chapter in volume 2 of the Stratix II Device Handbook or the Stratix II GX Device Handbook.

f For more information on tolerance specifications for differential on-chip

termination, refer to the DC & Switching Characteristics chapter in volume 1 of the Stratix II Device Handbook.

On-Chip Series Termination Without Calibration

Stratix II devices support driver impedance matching to provide the I/O driver with controlled output impedance that closely matches the impedance of the transmission line. As a result, reflections can be significantly reduced. Stratix II devices support on-chip series termination for single-ended I/O standards with typical R and 50 Ω. Once matching impedance is selected, current drive strength is no longer selectable. Table 2–17 shows the list of output standards that support on-chip series termination without calibration.

values of 25

On-Chip Series Termination with Calibration

Stratix II devices support on-chip series termination with calibration in column I/O pins in top and bottom banks. There is one calibration circuit for the top I/O banks and one circuit for the bottom I/O banks. Each on-chip series termination calibration circuit compares the total impedance of each I/O buffer to the external 25- or 50-Ω resistors connected to the RUP and RDN pins, and dynamically enables or disables the transistors until they match. Calibration occurs at the end of device configuration. Once the calibration circuit finds the correct impedance, it powers down and stops changing the characteristics of the drivers.

f For more information on series on-chip termination supported by

Stratix II devices, refer to the Selectable I/O Standards in Stratix II & Stratix II GX Devices chapter in volume 2 of the Stratix II Device Handbook or the Stratix II GX Device Handbook.

2–92 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 93

Stratix II Architecture

f For more information on tolerance specifications for on-chip termination

with calibration, refer to the DC & Switching Characteristics chapter in volume 1 of the Stratix II Device Handbook.

On-Chip Parallel Termination with Calibration

Stratix II devices support on-chip parallel termination with calibration for column I/O pins only. There is one calibration circuit for the top I/O banks and one circuit for the bottom I/O banks. Each on-chip parallel termination calibration circuit compares the total impedance of each I/O buffer to the external 50-Ω resistors connected to the RUP and RDN pins and dynamically enables or disables the transistors until they match. Calibration occurs at the end of device configuration. Once the calibration circuit finds the correct impedance, it powers down and stops changing the characteristics of the drivers.

1 On-chip parallel termination with calibration is only supported

for input pins.

f For more information on on-chip termination supported by Stratix II

devices, refer to the Selectable I/O Standards in Stratix II & Stratix II GX Devices chapter in volume 2 of the Stratix II Device Handbook or the Stratix II GX Device Handbook.

f For more information on tolerance specifications for on-chip termination

with calibration, refer to the DC & Switching Characteristics chapter in volume 1 of the Stratix II Device Handbook.

MultiVolt I/O Interface

The Stratix II architecture supports the MultiVolt I/O interface feature that allows Stratix II devices in all packages to interface with systems of different supply voltages.

The Stratix II VCCINT pins must always be connected to a 1.2-V power supply. With a 1.2-V V tolerant. The VCCIO pins can be connected to either a 1.5-, 1.8-, 2.5-, or

3.3-V power supply, depending on the output requirements. The output

levels are compatible with systems of the same voltage as the power supply (for example, when VCCIO pins are connected to a 1.5-V power supply, the output levels are compatible with 1.5-V systems).

The Stratix II VCCPD power pins must be connected to a 3.3-V power supply. These power pins are used to supply the pre-driver power to the output buffers, which increases the performance of the output pins. The VCCPD pins also power configuration input pins and JTAG input pins.

Altera Corporation 2–93 May 2007 Stratix II Device Handbook, Volume 1

level, input pins are 1.5-, 1.8-, 2.5-, and 3.3-V

CCINT

Page 94

I/O Structure

Table 2–18 summarizes Stratix II MultiVolt I/O support.

Table 2–18. Stratix II MultiVolt I/O Support Note (1)

(V)

CCIO

1.2 (4) v (2) v (2) v (2) v (2) v (4)

1.5 (4) vvv (2) v (2) v (3) v

1.8 (4) v vv (2) v (2) v (3) v (3) v

2.5 (4) vvv (3) v (3) v (3) v

3.3 (4) v vv (3) v (3) v (3) v (3) vv

Notes to Table 2–18:

(1) To drive inputs higher than V

and LVCMOS input levels to overdrive input buffer option in the Quartus II software.

(2) The pin current may be slightly higher than the default value. You must verify that the driving device’s VOL

maximum and VOH minimum voltages do not violate the applicable Stratix II VIL maximum and VIH minimum voltage specifications.

(3) Although V

a different level can still interface with the Stratix II device if it has inputs that tolerate the V

(4) Stratix II devices do not support 1.2-V LVTTL and 1.2-V LVCMOS. Stratix II devices support 1.2-V HSTL.

1.2 1.5 1.8 2.5 3.3 1.2 1.5 1.8 2.5 3.3 5.0

but less than 4.0 V, disable the PCI clamping diode and select the Allow LVTTL

CCIO

specifies the voltage necessary for the Stratix II device to drive out, a receiving device powered at

CCIO

The TDO and nCEO pins are powered by V

of the bank that t hey re side

CCIO

value.

in. TDO is in I/O bank 4 and nCEO is in I/O bank 7.

Input Signal (V) Output Signal (V)

Ideally, the V

supplies for the I/O buffers of any two connected pins are

at the same voltage level. This may not always be possible depending on the V

level of TDO and nCEO pins on master devices and the

CCIO

configuration voltage level chosen by VCCSEL on slave devices. Master and slave devices can be in any position in the chain. Master indicates that it is driving out TDO or nCEO to a slave device.

For multi-device passive configuration schemes, the nCEO pin of the master device drives the nCE pin of the slave device. The VCCSEL pin on the slave device selects which input buffer is used for nCE. When VCCSEL is logic high, it selects the 1.8-V/1.5-V buffer powered by V

CCIO

. When

VCCSEL is logic low it selects the 3.3-V/2.5-V input buffer powered by

. The ideal case is to have the V

CCPD

of the nCEO bank in a master

CCIO

device match the VCCSEL settings for the nCE input buffer of the slave device it is connected to, but that may not be possible depending on the application. Table 2–19 contains board design recommendations to ensure that nCEO can successfully drive nCE for all power supply combinations.

2–94 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 95

Table 2–19. Board Design Recommendations for nCEO

Stratix II Architecture

Stratix II nCEO V

Voltage Level in I/O Bank 7

CCIO

nCE Input Buffer Power in I/O

Bank 3

CCIO

3.3 V

VCCSEL high

Bank 3 = 1.5 V)

CCIO

VCCSEL high

Bank 3 = 1.8 V)

CCIO

VCCSEL low

(nCE Powered by V

Notes to Table 2–19:

(1) Input buffer is 3.3-V tolerant. (2) The nCEO output buffer meets VOH (MIN) = 2.4 V. (3) Input buffer is 2.5-V tolerant. (4) The nCEO output buffer meets VOH (MIN) = 2.0 V. (5) Input buffer is 1.8-V tolerant. (6) An external 250-Ω pull-up resistor is not required, but recommended if signal levels on the board are not optimal.

CCPD

= 3.3V)

v(1), (2) v (3), (4) v (5) vv

v (1), (2) v (3), (4) vv

v v (4) v (6)

CCIO

2.5 V

CCIO

1.8 V

CCIO

1.5 V

Level shifter

required

CCIO

1.2 V

Level shifter

required

Level shifter

required

For JTAG chains, the TDO pin of the first device drives the TDI pin of the second device in the chain. The V

input on JTAG input I/O cells

CCSEL

(TCK, TMS, TDI, and TRST) is internally hardwired to GND selecting the

3.3-V/2.5-V input buffer powered by V

of the TDO bank from the first device to match the V

CCIO

. The ideal case is to have the

CCPD

CCSEL

settings for TDI on the second device, but that may not be possible depending on the application. Table 2–20 contains board design recommendations to ensure proper JTAG chain operation.

Table 2–20. Supported TDO/TDI Voltage Combinations (Part 1 of 2)

Device

Stratix II Always

Altera Corporation 2–95 May 2007 Stratix II Device Handbook, Volume 1

TDI Input

Buffer Power

(3.3V)

CCPD

= 3.3 V V

CCIO

v (1) v (2) v (3)

Stratix II TDO V

= 2.5 V V

CCIO

Voltage Level in I/O Bank 4

CCIO

= 1.8 V V

CCIO

Level shifter

required

= 1.5 V V

= 1.2 V

CCIO

Level shifter

required

Page 96

High-Speed Differential I/O with DPA Support

Table 2–20. Supported TDO/TDI Voltage Combinations (Part 2 of 2)

Device

Non-Stratix II VCC = 3.3 V

Notes to Table 2–20:

(1) The TDO output buffer meets V (2) The TDO output buffer meets V (3) An external 250-Ω pull-up resistor is not required, but recommended if signal levels on the board are not optimal. (4) Input buffer must be 3.3-V tolerant. (5) Input buffer must be 2.5-V tolerant. (6) Input buffer must be 1.8-V tolerant.

High-Speed Differential I/O with DPA

TDI Input

Buffer Power

VCC = 2.5 V

VCC = 1.8 V

VCC = 1.5 V

= 3.3 V V

CCIO

v (1) v (2) v (3)

v (1), (4) v (2) v (3)

v (1), (4) v (2), (5) v

v (1), (4) v (2), (5) v (6) vv

(MIN) = 2.4 V.

(MIN) = 2.0 V.

Stratix II devices contain dedicated circuitry for supporting differential standards at speeds up to 1 Gbps. The LVDS and HyperTransport differential I/O standards are supported in the Stratix II device. In addition, the LVPECL I/O standard is supported on input and output clock pins on the top and bottom I/O banks.

Stratix II TDO V

= 2.5 V V

CCIO

Voltage Level in I/O Bank 4

CCIO

= 1.8 V V

CCIO

Level shifter

required

Level shifter

required

Level shifter

required

= 1.5 V V

= 1.2 V

CCIO

Level shifter

required

Level shifter

required

Level shifter

required

Support

The high-speed differential I/O circuitry supports the following high speed I/O interconnect standards and applications:

■ SPI-4 Phase 2 (POS-PHY Level 4)

■ SFI-4

■ Parallel RapidIO

■ HyperTransport technology

There are four dedicated high-speed PLLs in the EP2S15 to EP2S30 devices and eight dedicated high-speed PLLs in the EP2S60 to EP2S180 devices to multiply reference clocks and drive high-speed differential SERDES channels.

Tables 2–21 through 2–26 show the number of channels that each fast PLL

can clock in each of the Stratix II devices. In Tables 2–21 through 2–26 the first row for each transmitter or receiver provides the number of channels driven directly by the PLL. The second row below it shows the maximum channels a PLL can drive if cross bank channels are used from the adjacent center PLL. For example, in the 484-pin FineLine BGA EP2S15

2–96 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 97

device, PLL 1 can drive a maximum of 10 transmitter channels in I/O bank 1 or a maximum of 19 transmitter channels in I/O banks 1 and 2. The Quartus II software may also merge receiver and transmitter PLLs when a receiver is driving a transmitter. In this case, one fast PLL can drive both the maximum numbers of receiver and transmitter channels.

Table 2–21. EP2S15 Device Differential Channels Note (1)

Stratix II Architecture

Package

484-pin FineLine BGA Transmitter 38 (2) 10 9 9 10

672-pin FineLine BGA Transmitter 38 (2) 10 9 9 10

Transmitter/

Receiver

Receiver 42 (2) 11 10 10 11

Tot al

Channels

(3) 19 19 19 19

(3) 21 21 21 21

(3) 19 19 19 19

(3) 21 21 21 21

PLL 1 PLL 2 PLL 3 PLL 4

Center Fast PLLs

Table 2–22. EP2S30 Device Differential Channels Note (1)

Package

484-pin FineLine BGA Transmitter 38 (2) 10 9 9 10

672-pin FineLine BGA Transmitter 58 (2) 16 13 13 16

Transmitter/

Receiver

Receiver 42 (2) 11 10 10 11

Receiver 62 (2) 17 14 14 17

Tot al

Channels

(3) 19 19 19 19

(3) 21 21 21 21

(3) 29 29 29 29

(3) 31 31 31 31

PLL 1 PLL 2 PLL 3 PLL 4

Center Fast PLLs

Altera Corporation 2–97 May 2007 Stratix II Device Handbook, Volume 1

Page 98

High-Speed Differential I/O with DPA Support

Table 2–23. EP2S60 Differential Channels Note (1)

Package

484-pin FineLine BGA

672-pin FineLine BGA

1,020-pin FineLine BGA

Transmitter/

Receiver

Transmitter 38 (2) 10 9 9 10 10 9 9 10

Receiver 42 (2) 11 10 10 11 11 10 10 11

Transmitter 58 (2) 16 13 13 16 16 13 13 16

Receiver 62 (2) 17 14 14 17 17 14 14 17

Transmitter 84 (2) 21 21 21 21 21 21 21 21

Receiver 84 (2) 21 21 21 21 21 21 21 21

Tot al

Channels

(3) 19 19 19 19 - - - -

(3) 21 21 21 21 - - - -

(3) 29 29 29 29 - - - -

(3) 31 31 31 31 - - - -

(3) 42 42 42 42 - - - -

Center Fast PLLs Corner Fast PLLs (4)

PLL 1 PLL 2 PLL 3 PLL 4 PLL 7 PLL 8 PLL 9 PLL 10

Table 2–24. EP2S90 Differential Channels Note (1)

Package

484-pin Hybrid FineLine BGA

780-pin FineLine BGA

1,020-pin FineLine BGA

1,508-pin FineLine BGA

Transmitter/

Receiver

Transmitter 38 (2) 10 9 9 10 - - - -

Receiver 42 (2) 11 10 10 11 - - - -

Transmitter 64 (2) 16 16 16 16 - - -

Receiver 68 (2) 17 17 17 17 - - - -

Transmitter 90 (2) 23 22 22 23 23 22 22 23

Receiver 94 (2) 23 24 24 23 23 24 24 23

Transmitter 118 (2) 30 29 29 30 30 29 29 30

Receiver 118 (2) 30 29 29 30 30 29 29 30

Tot al

Channels

(3) 19 19 19 19 - - - -

(3) 21 21 21 21 - - - -

(3) 32 32 32 32 - - - -

(3) 34 34 34 34 - - -

(3) 45 45 45 45 - - - -

(3) 46 46 46 46 - - - -

(3) 59 59 59 59 - - - -

Center Fast PLLs Corner Fast PLLs (4)

PLL 1 PLL 2 PLL 3 PLL 4 PLL 7 PLL 8 PLL 9 PLL 10

2–98 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Page 99

Table 2–25. EP2S130 Differential Channels Note (1)

Stratix II Architecture

Package

780-pin FineLine BGA

1,020-pin FineLine BGA

1,508-pin FineLine BGA

Transmitter/

Receiver

Transmitter 64 (2) 16 16 16 16 - - -

Receiver 68 (2) 17 17 17 17 - - - -

Transmitter 88 (2) 22 22 22 22 22 22 22 22

Receiver 92 (2) 23 23 23 23 23 23 23 23

Transmitter 156 (2) 37 41 41 37 37 41 41 37

Receiver 156 (2) 37 41 41 37 37 41 41 37

Tot al

Channels

(3) 32 32 32 32 - - - -

(3) 34 34 34 34 - - -

(3) 44 44 44 44 - - - -

(3) 46 46 46 46 - - - -

(3) 78 78 78 78 - - - -

Center Fast PLLs Corner Fast PLLs (4)

PLL 1 PLL 2 PLL 3 PLL 4 PLL 7 PLL 8 PLL 9 PLL 10

Table 2–26. EP2S180 Differential Channels Note (1)

Package

1,020-pin FineLine BGA

1,508-pin FineLine BGA

Notes to Tables 2–21 to 2–26:

(1) The total number of receiver channels includes the four non-dedicated clock channels that can be optionally used

as data channels. (2) This is the maximum number of channels the PLLs can directly drive. (3) This is the maximum number of channels if the device uses cross bank channels from the adjacent center PLL. (4) The channels accessible by the center fast PLL overlap with the channels accessible by the corner fast PLL.

Therefore, the total number of channels is not the addition of the number of channels accessible by PLLs 1, 2, 3, and

4 with the number of channels accessible by PLLs 7, 8, 9, and 10.

Transmitter/

Receiver

Transmitter 88 (2) 22 22 22 22 22 22 22 22

Receiver 92 (2) 23 23 23 23 23 23 23 23

Transmitter 156 (2) 37 41 41 37 37 41 41 37

Receiver 156 (2) 37 41 41 37 37 41 41 37

Tot al

Channels

(3) 44 44 44 44 - - - -

(3) 46 46 46 46 - - - -

(3) 78 78 78 78 - - - -

Center Fast PLLs Corner Fast PLLs (4)

PLL 1 PLL 2 PLL 3 PLL 4 PLL 7 PLL 8 PLL 9 PLL 10

Altera Corporation 2–99 May 2007 Stratix II Device Handbook, Volume 1

Page 100

High-Speed Differential I/O with DPA Support

Dedicated Circuitry with DPA Support

Stratix II devices support source-synchronous interfacing with LVDS or HyperTransport signaling at up to 1 Gbps. Stratix II devices can transmit or receive serial channels along with a low-speed or high-speed clock. The receiving device PLL multiplies the clock by an integer factor W = 1 through 32. For example, a HyperTransport technology application where the data rate is 1,000 Mbps and the clock rate is 500 MHz would require that W be set to 2. The SERDES factor J determines the parallel data width to deserialize from receivers or to serialize for transmitters. The SERDES factor J can be set to 4, 5, 6, 7, 8, 9, or 10 and does not have to equal the PLL clock-multiplication W value. A design using the dynamic phase aligner also supports all of these J factor values. For a J factor of 1, the Stratix II device bypasses the SERDES block. For a J factor of 2, the Stratix II device bypasses the SERDES block, and the DDR input and output registers are used in the IOE. Figure 2–58 shows the block diagram of the Stratix II transmitter channel.

Figure 2–58. Stratix II Transmitter Channel

Data from R4, R24, C4, or direct link interconnect

Up to 1 Gbps

–

Dedicated Transmitter Interface

Regional or global clock

refclk

Fast PLL

Local

Interconnect

diffioclk

load_en

Each Stratix II receiver channel features a DPA block for phase detection and selection, a SERDES, a synchronizer, and a data realigner circuit. You can bypass the dynamic phase aligner without affecting the basic sourcesynchronous operation of the channel. In addition, you can dynamically switch between using the DPA block or bypassing the block via a control signal from the logic array. Figure 2–59 shows the block diagram of the Stratix II receiver channel.

2–100 Altera Corporation Stratix II Device Handbook, Volume 1 May 2007

Datasheet Stratix II Datasheet (ALTERA)

Specifications and Main Features

Frequently Asked Questions

User Manual

Functional Description

Logic Array Blocks

LAB Interconnects

LAB Control Signals

Adaptive Logic Modules

ALM Operating Modes

Normal Mode

Extended LUT Mode

Arithmetic Mode

Carry Chain

Shared Arithmetic Mode

Shared Arithmetic Chain

Register Chain

MultiTrack Interconnect

Clear & Preset Logic Control

Tri M a t r ix Memor y

Memory Block Size

M512 RAM Block

M4K RAM Blocks

M-RAM Block

Digital Signal Processing Block

Modes of Operation

DSP Block Interface

PLLs & Clock Networks

Global & Hierarchical Clocking

Global Clock Network

Regional Clock Network

Dual-Regional Clock Network

Combined Resources

Clock Control Block

Enhanced & Fast PLLs

Enhanced PLLs

Fast PLLs

I/O Structure

Double Data Rate I/O Pins

External RAM Interfacing

Programmable Drive Strength

Open-Drain Output

Bus Hold

Programmable Pull-Up Resistor

Advanced I/O Standard Support

On-Chip Termination

Differential On-Chip Termination

On-Chip Series Termination Without Calibration

On-Chip Series Termination with Calibration

On-Chip Parallel Termination with Calibration

MultiVolt I/O Interface

High-Speed Differential I/O with DPA Support

Dedicated Circuitry with DPA Support