Industrial Temperature Version Planned,
(– 40°C to 85°C)
The Texas Instruments (TI) TMS320C206‡ and TMS320LC206‡ digital signal processors (DSPs) are
fabricated with static CMOS integrated-circuit technology. The architectural design is based upon that of the
TMS320C20x series and is optimized for low-power operation. The combination of advanced Harvard
architecture, on-chip peripherals, on-chip memory, and a highly specialized instruction set is the basis of the
operational flexibility and speed of the ’C206.
The ’C206 offers these advantages:
D
Enhanced TMS320 architectural design for increased performance and versatility
D
Advanced integrated-circuit processing technology for increased performance
D
’C206 devices are pin- and code-compatible with ’C203 and ’F206 devices.
D
Source code for the ’C206 DSPs is software-compatible with the ’C1x and ’C2x DSPs and is upwardly
compatible with fifth-generation DSPs (’C5x)
D
New static-design techniques for minimizing power consumption and increasing radiation tolerance
Please be aware that an important notice concerning availability, standard warranty, and use in critical applications of
Texas Instruments semiconductor products and disclaimers thereto appears at the end of this data sheet.
†
IEEE Standard 1149.1-1990, IEEE Standard Test-Access Port
‡
Device numbers are hereafter referred to in the data sheet as ’C206, unless otherwise specified.
TI is a trademark of Texas Instruments Incorporated.
PRODUCTION DATA information is current as of publication date.
Products conform to specifications per the terms of Texas Instruments
standard warranty. Production processing does not necessarily include
testing of all parameters.
T able 1 shows the capacity of on-chip RAM and ROM, the number of serial and parallel I/O ports, the execution
time of one machine cycle, and the type of package with total pin count of the TMS320C206 and TMS320LC206
devices.
Table 1. Characteristics of the TMS320C206 and TMS320LC206 Processors
ON-CHIP MEMORY (16-BIT WORDS)
’x206
TMS320C206288
TMS320LC206288
†
On-chip RAM space B1 (256 words) and B2 (32 words) can be used as data memory only.
‡
On-chip RAM space B0 (256 words) can be used either in data space or program space depending on the value of the CNF bit in the ST1 register.
On-chip SARAM (4K) can be mapped in program space, data space, or both.
Read-select indicates an active, external read cycle. RD is active on all external program, data, and I/O reads.
RD
goes into the high-impedance state when OFF is active low. To implement a glueless zero wait-state
RD45O/Z
WE44O/Z
STRB46O/Z
BR43O/Z
HOLDA6O/Z
XF98O/Z
BIO99I
IO0
IO1
IO2
IO3
RS100I
PLLRS10I
EXT81I
MP/MC2I
NMI17I
HOLD/INT118I
INT2
INT3
†
I = input, O = output, Z = high impedance, PWR = power, GND = ground
96
97
19
20
I/O/Z
8
9
memory interface, the inverted R/W
RD
pin can be programmed to provide an inverted R/W signal instead of RD. The FRDN bit (bit 15) in the PMST
register controls this selection. FRDN=1 chooses R/W
RD
as the “read” signal on pin 45.
Write enable. The falling edge of WE indicates that the device is driving the external data bus (D15–D0). Data
can be latched by an external device on the rising edge of WE
I/O writes. WE
Strobe. STRB is always high unless asserted low to indicate an external bus cycle. STRB goes into the
high-impedance state when OFF
Bus-request. BR is asserted when a global data-memory access is initiated. BR goes into the high-impedance
state when OFF
Hold-acknowledge. HOLDA indicates to the external circuitry that the processor is in a hold state and that the
address, data, and memory control lines are in the high-impedance state so that they are available to the
external circuitry for access of local memory . HOLDA
low.
External flag output (latched software-programmable signal). XF is used for signalling other processors in
multiprocessing configurations or as a general-purpose output pin. XF goes into the high-impedance state
when OFF
Branch control input. When polled by the BCND pma,BIO instruction, ’C206 executes a branch to the
specified program memory address if BIO
Software-controlled input/output by way of the asynchronous serial-port control register (ASPCR). At reset,
IO0–IO3 are configured as inputs. These pins can be used as general-purpose input / output pins or as
handshake control for the UART. IO0–IO3 go into the high-impedance state when OFF
INITIALIZATION, INTERRUPTS, AND RESET OPERATIONS
Reset. RS causes the ’C206 and ’LC206 to terminate execution and forces the program counter to zero.
When RS
various registers and status bits.
Phase locked loop reset. PLLRS resets the PLL to initiate PLL locking. At power up, both PLLRS and RS
should be active (low) to reset the DSP core and the PLL circuitry. The PLL can only be reset along with
the core as shown in Table 2. The state of the PLLRS
tied high or low.
Bootloader mode pin. EXT8 is latched to bit 3 (LEVEXT8) in the PMST register. The bootloader located in
ROM uses EXT8 to determine the type of boot method. If EXT8 is high, the enhanced ’C206 bootloader is
used. If EXT8 is low, the ’C203 style bootloader is used. Refer to the
number SPRU127) for more details regarding the ’C203 style bootloader.
Microprocessor/microcomputer mode select. If MP/MC is low, the on-chip ROM memory is mapped into
program space. When MP/MC
RESET, and its value is latched into bit 0 of the PMST register.
Nonmaskable interrupt. NMI is an external interrupt that cannot be masked by way of the interrupt-mode bit
(INTM) or the interrupt-mask register (IMR). When NMI
vector location. If NMI
HOLD and INT1 share the same pin. Both are treated as interrupt signals. If the MODE bit is 0 in the interrupt
control register (ICR), hold logic can be implemented in combination with the IDLE instruction in software. At
reset, the MODE bit in ICR is zero, enabling the HOLD mode for the pin.
External user interrupts. INT2 and INT3 are prioritized and maskable by the IMR and the INTM. INT2 and INT3
I
can be polled and reset by way of the interrupt flag register (IFR).
goes into the high-impedance state when OFF is active low.
is active low.
MULTI-PROCESSING SIGNALS
is active low.
is active low.
is brought high, execution begins at location 0 of program memory after 16 cycles. RS affects
is high, the device accessess off-chip memory. This pin is only sampled at
is not used, it should be pulled high.
TMS320C206, TMS320LC206
DIGITAL SIGNAL PROCESSORS
SPRS065B – JUNE 1998 – REVISED JANUARY 1999
signal can be used as the “read” signal in place of RD. The function of the
as the new “read” signal. FRDN=0 (at reset) chooses
. WE is active on all external program, data, and
goes into the high-impedance state when OFF is active
is low.
is active low.
is not applicable for 2 mode and should always be
TMS320C20x User’s Guide
is activated, the processor traps to the appropriate
DR86I
TX93OAsynchronous serial port (ASP/UART) data transmit output pin
RX95IAsynchronous serial port (ASP/UART) data receive pin
TRST79I
TCK78I
TMS81IJT AG test-mode select. TMS is clocked into the TAP controller on the rising edge of TCK.
TDI80IJTAG test-data input. TDI is clocked into the selected register (instruction or data) on a rising edge of TCK.
TDO82O/Z
EMU076I/O/Z
†
I = input, O = output, Z = high impedance, PWR = power, GND = ground
12
13
3
5
Timer output. TOUT signals a pulse when the on-chip timer counts down past zero. The pulse is one CLKOUT1
cycle wide. TOUT goes into the high-impedance state when OFF
Master clock ouput. The CLKOUT1 high pulse signifies the logic phase while the low pulse signifies the latch
phase.
I
Input clock. CLKIN/X2 is the input clock to the device. As CLKIN, the pin operates as the external oscillator
O
clock input and as X2, the pin operates as the internal oscillator input with X1 being the internal oscillator output.
DIV1 and DIV2 are used to configure the on-chip PLL options, providing four clock modes (÷2, ×1, ×2, and ×4)
for a given input clock frequency. Refer to clock options in electrical characteristics section. Note that in the
I
divide-by-2 mode, the PLL is bypassed. DIV1–DIV2 should not be changed unless the RS
SERIAL PORT SIGNALS (SSP AND ASP)
Transmit clock. CLKX is a clock signal for clocking data from the XSR (transmit shift register) to the DX
data-transmit pin. CLKX can be an input if the MCM bit in the synchronous serial-port control register (SSPCR)
is set to 0. CLKX can also be driven by the device at one-half of the CLKOUT1 frequency when
MCM = 1. If the serial port is not being used, CLKX goes into the high-impedance state when OFF is active
low. Value at reset is as an input.
Receive clock. External clock signal for clocking data from the DR (data-receive) pin into the serial-port receive
shift register (RSR). CLKR must be present during serial-port transfers. If the serial port is not being used,
CLKR can be sampled as an input by IN0 bit of the SSPCR.
Frame synchronization pulse for transmit. The falling edge of the FSX pulse initiates the data-transmit process,
beginning the clocking of the SR. Following reset, FSX is an input. FSX can be selected by software to be an
output when the TXM bit in the serial control register, SSPCR, is set to 1. FSX goes into the high-impedance
state when OFF
Frame synchronization pulse for receive input. The falling edge of the FSR pulse initiates the data-receive
process, beginning the clocking of the RSR. FSR goes into the high-impedance state when OFF
Synchronous serial port (SSP) data transmit output. Serial data is transmitted from the transmit shift register
(XSR) through the DX pin. DX is in the high-impedance state when OFF
Synchronous serial port (SSP) data receive input. Serial data is received in the receive shift register (RSR)
through the DR pin.
IEEE Standard 1149.1 (JTAG) test reset. TRST, when active high, gives the scan system control of the
operations of the device. If TRST
are ignored. If the TRST
JTAG test clock. TCK is normally a free-running clock signal with a 50% duty cycle. The changes on the
test-access port (TAP) input signals (TMS and TDI) are clocked into the TAP controller, instruction register,
or selected test-data register on the rising edge of TCK. Changes at the TAP output signal (TDO) occur on the
falling edge of TCK.
JTAG test-data output. The contents of the selected register (instruction or data) are shifted out of TDO on the
falling edge of TCK. TDO is in the high-impedance state except when the scanning of data is in progress.
Emulator pin 0. When TRST is driven low, EMU0 must be high for activation of the OFF condition. When TRST
is driven high, EMU0 is used as an interrupt to or from the emulator system and is defined as an input/output
through the JTAG scan.
is active low.
TEST SIGNALS
is driven low, the device operates in its functional mode, and the test signals
pin is not driven, an external pulldown resistor must be used.
Emulator pin 1. Emulator pin 1 disables all outputs. When TRST is driven high, EMU1 /OFF is used as an
interrupt to or from the emulator system and is defined as an input/output through the JTAG scan. When TRST
is driven low, this pin is configured as OFF. EMU1/ OFF, when active low, puts all output drivers in the
EMU1/OFF77I/O/Z
7
16
V
DD5
V
DD
V
DD
V
SS
†
I = input, O = output, Z = high impedance, PWR = power, GND = ground
‡
For the ’C206, the 3.3-V and 5-V power supplies may be sequenced in any order.
35
PWR5-V I/O power supply (Applicable for TMS320C206 only‡)
50
63
91
7
16
35
PWR3.3-V I/O power supply (Applicable for ’LC206 only)
50
63
91
4
11
PWR3.3-V core power supply (Applicable for both ’C206 and ’LC206 devices‡)
75
14
21
25
30
37
42
48
GNDGround
54
59
65
70
83
88
94
high-impedance state. Note that OFF
multiprocessing applications). Therefore, for the OFF
TRST
= 0
EMU0 = 1
EMU1/OFF
= 0
SUPPLY PINS
is used exclusively for testing and emulation purposes (not for
NOTES: A. Symbol descriptions appear in Table 3 and Table 4.
B. For clarity, the data and program buses are shown as single buses although they include address and data bits.
8
POST OFFICE BOX 1443 • HOUSTON, TEXAS 77251–1443
IFR
g
TMS320C206, TMS320LC206
DIGITAL SIGNAL PROCESSORS
SPRS065B – JUNE 1998 – REVISED JANUARY 1999
Table 3. Legend for the ’C206 Internal Hardware Functional Block Diagram
SYMBOLNAMEDESCRIPTION
ACCAccumulator
Asynchronous
ADTR
ARAU
ARB
ARP
ASP
ASPCR
AUX REGS
(AR0–AR7)
BR
BRDBaud-Rate DivisorUsed to set the baud rate of the UART
CCarry
CALU
DARAMDual Access RAM
DP
GREG
IMR
INT#Interrupt TrapsA total of 32 interrupts by way of hardware and/or software are available.
IOSR
IR
Data Transmit and
Receive Register
Auxiliary Register
Arithmetic Unit
Auxiliary Register
Pointer Buffer
Auxiliary Register
Pointer
Universal
Asynchronous
Receive/Transmit
Asynchronous
Serial-Port Control
Register
Auxiliary Registers
0–7
Bus Request
Signal
Central Arithmetic
Logic Unit
Data Memory
Page Pointer
Global Memory
Allocation
Register
Interrupt FlagThe 7-bit IFR indicates that the ’C206 has latched an interrupt pulse from one of the seven maskable
Registerinterrupt sources.
Interrupt Mask
Register
I/O Status
Register
Instruction
Register
32-bit register that stores the results and provides input for subsequent CALU operations. Also includes shift
and rotate capabilities. ACCH is the accumulator high word; ACCL is the accumulator low word.
16-bit read/write register used to transmit data from/receive data into the asynchronous serial port. Note
that the ASP works with 8-bit data.
An unsigned, 16-bit arithmetic unit used to calculate indirect addresses using the auxiliary registers as
inputs and outputs.
See Table 4 for status register field definitions.
See Table 4 for status register field definitions.
ASP is the asynchronous serial port (UART).
ASPCR controls the asynchronous serial-port operation This register contains bits for setting port modes,
enabling/disabling automatic baud-rate detection, selecting the number of stop bits, and configuring I/O
pins, etc.
These 16-bit registers are used as pointers to anywhere within the data space address range. They are
operated upon by the ARAU and are selected by the auxiliary register pointer (ARP). AR0 can also be used
as an index value for AR updates of more than one and as a compare value to AR.
BR is asserted during access of the external global data memory space. READY is asserted to the device
when the global data memory is available for the bus transaction. BR
address space by up to 32K words.
Register carry output from CALU. C is fed back into the CALU for extended arithmetic operation. The C bit
resides in status register 1 (ST1), and can be tested in conditional instructions. C is also used in accumulator
shifts and rotates.
32-bit-wide main arithmetic logic unit for the TMS320C20x core. The CALU executes 32-bit operations in
a single machine cycle. CALU operates on data coming from ISCALE or PSCALE with data from ACC, and
provides status results to PCTRL.
If the on-chip RAM configuration control bit (CNF) is set to 0, the reconfigurable dual-access RAM (DARAM)
block B0 is mapped to data space; otherwise, B0 is mapped to program space. Blocks B1 and B2 are
mapped to data memory space only , at addresses 0300–03FF and 0060–007F, respectively . Blocks 0 and
1 contain 256 words, while Block 2 contains 32 words.
See Table 4 for status register field definitions.
GREG specifies the size of the global data memory space.
IMR individually masks or enables the seven interrupts.
IOSR detects current levels (and changes with inputs) on pins IO0–IO3 and the status of UART.
IR is the instruction register.
can be used to extend the data memory
POST OFFICE BOX 1443 • HOUSTON, TEXAS 77251–1443
9
TMS320C206, TMS320LC206
DIGITAL SIGNAL PROCESSORS
SPRS065B – JUNE 1998 – REVISED JANUARY 1999
Table 3. Legend for the ’C206 Internal Hardware Functional Block Diagram (Continued)
SYMBOLNAMEDESCRIPTION
ISCALE
MPYMultiplier
MSTACKMicro Stack
MUXMultiplexerMultiplexes buses to a common input
NPAR
16-bit to 32-bit barrel left-shifter. ISCALE (ISFL) shifts incoming 16-bit data 0 to16 positions left, relative to
the 32-bit output within the fetch cycle; therefore, no cycle overhead is required for input scaling operations.
16 × 16-bit multiplier to a 32-bit product. MPY executes multiplication in a single cycle. MPY operates either
signed or unsigned 2s-complement arithmetic multiply.
MSTACK provides temporary storage for the address of the next instruction to be fetched when
program-address-generation logic is used to generate sequential addresses in data space.
NPAR holds the program address to be driven out on the PAB on the next cycle.
32-bit to 16-bit barrel left-shifter. OSCALE (OSFL) shifts the 32-bit accumulator output 0 to 7 bits left for
quantization management and outputs either the 16-bit high- or low-half of the shifted 32-bit data to the
Data-Write Data Bus (DWEB).
PAR holds the address currently being driven on P AB for as many cycles as it takes to complete all memory
operations scheduled for the current bus cycle.
PC increments the value from NPAR to provide sequential addresses for instruction-fetching and sequential
data-transfer operations.
PCTRL decodes instruction, manages the pipeline, stores status, and decodes conditional operations.
PRD contains the 16-bit period that is loaded into the timer counter when the counter borrows or when the
reload bit is activated. Reset initializes the PRD to FFFFh.
0-, 1- or 4-bit left shift or 6-bit right shift of multiplier product. The left-shift options are used to manage the
additional sign bits resulting from the 2s-complement multiply. The right-shift option is used to scale down
the number to manage overflow of product accumulation in the CALU. PSCALE (PSFL) resides in the path
from the 32-bit product shifter and from either the CALU or the DWEB, and requires no cycle overhead.
16-bit read/write register used to transmit data from/receive data into the synchronous serial port. This
register functions as the path to the transmit and receive FIFOs of the SSP.
SSP is the synchronous serial-port.
SSPCR is the control register for selecting the serial port’s mode of operation.
SSPCT is the synchronous serial-port counter register.
SSPMC is the synchronous serial-port multichannel register.
SSPST is the synchronous serial-port status register.
STACK is a block of memory used for storing return addresses for subroutines and interrupt-service
routines, or for storing data. The ’C20x stack is 16-bit wide and eight-level deep.
TCR contains the control bits that define the divide-down ratio, start/stop the timer, and reload the period.
Also contained in TCR is the current count in the prescaler. Reset initializes the timer divide-down ratio
to 0 and starts the timer.
10
POST OFFICE BOX 1443 • HOUSTON, TEXAS 77251–1443
TMS320C206, TMS320LC206
DIGITAL SIGNAL PROCESSORS
SPRS065B – JUNE 1998 – REVISED JANUARY 1999
Table 3. Legend for the ’C206 Internal Hardware Functional Block Diagram (Continued)
SYMBOLNAMEDESCRIPTION
TIM
TREG
architectural overview
The ’C206 advanced Harvard-type architecture maximizes the processing power by maintaining two separate
memory bus structures—program and data—for full-speed execution. The multiple buses allow data and
instructions to be read simultaneously. Instructions support data transfers between the two spaces. This
architecture permits coefficients stored in program memory to be read in RAM, eliminating the need for a
separate coefficient RAM. This, coupled with a four-instruction deep pipeline, allows the
TMS320C206/TMS320LC206 to execute most instructions in a single cycle.
status and control registers
Two status registers, ST0 and ST1, contain the status of various conditions and modes. These registers can
be stored into data memory and loaded from data memory , thus allowing the status of the machine to be saved
and restored for subroutines.
Timer-Counter
Register
Temporary
Register
TIM contains the current 16-bit count of the timer. Reset initializes the TIM to FFFFh.
16-bit register holds one of the operands for the multiply operations. TREG holds the dynamic shift count
for the LACT , ADDT, and SUBT instructions. TREG holds the dynamic bit position for the BITT instruction.
The load-status register (LST) instruction is used to write to ST0 and ST1 (except the INTM bit which is not
affected by the LST instruction). The store-status register (SST) instruction is used to read from the ST0 and
ST1. The individual bits of these registers can be set or cleared by the SETC and CLRC instructions. Figure 1
shows the organization of status registers ST0 and ST1, indicating all status bits contained in each. Several bits
in the status registers are reserved and are read as logic 1s. See Table 4 for status-register field definitions.
15–13 12 11 109 87654321 0
ST0
ST1
ARPOVOVM1INTMDP
15–13121110918171615XF41312PM1–0
ARBCNFTCSXMC
Figure 1. Status and Control Register Organization
POST OFFICE BOX 1443 • HOUSTON, TEXAS 77251–1443
11
TMS320C206, TMS320LC206
DIGITAL SIGNAL PROCESSORS
SPRS065B – JUNE 1998 – REVISED JANUARY 1999
status and control registers (continued)
Table 4. Status Register Field Definitions
FIELDFUNCTION
ARB
ARP
C
CNF
DP
INTM
OV
OVM
PM
SXM
TC
XF
Auxiliary register pointer buffer . Whenever the ARP is loaded, the old ARP value is copied to the ARB except during an LST
instruction. When the ARB is loaded by an LST #1 instruction, the same value is also copied to the ARP.
Auxiliary register pointer. ARP selects the auxiliary register (AR) to be used in indirect addressing. When the ARP is loaded,
the old ARP value is copied to the ARB register. ARP can be modified by memory-reference instructions when using indirect
addressing, and by the LARP, MAR, and LST instructions. The ARP is also loaded with the same value as ARB when an LST
#1 instruction is executed.
Carry bit. C is set to 1 if the result of an addition generates a carry, or reset to 0 if the result of a subtraction generates a borrow .
Otherwise, C is reset after an addition or set after a subtraction, except if the instruction is ADD or SUB with a 16-bit shift. In
these cases, the ADD can only set and the SUB only reset the carry bit, but cannot affect it otherwise. The single-bit shift and
rotate instructions also affect C, as well as the SETC, CLRC, and LST #1 instructions. Branch instructions have been provided
to branch on the status of C. C is set to 1 on a reset.
On-chip RAM configuration-control bit. If CNF is set to 0, the reconfigurable data DARAM blocks are mapped to data space;
otherwise, they are mapped to program space. The CNF can be modified by the SETC CNF, CLRC CNF, and LST #1
instructions. RS
Data memory page pointer . The 9-bit DP register is concatenated with the seven LSBs of an instruction word to form a direct
memory address of 16 bits. DP can be modified by the LST and LDP instructions.
Interrupt-mode bit. When INTM is set to 0, all unmasked interrupts are enabled. When set to 1, all maskable interrupts are
disabled. INTM is set and reset by the SETC INTM and CLRC INTM instructions. INTM has no effect on the nonmaskable RS
and NMI interrupts. Note that INTM is unaffected by the LST instruction. This bit is set to 1 when a maskable interrupt is
acknowledged or when RS
Overflow-flag bit. As a latched overflow signal, OV is set to 1 when overflow occurs in the ALU. Once an overflow occurs, the
OV remains set until a reset, BCND/D on OV/NOV, or LST instruction clears OV.
Overflow-mode bit. When OVM is set to 0, overflowed results overflow normally in the accumulator. When set to 1, the
accumulator is set to either its most positive or negative value upon encountering an overflow. The SETC and CLRC
instructions set and reset this bit, respectively. LST also can be used to modify the OVM.
Product-shift mode. If these two bits are 00, the multiplier’s 32-bit product is loaded into the ALU with no shift. If PM = 01, the
PREG output is left-shifted one place and loaded into the ALU, with the LSB zero-filled. If PM = 10, the PREG output is
left-shifted by four bits and loaded into the ALU, with the LSBs zero-filled. PM = 1 1 produces a right shift of six bits, sign- extended. Note that the PREG contents remain unchanged. The shift takes place when transferring the contents of the PREG
to the ALU. PM is loaded by the SPM and LST #1 instructions. PM is cleared by RS
Sign-extension mode bit. SXM = 1 produces sign extension on data as it is passed into the accumulator through the scaling
shifter. SXM = 0 suppresses sign extension. SXM does not af fect the definitions of certain instructions; for example, the ADDS
instruction suppresses sign extension regardless of SXM. SXM is set by the SETC SXM and reset by the CLRC SXM
instructions, and can be loaded by the LST #1. SXM is set to 1 by reset.
T est/control flag bit. TC is affected by the BIT , BITT, CMPR, LST #1, and NORM instructions. TC is set to a 1 if a bit tested by
BIT or BITT is a 1, if a compare condition tested by CMPR exists between AR (ARP) and AR0, if the exclusive-OR function
of the two MSBs of the accumulator is true when tested by a NORM instruction. The conditional branch, call, and return
instructions can execute, based on the condition of TC.
XF pin status bit. XF indicates the state of the XF pin, a general-purpose output pin. XF is set by the SETC XF and reset by
the CLRC XF instructions. XF is set to 1 by reset.
sets the CNF to 0.
is asserted.
.
central processing unit
The TMS320C206/TMS320LC206 central processing unit (CPU) contains a 16-bit scaling shifter, a 16 x16-bit
parallel multiplier, a 32-bit central arithmetic logic unit (CALU), a 32-bit accumulator, and additional shifters at
the outputs of both the accumulator and the multiplier. This section describes the CPU components and their
functions. The functional block diagram shows the components of the CPU.
12
POST OFFICE BOX 1443 • HOUSTON, TEXAS 77251–1443
TMS320C206, TMS320LC206
DIGITAL SIGNAL PROCESSORS
SPRS065B – JUNE 1998 – REVISED JANUARY 1999
input scaling shifter
The TMS320C206/TMS320LC206 provides a scaling shifter with a 16-bit input connected to the data bus and
a 32-bit output connected to the CALU. This shifter operates as part of the path of data coming from program
or data space to the CALU, and requires no cycle overhead. It is used to align the 16-bit data coming from
memory to the 32-bit CALU. This is necessary for scaling arithmetic as well as aligning masks for logical
operations.
The scaling shifter produces a left shift of 0 to 16 on the input data. The LSBs of the output are filled with zeros;
the MSBs may either be filled with zeros or sign-extended, depending upon the value of the SXM bit
(sign-extension mode) of status register ST1. The shift count is specified by a constant embedded in the
instruction word or by a value in TREG. The shift count in the instruction allows for specific scaling or alignment
operations specific to that point in the code. The TREG base shift allows the scaling factor to adapt to the
performance of the system.
multiplier
The TMS320C206/TMS320LC206 uses a 16 x16-bit hardware multiplier that is capable of computing a signed
or an unsigned 32-bit product in a single machine cycle. All multiply instructions, except the MPYU (multiply
unsigned) instruction, perform a signed-multiply operation. That is, two numbers being multiplied are treated
as 2s-complement numbers, and the result is a 32-bit 2s-complement number. There are two registers
associated with the multiplier:
D
a 16-bit temporary register (TREG) that holds one of the operands for the multiplier, and
D
a 32-bit product register (PREG) that holds the product.
Four product shift modes (PM) are available at the PREG output (PSCALE). These shift modes are useful for
performing multiply/accumulate operations, performing fractional arithmetic, or justifying fractional products.
The PM field of status register ST1 specifies the PM shift mode, as shown in Table 5.
Table 5. PSCALE Product Shift Modes
PMSHIFTDESCRIPTION
00no shiftProduct fed to CALU or data bus with no shift
01left 1Removes the extra sign bit generated in a 2s-complement multiply to produce a Q31 product
10left 4Removes the extra four sign bits generated in a 16 x13 2s-complement multiply to a produce a Q31
11right 6Scales the product to allow up to 128 product accumulation without the possibility of accumulator overflow
product when using the multiply by a 13-bit constant
The product can be shifted one bit to compensate for the extra sign bit gained in multiplying two 16-bit
2s-complement numbers (MPY). A four-bit shift is used in conjunction with the MPY instruction with a short
immediate value (13 bits or less) to eliminate the four extra sign bits gained in multiplying a 16-bit number by
a 13-bit number. Finally, the output of PREG can be right-shifted six bits to enable the execution of up to
128 consecutive multiply/accumulates without the possibility of overflow.
The L T (load TREG) instruction normally loads TREG to provide one operand (from the data bus), and the MPY
(multiply) instruction provides the second operand (also from the data bus). A multiplication can also be
performed with a 13-bit immediate operand when using the MPY instruction. A product is then obtained every
two cycles. For efficient implementation of multiple products, or multiple sums of products, the CPU provides
pipelining of the TREG load operation with certain CALU operations which use the PREG. These operations
include: load ACC with PREG (L TP); add PREG to ACC (LTA); add PREG to ACC and shift TREG input data
to next address in data memory (LTD); and subtract PREG from ACC (LTS).
POST OFFICE BOX 1443 • HOUSTON, TEXAS 77251–1443
13
TMS320C206, TMS320LC206
DIGITAL SIGNAL PROCESSORS
SPRS065B – JUNE 1998 – REVISED JANUARY 1999
multiplier (continued)
Two multiply/accumulate instructions (MAC and MACD) fully utilize the computational bandwidth of the
multiplier, allowing both operands to be processed simultaneously. The data for these operations can be
transferred to the multiplier each cycle by way of the program and data buses. This facilitates single-cycle
multiply/accumulates when used with the repeat (RPT) instruction. In these instructions, the coefficient
addresses are generated by program-address generation (PAGEN), while the data addresses are generated
by data-address generation (DAGEN). This allows the repeated instruction to access the values sequentially
from the coefficient table and step through the data in any of the indirect addressing modes.
The MACD instruction, when repeated, supports filter constructs (weighted running averages) so that as the
sum-of-products is executed, the sample data is shifted in memory to make room for the next sample and to
discard the oldest sample.
The MPYU instruction performs an unsigned multiplication, which greatly facilitates extended-precision
arithmetic operations. The unsigned contents of TREG are multiplied by the unsigned contents of the addressed
data-memory location, with the result placed in PREG. This allows the operands of greater than 16 bits to be
broken down into 16-bit words and processed separately to generate products of greater than 32 bits. The
SQRA (square/add) and SQRS (square/subtract) instructions pass the same value to both inputs of the
multiplier for squaring a data-memory value.
After the multiplication of two 16-bit numbers, the 32-bit product is loaded into the 32-bit product register
(PREG). The product from PREG can be transferred to the CALU or to data memory through the SPH (store
product high) and SPL (store product low) instructions. Note: the transfer of PREG to either the CALU or data
memory passes through the PSCALE shifter and is therefore, affected by the product-shift mode value defined
by the PM bits in the ST1 register. This is important when saving PREG in an interrupt-service routine context
save as the PSCALE shift effects cannot be modeled in the restore operation. PREG can be cleared by
executing the MPY #0 instruction. The product register can be restored by loading the saved low half into TREG
and executing a MPY #1 instruction. The high half is then loaded using the LPH instruction.
central arithmetic logic unit
The TMS320C206/TMS320LC206 CALU implements a wide range of arithmetic and logical functions, the
majority of which execute in a single clock cycle. This arithmetic/logic unit (ALU) is referred to as central to
differentiate it from a second ALU used for indirect address generation called the auxiliary register arithmetic
unit (ARAU). Once an operation is performed in the CALU, the result is transferred to the accumulator (ACC)
where additional operations, such as shifting, may occur. Data that is input to the CALU can be scaled by
ISCALE when coming from one of the data buses (DRDB or PRDB) or scaled by PSCALE when coming from
the multiplier.
The CALU is a general-purpose ALU unit that operates on 16-bit words taken from data memory or derived from
immediate instructions. In addition to arithmetic operations, the CALU can perform Boolean operations,
facilitating the bit-manipulation ability required for a high-speed controller. One input to the CALU is always
provided from the accumulator, and the other input can be provided from the product register (PREG) of the
multiplier or from the output of the scaling shifter (that has been read from data memory or from the ACC). After
the CALU has performed the arithmetic or logical operation, the result is stored in the accumulator.
The TMS320C206/TMS320LC206 supports floating-point operations for applications requiring a large dynamic
range. The NORM (normalization) instruction is used to normalize fixed-point numbers contained in the
accumulator by performing left shifts. The four bits of the TREG define a variable shift through the scaling shifter
for the LACT/ADDT/SUBT (load/add to/subtract from accumulator with shift specified by TREG) instructions.
These instructions are useful in floating-point arithmetic where denormalization of a number is required (that
is, floating-point to fixed-point conversion). They are also useful in the implementation of automatic-gain control
(AGC) at the input of a filter. The BITT (bit test) instruction provides testing of a single bit of a word in data
memory based on the value contained in the four LSBs of TREG.
14
POST OFFICE BOX 1443 • HOUSTON, TEXAS 77251–1443
TMS320C206, TMS320LC206
DIGITAL SIGNAL PROCESSORS
SPRS065B – JUNE 1998 – REVISED JANUARY 1999
central arithmetic logic unit (continued)
The CALU overflow-saturation mode can be enabled/disabled by setting/resetting the OVM bit of ST0. Setting
the OVM status-register bit selects the overflow-saturation mode. When the CALU is in the overflow-saturation
mode and an overflow occurs, the overflow flag is set and the accumulator is loaded with either the most positive
or the most negative value representable in the accumulator, depending upon the direction of the overflow . The
value of the accumulator upon saturation is 07FFFFFFFh (positive) or 080000000h (negative). If the OVM
(overflow mode) status register bit is reset and an overflow occurs, the overflowed results are loaded into the
accumulator without modification. (Logical operations cannot result in overflow.)
The CALU can execute a variety of branch instructions that depend on the status of the CALU and accumulator .
These instructions can be executed conditionally , based on various combinations of the associated status bits.
For overflow management, these conditions include the OV (branch on overflow) and EQ (branch on
accumulator equal to zero). In addition, the BACC (branch to address in accumulator) instruction provides the
ability to branch to an address specified by the accumulator (computed goto). Bit-test instructions (BIT and
BITT), which do not affect the accumulator, allow the testing of a specified bit of a word in data memory.
The CALU also has a carry bit (bit 9 of status register ST1) that facilitates efficient computation of
extended-precision products and additions or subtractions. The carry bit is also useful in overflow management.
The carry bit is affected by the following operations:
D
Additions to and subtractions from the accumulator:
C = 0: When the result of a subtraction generates a borrow.
When the result of an addition does not generate a carry . (Exception: When the ADD instruction is
used with a shift of 16 and no carry is generated, the ADD instruction has no effect on C.)
C = 1: When the result of an addition generates a carry.
When the result of a subtraction does not generate a borrow. (Exception: When the SUB instruction
is used with a shift of 16 and no borrow is generated, the SUB instruction has no effect on C.)
D
Single-bit shifts and rotations of the accumulator value. During a left shift or rotation, the most significant
bit of the accumulator is passed to C; during a right shift or rotation, the least significant bit is passed to C.
Note: the carry bit is set to “1” on a hardware reset.
The ADDC (add to accumulator with carry) and SUBB (subtract from accumulator with borrow) instructions use
the previous value of carry in their addition/subtraction operation.
accumulator
The 32-bit accumulator is the registered output of the CALU. It can be split into two 16-bit segments for storage
in data memory. Shifters at the output of the accumulator provide a left shift of 0 to 7 places. This shift is
performed while the data is being transferred to the data bus for storage. The contents of the accumulator
remain unchanged. When the post-scaling shifter is used on the high word of the accumulator (bits 16–31), the
MSBs are lost and the LSBs are filled with bits shifted in from the low word (bits 0–15). When the postscaling
shifter is used on the low word, the LSBs are zero-filled.
The SFL and SFR (in-place one-bit shift to the left / right) instructions and the ROL and ROR (rotate to the
left/right) instructions implement shifting or rotating of the accumulator contents through the carry bit. The SXM
bit affects the definition of the SFR (shift accumulator right) instruction. When SXM = 1, SFR performs an
arithmetic right shift, maintaining the sign of the accumulator data. When SXM = 0, SFR performs a logical shift,
shifting out the LSBs and shifting in a zero for the MSB. The SFL (shift accumulator left) instruction is not affected
by the SXM bit and behaves the same way in both cases, shifting out the MSB and shifting in a zero. Repeat
(RPT) instructions can be used with the shift and rotate instructions for multiple-bit shifts.
POST OFFICE BOX 1443 • HOUSTON, TEXAS 77251–1443
15
TMS320C206, TMS320LC206
DIGITAL SIGNAL PROCESSORS
SPRS065B – JUNE 1998 – REVISED JANUARY 1999
auxiliary registers and auxiliary-register arithmetic unit (ARAU)
The ’C206 provides a register file containing eight auxiliary registers (AR0–AR7). The auxiliary registers are
used for indirect addressing of the data memory or for temporary data storage. For indirect data-memory
addressing, the address of the desired memory location is placed into the selected auxiliary register. These
registers are referenced with a 3-bit auxiliary register pointer (ARP) that is loaded with a value from 0
through 7, designating AR0 through AR7, respectively . The auxiliary registers and the ARP can be loaded from
data memory , the ACC, the product register, or by an immediate operand defined in the instruction. The contents
of these registers can also be stored in data memory or used as inputs to the CALU.
The auxiliary register file (AR0–AR7) is connected to the auxiliary register arithmetic unit (ARAU). The ARAU
can autoindex the current auxiliary register while the data-memory location is being addressed. Indexing either
by ±1 or by the contents of the AR0 register can be performed. As a result, accessing tables of information does
not require the CALU for address manipulation; thus, the CALU is free for other operations in parallel.
memory
The ’C206 implements three separate address spaces for program memory , data memory, and I/O. Each space
accommodates a total of 64K 16-bit words. Within the 64K words of data space, the 256 to 32K words at the
top of the address range can be defined to be external global-memory in increments of powers of two, as
specified by the contents of the global-memory allocation register (GREG). Access to global memory is
arbitrated using the global-memory bus request (BR
) signal.
On the ’C206, the first 96 (0 – 5Fh) data-memory locations are allocated for memory-mapped registers or
reserved. This memory-mapped register space contains various control and status registers including those for
the CPU.
The on-chip memory of the ’C206 includes 544 x 16 words of dual-access RAM (DARAM), 4K x 16 single-access
RAM (SARAM), and 32K x 16 program ROM memory . T able 6 shows the mapping of these memory blocks and
the appropriate control bits and pins. Figure 2 shows the effects of the memory control pin MP/MC
control bit CNF on the mapping of the respective memory spaces to on-chip or off-chip. The PON and DON bits
select the SARAM (4K) mapping in program, data, or both. See Table 9 for details on the PMST register, the
PON bit, and the DON bit. At reset, these bits are 1 1, and the on-chip SARAM is mapped in both the program
and data space. The SARAM addresses (800h in data and 8000h in program memory) are accessible in
external memory space, if the on-chip SARAM is not enabled.
At reset, if the MP/MC
external program space. The MP/MC
high, the device is in microprocessor mode. PMST register bits can be read and modified in software. If bit 0
is cleared to 0, the device enters microcomputer mode and program memory addresses from 0000h to 7FFFh
map to on-chip ROM.
If the MP/MC
the on-chip ROM mapped in program space. The on-chip ROM could either contain the bootloader or
customer-specific application code which is then executed.
The on-chip data memory blocks B0 and B1 are 256 16 words each. When CNF = 0, B0 is mapped in data
space at addresses 0200–02FFh. When CNF = 1, B0 is mapped in program space at addresses
0FF00–0FFFFh. The B1 block is always mapped in data space at addresses 0300–03FFh, and B2 block is
always mapped in a data space at addresses 60–7Fh.
pin is held low during reset, the device starts in microcomputer mode and branches to 0000h in
pin is held high, the device starts in microprocessor mode and branches to 0000h in
pin status is latched in the PMST register (bit 0). As long as this bit remains
256 x 16 word dual-access RAM (DARAM) (B0)200h – 2FFh–xxx0
256 x 16 word DARAM (B0)–FF00h – FFFFhxxx1
256 x 16 word DARAM (B1)300h – 3FFh–xxxx
32 x 16 word DARAM (B2)60h – 7Fh–xxxx
32K x 16 word on-chip program memory (ROM)–0000h – 7FFFh0xxx
32K x 16 word external program memory–0000h – 7FFFh1xxx
32K x 16 word external program memory,
if CNF=0 and MP/MC
External program memory, if CNF=1–8000h – FDFFh0x01
4K x 16 word on-chip SARAM (data)800h – 17FFhx10x
4K x 16 word on-chip SARAM (program)–8000h – 8FFFhx01x
4K x 16 word program and data on-chip SARAM
4K x 16 word on-chip SARAMnot availablenot availablex00x
†
The “x” denotes a “don’t care” condition.
‡
The single 4K on-chip SARAM block is accessible from both data and program memory space.
=0
DATA MEMORY
ADDRESS
–8000h – FFFFh0x00
‡
800h – 17FFh8000h – 8FFFhx11x
PROG MEMORY
ADDRESS
MP/MC†DON
†
PON
†
on-chip ROM
The mask-programmable ROM is located in program memory space. Customers can arrange to have this ROM
programmed with contents unique to any particular application. The ROM is enabled or disabled by the state
of the MP/MC
the block of program memory from addresses 0000–7FFFh. (Note: the last 100h words, 7F00–7FFFh, are
reserved for test.) When in microprocessor mode (MP/MC
device’s external program memory space.
control input upon resetting the device. In microcomputer mode (MP/MC = 0), the ROM occupies
= 1), addresses 0000h–7FFFh are located in the
CNF
BIT
†
bootloader
A bootloader is available in the standard ’C206/’LC206 on-chip ROM. This bootloader can be used to
automatically transfer user code from an external source to program memory at power up. If MP/MC
device is sampled low during a hardware reset, execution begins at location 0000h of the on-chip ROM. This
location contains a branch instruction to the start of the bootloader program. User code can be transferred to
the DSP program memory using any one of the following options:
D
8-bit transfer through the Synchronous Serial Port (SSP)
D
8-bit transfer through the Asynchronous Serial Port (ASP)
D
8/16-bit external EPROM
D
8/16-bit parallel port mapped to I/O space address 0001h of the DSP
D
Warm boot option
The standard ’C206/’LC206 on-chip ROM also contains the A-law, µ-law table in addition to the bootloader . (The
A-law table starts at 0400h, and the µ-law table starts at 0500h.)
of the
18
POST OFFICE BOX 1443 • HOUSTON, TEXAS 77251–1443
Loading...
+ 40 hidden pages
You need points to download manuals.
1 point = 1 manual.
You can buy points or you can get point for every manual you upload.