beautypg.com

1 instruction pre-fetch and fetch, 2 instruction dispatch, 3 address generation – Freescale Semiconductor StarCore SC140 User Manual

Page 184: Instruction pre-fetch and fetch -4, Instruction dispatch -4, Address generation -4

background image

5-4

SC140 DSP Core Reference Manual

Pipeline

5.1.1.1 Instruction Pre-Fetch and Fetch

The first two stages of the pipeline are the pre-fetch and fetch stages. These two stages combined are
responsible for the program memory read of the fetch set. Each fetch set contains eight instruction words.

In the pre-fetch stage, the address of the fetch set is driven into the program address bus (PAB) along with
the read strobe. This enables the memory read process. While the address is issued to memory, the fetch
counter (FC) in the PSEQ is updated for the next program memory read. Both of these operations occur in
parallel. The address can be generated by the PSEQ for:

Normal program flow

Exception program flow

Hardware loops

Change-of-flow instructions in the AGU

The fetch stage, which follows the pre-fetch stage, is dedicated to waiting for the memory access to be
completed. Memory access is completed when the PSEQ samples the program memory value from the
128-bit wide program data bus (PDB).

Since an execution set can overlap to a second fetch set, more than one fetch set is stored in a buffer.
However, the instructions in an execution set need to be dispatched together.

The SC140 core in the current implementation holds an internal fetch buffer of 4 fetch sets (128 bits each).
VLES are dispatched form this buffer to the execution units, emptying the buffer. Dispatching is unrelated
to fetching, and only the actual number of instruction words needed for execution are dispatched. Hence a
single fetch set can be "consumed" anywhere between 1 cycle (in case the VLES is 8 words long) and 8
cycles (in case it contains 8 VLES of 1 word each). The fetch unit tries to keep this buffer full as much as it
can. After reset and after every change of flow (COF), a series of 4 fetch requests is issued to the memory.
Upon dispatch of the last instruction in the fetch set, another fetch request is issued to the memory. In serial
code it means that a fetch is issued between once per cycle and once every 8 cycles, depending on the
width of the VLES that are executed. Hence the main factors that affect the density of fetches on the
program bus are the width of the execution set (number of instructions per set - affects the rate the buffer is
emptied) and the frequency of COF instructions (which trigger a fetch buffer flush and re-fill).

5.1.1.2 Instruction Dispatch

After the fetch set is read from memory to the PSEQ, the PSEQ detects which instructions are grouped into
an execution set. These instructions will be dispatched in parallel such that the number of cycles taken by
the longest instruction will determine the number of cycles for the whole execution set. The PSEQ detects
the type of instructions (such as DALU or AGU), and the AGU instructions are decoded.

5.1.1.3 Address Generation

The address generation pipeline stage is implemented in the AGU and DALU. In the DALU, the address
generation stage includes decoding the DALU instructions. However, in the AGU, the address generation
stage includes updating the address pointers as well as the actual memory accesses (driving the address and
the read/write strobes). The AGU is also responsible for address calculation when a change-of-flow
operation takes place.