

# **Topics**

### 1. Basic Processor Architecture

- 2. Different Types of Processor Architectures
- 3. RISC-V Processor Architecture
- 4. RISC-V Instruction Set Architecture
- 5. Programming RISC-V using assembly language

### **Basic Processor Architecture**

Processor Architecture refers to the design and organization of a processor's central processing unit (CPU).

### **Components of Processor:**

- Arithmetic and Logic Unit: Performs mathematical calculations.
- Control Unit: Control the overall processing of the processor.
- Decoders Unit: Convert coded instructions into signals that can control other components.
- Registers: Hold data, instructions, and addresses temporarily during processing.
- Buses: Electrical pathways that transmit data and signals between components. Types include the data bus, address bus, and control bus.





- Clock: Generates timing signals to synchronize the operations of the CPU components. The clock speed determines how many instructions per second the CPU can execute.
- **Instruction Set Architecture (ISA)**: Defines the set of instructions the CPU can execute
- Cache: Stores frequently accessed data and instructions to speed up processing.
- Memory Management Unit (MMU): Handles the translation of virtual addresses to physical addresses. Manages memory protection and caching.
- Input/Output (I/O) Interfaces: Allow the CPU to communicate with peripheral devices. Include ports and controllers for devices such as keyboards, mice, and storage.
- Power Control Unit:



### **Arithmetic Logic Unit ALU:**

In computing, an arithmetic logic unit (ALU) is a combinational digital circuit that performs arithmetic and bitwise operations on integer binary numbers.

It is a fundamental building block of many types of computing circuits, including the central processing unit (CPU) of computers, FPUs, and graphics processing units (GPUs).

### **Functions of ALU:**

| Basic Operations      | Basic Instructions                                                        |
|-----------------------|---------------------------------------------------------------------------|
| Arithmetic operations | Addition, Subtraction, Multiplication, division                           |
| Logical operations    | Logical Sum(OR), Logical Product(AND), Logical negation (NOT)             |
| Comparison            | Comparison Instruction (size compare)                                     |
| Branch                | Branch instructions to alter the instruction sequence based on conditions |

### **Registers**

- Registers are a type of computer memory built directly into the processor that is used to store and manipulate data during the execution of instructions.
- A register may hold an instruction, a storage address, or any kind of data (such as a bit sequence or individual characters).



## Registers in Processor Architecture



### **Control Unit:**

The control unit controls all the operations of the processor. It retrieves, decodes and executes the code instructions one-by-one in the order they are stored in the main memory.

It instructs the arithmetic logic unit, memory, input/output devices how to respond to the instructions of the program.



## Stages: Execution Clock Cycles





### Instruction Set Architecture (ISA)

An **Instruction Set Architecture (ISA)** is part of the abstract model of a computer that defines how the CPU is controlled by the software.

- The ISA acts as an interface between the hardware and the software, specifying both what the processor is capable of doing as well as how it gets done.
- The ISA defines the supported data types, the registers, how the hardware manages main memory, key features (such as virtual memory), which instructions a microprocessor can execute, and the input/output model of multiple ISA implementations.
- Provides:
  - **Programmability**
  - **Flexibility**
  - **Reusablility**
  - **Adaptability**
  - **Accessibility**



Machine Instructions

1101010001100101

0111010101101101

1101011101100111

0101110101101100

1111010101010011

0101011101101101

## Instruction Set Format

| 31 | 27                   | 26   | 25   | 24 | 20       | 19  | 15     | 14  | 12  | 11  | 7        | 6   | 0      |        |
|----|----------------------|------|------|----|----------|-----|--------|-----|-----|-----|----------|-----|--------|--------|
|    | func                 | t7   |      | rs | 2        | rs1 |        | fun | ct3 |     | rd       | opc | ode    | R-type |
|    | in                   | nm[: | 11:0 | )] |          | rs1 |        | fun | ct3 |     | rd       | opc | ode    | I-type |
| iı | nm[1                 | 1:5] |      | rs | 2        | rs1 |        | fun | ct3 | imr | n[4:0]   | opc | ode    | S-type |
| im | m[12                 | 10:5 | 5]   | rs | 2        | rs1 |        | fun | ct3 | imm | [4:1 11] | opc | ode    | B-type |
|    | imm[31:12] rd opcode |      |      |    |          | ode | U-type |     |     |     |          |     |        |        |
|    | imm[20 10:1 1        |      |      |    | 11 19:12 | 2]  |        |     |     | rd  | opc      | ode | J-type |        |

A form of representation of an instruction composed of fields of binary numbers."

#### **Fields of instruction:**

There are several fields of the instruction that serve a specific role in the format. Some common are fields are given below:

#### 1. Opcode:

- Specifies the operation to be performed (e.g., add, subtract, load, store).
- Determines what action the CPU should take.

#### 2. Operand:

- The data or the addresses of the data on which the operation is to be performed.
- Can include immediate values, register addresses, or memory addresses.

### 3. Addressing Modes:

Processor uses different Addressing modes Common modes include: immediate, direct, indirect, register, and indexed addressing.

#### 4. Registers:

Specifies which CPU registers are to be used in the operation.

Could include source and destination registers.



### **Instruction Types**

A computer's instructions can be any length and have any number of addresses.

- The arrangement of a computer's registers determines the different address fields in the instruction format.
- The instruction can be classified as three, two, and one address instruction or zero address instruction, depending on the number of address fields.

Based on these differences the instructions are classified as

- 1) Three Address Instruction
- 2) Two Address Instruction
- 3) One Address Instruction
- 4) Zero Address Instruction

## **Three Address Instruction:**

Three-address instruction is a format of machine instruction. It has one opcode and three address fields.

One address field is used for destination and two address fields for source.

| OPCODE | DESTINATION | SOURCE 1 | SOURCE 2 |
|--------|-------------|----------|----------|
|--------|-------------|----------|----------|

### **Example:**

| ADD | R1, A, B  | R1 = M[A] + M[B] |
|-----|-----------|------------------|
| ADD | R2, C, D  | R2 = M[C] + M[D] |
| MUL | X, R1, R2 | M[X] = R1 * R2   |

### **Two Address Instruction:**

Two-address instruction is a format of machine instruction. It has one opcode and two address fields which may be memory locations or registers..

One address field is used for destination and one address field for source.

For example, a two-address instruction might add the contents of two registers together and store the result in one of the registers.

| OPCODE I | DESTINATION | SOURCE |
|----------|-------------|--------|
|----------|-------------|--------|

### **Example**

| MOV | R1, A | R1 = M[A]      |
|-----|-------|----------------|
| ADD | R1, B | R1 = R1 + M[B] |

### **One Address Instruction:**

These instructions specify one operand or address, which typically refers to a memory location or register.

The instruction operates on the contents of that operand, and the result may be stored in the same or a different location.

For example, a one-address instruction might load the contents of a memory location into a register.

| OPCODE | DESTINATION |
|--------|-------------|
|--------|-------------|

**Example:** 

| STORE | Т | M[T] = AC |
|-------|---|-----------|
| LOAD  | С | AC = M[C] |

### **Zero Address Instruction:**

These instructions do not specify any operands or addresses. Instead, they operate on data stored in registers or memory locations implicitly defined by the instruction.

For example, a zero-address instruction might simply add the contents of two registers together without specifying the register names.

## Types of Instructions and Addressing Modes

Implied Mode

Example: CLC; Clear the carry flag, no operands needed

**Immediate Mode** 

Example: ADDI x1, x2, 10; Add immediate value 10 to register x2 and store result in x1

Register Mode

Example: MOV r0, r1; Move the contents of register r1 to register r0

Register Indirect Mode

Example: LW \$t0, 0(\$t1); Load the word at the address in \$t1 into \$t0

### **Autodecrement Mode**

Example: MOV -(R1), R0; Decrement R1 and then move the value at the new address in R1 to R0

### **Autoincrement Mode**

Example: MOV (R1)+, R0; Move the value at the address in R1 to R0, then increment R1

### Direct Address Mode

Example: LDA \$4000; Load the accumulator with the value at memory address \$4000

### **Indirect Address Mode**

Example: JMP (\$1234); Jump to the address stored at memory location \$1234

### **Indexed Addressing Mode**

Example: MOV AX, [BX+SI]; Move the value at address (BX + SI) into AX

# Important Parameters of a Processor

Clock
Data Bus
Instruction Bus
Instructions Per Cycles
Pipeline Stage

# **Topics**

- 1. Basic Processor Architecture
- 2. Different Types of Processor Architectures
- 3. RISC-V Processor Architecture
- 4. RISC-V Instruction Set Architecture
- 5. Programming RISC-V using assembly language

## Processors Types

- General Purpose Processor
- Digital Signal Processor
- Vector Processor
- Application specific Processor



## Flynn Taxonomy

 The matrix below defines the 4 possible classifications according to Flynn

| SISD                              | SIMD                                |  |
|-----------------------------------|-------------------------------------|--|
| Single Instruction, Single Data   | Single Instruction, Multiple Data   |  |
| MISD                              | MIMD                                |  |
| Multiple Instruction, Single Data | Multiple Instruction, Multiple Data |  |

## **Types of Processor ISA**

Reduced Instruction Set Computing (RISC) vs Complex Instruction Set Computing (CISC)

| Aspect                 | RISC                                         | CISC                      |  |
|------------------------|----------------------------------------------|---------------------------|--|
| Instructions Per Cycle | Small and fixed length                       | Large and variable length |  |
| Instruction Complexity | Simple and standardised                      | Complex and versatile     |  |
| Instruction Execution  | Single clock cycle                           | Several clock cycles      |  |
| RAM Usage              | Heavy use of RAM                             | More efficient use of RAM |  |
| Memory                 | Increased memory usage to store instructions | Memory efficient coding   |  |
| Cost                   | Cheaper than CISC                            | Higher                    |  |

### RISC vs CISC

The RISC approach has several advantages over CISC:

- Simplifies Hardware Implementation: It simplifies the hardware implementation of the processor, as fewer instructions need to be decoded and executed. This can lead to faster execution times and lower power consumption.
- Higher Instruction Level Parallelism: RISC processors typically have a higher instruction-level parallelism, allowing them to execute multiple instructions simultaneously, which can further improve performance.
- **Simplicity:** The simplicity of the RISC instruction set makes it easier to develop compilers and other software tools that can generate efficient code for the processor.

### RISC vs CISC

RISC is a processor design philosophy that emphasizes simplicity and efficiency by using a small set of simple and general-purpose instructions.

- The *complex instruction set computing* (CISC), employs a larger set of more complex instructions that can perform multiple operations in a single instruction.
- RISC architectures prioritize simplicity and execute one instruction per clock cycle, resulting in streamlined designs and efficient decoding.
- CISC architectures, on the other hand, employ complex instructions capable of performing multiple actions but may require several clock cycles for execution. Both the CPUs aim to enhance CPU performance.

# Single-purpose processors

Digital circuit designed to execute exactly one program a.k.a. coprocessor, accelerator or peripheral

**Features** 

Contains only the components needed to execute a single program

No program memory

**Benefits** 

**Fast** 

Low power

Small size



## Embedded System Processor Architecture

- Reduced Instruction Set Computing (RISC):
  - Common architectures: ARM, RISC-V.
  - Simple, efficient instruction set optimized for low power and high performance.
- System on Chip (SoC):
  - Frequently used in embedded systems.
  - Integrates CPU, memory, peripherals, and other components on a single chip.
- **Microcontroller Units (MCUs):** 
  - Often used in simpler embedded applications.
  - Includes integrated peripherals like ADCs, DACs, timers, and communication interfaces
- Real-Time Capabilities:
  - Designed for deterministic performance and real-time operating system (RTOS) support.
- Low Power Consumption:
  - Architectures and components optimized for minimal power usage.
- Integrated Analog and Digital Peripherals:
  - Features like GPIOs, serial communication interfaces, and specialized hardware accelerators.



ALU

Data

## Digital Signal Processor

### **Specialized Instruction Set:**

Optimized for mathematical operations like multiply-accumulate (MAC). Single-cycle multiply and MAC instructions.

### **Harvard Architecture:**

Separate program and data memories to allow simultaneous access and increase throughput.

### **Specialized Data Path:**

Multiple data buses and address buses.

Dedicated hardware for specific tasks such as FFT (Fast Fourier Transform) and filters.

### **High-Performance ALUs:**

Multiple arithmetic logic units (ALUs) to perform parallel operations.

Support for fixed-point and floating-point arithmetic.

### **Circular and Bit-Reversed Addressing:**

Efficiently manage circular buffers and data structures used in signal processing.

### **Low-Latency Memory Access:**

On-chip RAM with very low access latency.

Multi-level cache hierarchy optimized for predictable access patterns.



## General-Purpose Processor (GPP) Architecture

### Complex Instruction Set Computing (CISC):

- Common architecture: x86.
- Rich instruction set with complex instructions.
- Often integrates many features directly in hardware.

#### Multi-Core and Hyper-Threading:

- 3 Multiple cores for parallel processing.
- Hyper-threading for improved performance through parallel execution within each core.

#### Large Cache Hierarchy:

- Multiple levels of cache (L1, L2, L3) to reduce latency and increase speed.
- Advanced Branch Prediction and Speculative Execution:
- Techniques to predict instruction paths and execute ahead to improve performance.

#### Integrated Memory Management Unit (MMU):

Manages virtual memory, enabling sophisticated operating system features.

#### High-Speed Interconnects:

3 Fast communication between CPU, memory, and peripherals.

#### Graphics Processing Unit (GPU) Integration:

3 Some GPPs include integrated GPUs for handling graphics processing tasks.





# Topics

- 1. Basic Processor Architecture
- 2. Different Types of Processor Architectures
- 3. RISC-V Processor Architecture
- 4. RISC-V Instruction Set Architecture
- 5. Programming RISC-V using assembly language



### **RISC-V Processor Architecture**

The RISC-V (pronounced as risk-five) architecture is an open-source instruction set architecture (ISA) implementation of reduced instruction set computing RISC.

RISC-V is open-hardware architecture, its open source allows anyone to utilize the ISA.

### **History of RISC-V**

- Prof. Krste Asanović and graduate students Yunsup Lee and Andrew Waterman started the RISC-V instruction set in May 2010 as part of the <u>Parallel Computing Laboratory</u> (Par Lab) at UC Berkeley, of which Prof. David Patterson was Director.
- No patents were filed related to RISC-V in any of these projects, as the RISC-V ISA itself does not represent any new technology.

• RISC processor implementations—including some based on other open ISA standards— are widely available from various vendors worldwide.



## Processor Architecture

### **Base Instruction Set**

RV32I Base Integer Instruction Set, 32-bit
RV32E Base Integer Instruction Set (embedded), 32-bit
RV64I Base Integer Instruction Set, 64-bit

### **Extension:**

| Name                                                    | Description                                               |
|---------------------------------------------------------|-----------------------------------------------------------|
| M Standard Extension for Integer Multiplication and Div |                                                           |
| A Standard Extension for Atomic Instructions            |                                                           |
| F                                                       | Standard Extension for Single-Precision Floating-Point    |
| D                                                       | Standard Extension for Double-Precision Floating-Point    |
| Zicsr                                                   | Control and Status Register (CSR) Instructions            |
| Zifencei                                                | Instruction-Fetch Fence                                   |
| G                                                       | Shorthand for the IMAFDZicsr_Zifencei base and extensions |
| C                                                       | Standard Extension for Compressed Instructions            |

### **Base and Extension of RISC-V**

- Four base integer ISAs
  - RV32E, RV32I, RV64I, RV128I
  - RV32E is 16-register subset of RV32I
  - Only <50 hardware instructions needed for base
- Standard extensions
  - M: Integer multiply/divide
  - A: Atomic memory operations (AMOs + LR/SC)
  - F: Single-precision floating-point
  - D: Double-precision floating-point
  - G = IMAFD, "General-purpose" ISA
  - Q: Quad-precision floating-point
- All the above are a fairly standard RISC encoding in a fixed 32-bit instruction format
- Above user-level ISA components frozen in 2014
  - Supported forever after



# RISCV: Registers and Mapping

RISC-V uses a memory-mapped I/O architecture, which means that input and output operations, memory access, and peripheral access are all performed using the same load and store instructions.

- This unified approach simplifies the instruction set and enhances the flexibility and efficiency of the architecture. There are two basic types of instructions:
- Instructions that either load memory into registers or store data from registers into memory
- Instructions that perform arithmetical or logical operations between two registers

# Why RISC-V

**Open Hardware:** Allowing anyone to design, implement, and customize processors without restrictions, fostering innovation and collaboration within the community.

**Royalty-Free:** There are no licensing fees, reducing costs for developers and manufacturers.

**Security:** Rigorous security analysis and the implementation of custom security features, enhancing trustworthiness.

| ISA    | Pages | Words     | Hours to read | Weeks to read |
|--------|-------|-----------|---------------|---------------|
| RISC-V | 236   | 76,702    | 6             | 0.2           |
| ARM-32 | 2736  | 895,032   | 79            | 1.9           |
| x86-32 | 2198  | 2,186,259 | 182           | 4.5           |

|                        |                                                        | Mistakes of the Past                                               |                                                                                                                                  | Lessons learned                                                                                                                                                                                                           |
|------------------------|--------------------------------------------------------|--------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|                        | ARM-32 (1986)                                          | MIPS-32 (1986)                                                     | x86-32 (1978)                                                                                                                    | RV32I (2011)                                                                                                                                                                                                              |
| Cost                   | Integer multiply mandatory                             | Integer multiply<br>and divide manda-<br>tory                      | erations. Integer                                                                                                                | No 8-bit and 16-bit operations. Integer multiply and divide optional (RV32M)                                                                                                                                              |
| Simplicity             | struction execution.<br>Complex data<br>address modes. | extended immediates. Some arithmetic instruc-                      | Complex procedure<br>call/return instruc-<br>tions (enter/leave).<br>Stack instructions<br>(push/pop). Com-<br>plex data address | complex call/return or                                                                                                                                                                                                    |
| Performance            |                                                        |                                                                    | branches. At most                                                                                                                | Compare and branch instructions (no condition codes). 3 registers per instruction. No load multiple. Source and destination registers fixed in instruction format. Constant immediates. PC not a general purpose register |
| I                      | line length when                                       | Delayed load. HI and LO registers                                  | eral purpose (AX,                                                                                                                | No delayed branch. No delayed load. General purpose registers                                                                                                                                                             |
| growth<br>Program size | tions (+Thumb-2 as separate ISA)                       | opcode space Only 32-bit instructions (+microMIPS as separate ISA) | structions, but poor choices                                                                                                     | Generous available op-<br>code space  32-bit instructions + 16-<br>bit RV32C extension                                                                                                                                    |
| gramming /             | Aligned data in memory. Irregu-                        | memory. Inconsis-                                                  | PC-relative data ad-                                                                                                             | 31 registers. Data can be unaligned. PC-relative data addressing. Symmetric data address mode. Performance counters defined in architecture                                                                               |

## Types of RISC-V Processor Architectures

- RISC-V provides a detailed, open Instruction Set Architecture (ISA), which serves as a blueprint for designing processors architecture.
- Single-Cycle Architecture:
- Multi-Cycle Architecture:
- Pipelined Architecture:
- Superscalar Architecture:
- Out-of-Order Execution:
- Very Long Instruction Word (VLIW) Architecture:
- Vector Processing Architecture:
- Custom Instruction Set Extensions:



## Defining/Designing RISC-V Processor Architecture

- Fetch: Retrieve the instruction from memory.
- Decode: Interpret the instruction and prepare operands.
- Execute: Perform the computation or operation (ALU operations, branches).
- Memory: Access memory for load/store operations.
- Writeback: Write the result to the register file or memory.

## 5 Stages of Processor Arch

### Fetch Unit

Function: Retrieves instructions from memory.

PC Usage: The PC holds the address of the next instruction to be fetched. After fetching an instruction, the PC is typically incremented to point to the next instruction address. Example: If the starting address of the first instruction is 0x8000000, the Fetch Unit will

fetch the instruction from address 0x8000000 initially.

### Decode Unit

Function: Interprets the fetched instruction to determine its operation and operands. Memory Access: Decodes memory addresses and identifies whether they are for RAM,

ROM, or I/O devices. It also decodes which registers are involved.

ALU: Determines the type of ALU operation required (e.g., addition, subtraction) and prepares operands for execution.

Example: Decodes an instruction to add two registers and prepare the operands for the ALU.

#### **Execute Unit**

Function: Performs the arithmetic or logical operations as specified by the instruction.

ALU: Executes ALU operations (e.g., addition, subtraction) using the operands provided by the Decode Unit.

Memory Access: Computes effective addresses for load/store operations.

Example: Executes an addition operation on two registers or calculates the address for a load instruction.

### **Memory Unit**

Function: Accesses memory or I/O based on the address computed in the Execute stage.

Memory Access: Performs read/write operations to RAM or memory-mapped I/O devices based on the effective address.

Example: Reads data from address 0x00002000 in RAM or writes data to a memory-mapped I/O device at 0x20000000.

#### **Write Back Unit**

Function: Writes the result of computations or memory accesses back to the register file or memory.

Memory Access: Updates the register file with results from the Memory Unit or ALU operations.

Example: Writes the result of an addition operation back to a register or stores data retrieved from memory to a register.

### **Pipe-lined VS Single Cycle Processor Architecture**

- Fetch instruction from memory.
- Read registers and decode the instruction.
- 3. Execute the operation or calculate an address.
- 4. Access an operand in data memory (if necessary).
- 5. Write the result into a register (if necessary).





## **5 Stage Pipelined Processor Architecture**



## Topics

- 1. Basic Processor Architecture
- 2. Different Types of Processor Architectures
- 3. RISC-V Processor Architecture
- 4. RISC-V Instruction Set Architecture
- 5. Programming RISC-V using assembly language



### **RISCV Instructions Set**

 RISC-V (Reduced Instruction Set Computing V) is an open standard instruction set architecture (ISA) that is designed to be scalable and extensible. The number of instructions in RISC-V can vary based on the specific subset or extensions of the ISA being used. Here's a breakdown of the primary RISC-V instruction sets and their respective instruction counts:

### Base ISA:

- RV32I (32-bit Integer): The base integer instruction set for 32-bit processors includes approximately 47 instructions.
- RV64I (64-bit Integer): The base integer instruction set for 64-bit processors extends RV32I and includes a few additional instructions specific to 64 bit operations.

## Instruction Extensions

### Standard Extensions:

- > M (Multiply/Divide): Adds multiply and divide instructions.
- > A (Atomic): Adds atomic instructions for synchronization.
- F (Single-Precision Floating-Point): Adds single-precision floating-point instructions.
- > D (Double-Precision Floating-Point): Adds double-precision floating-point instructions.
- Q (Quad-Precision Floating-Point): Adds quad-precision floating-point instructions.
- > C (Compressed): Adds compressed instructions to reduce code size.

### Other Extensions:

- > B (Bit-Manipulation): Adds instructions for bit manipulation.
- > V (Vector): Adds vector processing instructions.
- P (Packed-SIMD): Adds packed SIMD instructions.
- > Z (Various small extensions): These include specific sets of instructions like Zifencei for instruction-fence or Zicsr for control and status registers.

## **Basic RISCV Processor**

### • The 47 standard instructions in RV32I include:

- > Arithmetic Instructions: ADD, SUB, MUL, etc.
- Logical Instructions: AND, OR, XOR, etc.
- > Immediate Instructions: ADDI, ORI, XORI, etc.
- Load Instructions: LB, LH, LW, etc.
- Store Instructions: SB, SH, SW, etc.
- Branch Instructions: BEQ, BNE, BLT, etc.
- > Jumps: JAL, JALR
- System Instructions: ECALL, EBREAK
- > Other Instructions: NOP, AUIPC, LUI, etc.

| 31        | 25          | 24  | 20         | 19     | 15 | 14 | 12    | 11 7        | 6       |         | 0        |           |
|-----------|-------------|-----|------------|--------|----|----|-------|-------------|---------|---------|----------|-----------|
|           |             |     | mm[31:12   |        |    |    |       | rd          |         | 0110111 |          | U lui     |
|           |             |     | mm[31:12   |        |    |    |       | rd          |         | 0010111 |          | U auipc   |
|           |             | -   | 20 10:1 11 | 19:12] |    |    |       | rd          |         | 1101111 |          | J jal     |
|           | nm[11       | :0] |            | rs1    |    | 00 |       | rd          |         | 1100111 |          | I jalr    |
| imm[12 10 |             |     | rs2        | rs1    |    | 00 |       | imm[4:1 11] |         | 1100011 |          | B beq     |
| imm[12 10 |             |     | rs2        | rs1    |    | 00 |       | imm[4:1 11] |         | 1100011 |          | B bne     |
| imm[12 10 |             | rs2 |            | rs1    |    | 10 |       | imm[4:1 11] |         | 1100011 |          | B blt     |
| imm[12 10 |             |     | rs2        | rs1    |    | 10 |       | imm[4:1 11] |         | 1100011 |          | B bge     |
| imm[12 10 |             |     | rs2        | rs1    |    | 11 |       | imm[4:1 11] |         | 1100011 |          | B bltu    |
| imm[12 10 |             |     | rs2        | rs1    |    | 11 |       | imm[4:1 11] |         | 1100011 |          | B bgeu    |
|           | nm[11       |     |            | rs1    |    | 00 |       | rd          |         | 0000011 |          | I lb      |
|           | nm[11       |     |            | rs1    |    | 00 |       | rd          |         | 0000011 |          | I lh      |
|           | nm[11       |     |            | rs1    |    | 01 |       | rd          |         | 0000011 |          | I lw      |
|           | nm[11       |     |            | rs1    |    | 10 |       | rd          |         | 0000011 |          | I lbu     |
|           | nm[11       | :0] |            | rs1    |    | 10 |       | rd          |         | 0000011 |          | I lhu     |
| imm[11:   |             |     | rs2        | rs1    |    | 00 |       | imm[4:0]    |         | 0100011 |          | S sb      |
| imm[11:   |             |     | rs2        | rs1    |    | 00 |       | imm[4:0]    |         | 0100011 |          | S sh      |
| imm[11:   |             |     | rs2        | rs1    |    | 01 |       | imm[4:0]    |         | 0100011 |          | S sw      |
|           | nm[11       |     |            | rs1    |    | 00 |       | rd          |         | 0010011 |          | I addi    |
|           | nm[11       |     |            | rs1    |    | 01 |       | rd          |         | 0010011 |          | I slti    |
|           | nm[11       |     |            | rs1    |    | 01 |       | rd          |         | 0010011 |          | I sltiu   |
|           | nm[11       |     |            | rs1    |    | 10 |       | rd          |         | 0010011 |          | I xori    |
|           | nm[11       |     |            | rs1    |    | 11 |       | rd          |         | 0010011 |          | I ori     |
|           | nm[11       | -   |            | rs1    |    | 11 |       | rd          |         | 0010011 |          | I andi    |
| 000000    |             |     | shamt      | rs1    |    | 00 |       | rd          |         | 0010011 |          | I slli    |
| 000000    |             |     | shamt      | rs1    |    | 10 |       | rd          |         | 0010011 |          | I srli    |
| 010000    |             |     | shamt      | rs1    |    | 10 |       | rd          |         | 0010011 |          | I srai    |
| 000000    |             |     | rs2        | rs1    |    | 00 |       | rd          |         | 0110011 |          | R add     |
| 010000    |             |     | rs2        | rs1    |    | 00 |       | rd          |         | 0110011 |          | R sub     |
| 000000    |             |     | rs2        | rs1    |    | 00 |       | rd          |         | 0110011 |          | R sll     |
| 000000    |             |     | rs2        | rs1    |    | 01 |       | rd          |         | 0110011 |          | R slt     |
| 000000    |             |     | rs2        | rs1    |    | 01 |       | rd          |         | 0110011 |          | R sltu    |
| 000000    |             |     | rs2        | rs1    |    | 10 |       | rd          |         | 0110011 |          | R xor     |
| 000000    |             |     | rs2        | rs1    |    | 10 |       | rd          |         | 0110011 |          | R srl     |
| 010000    |             |     | rs2        | rs1    |    | 10 |       | rd          |         | 0110011 |          | R sra     |
| 000000    |             |     | rs2        | rs1    |    | 11 |       | rd          |         | 0110011 |          | R or      |
| 000000    |             |     | rs2        | rs1    |    | 11 |       | rd          |         | 0110011 |          | R and     |
| 0000      | pre         |     | succ       | 00000  |    | 00 |       | 00000       |         | 0001111 |          | I fence   |
| 0000      | 000         |     | 0000       | 00000  |    | 00 |       | 00000       |         | 0001111 |          | I fence.i |
|           | 00000       |     |            | 00000  |    | 00 |       | 00000       |         | 1110011 |          | I ecall   |
| 000       | 00000000001 |     | 00000      | )      | 00 |    | 00000 |             | 1110011 |         | I ebreak |           |
|           | csr         |     | rs1        |        | 00 |    | rd    |             | 1110011 |         | I csrrw  |           |
| csr       |             |     | rs1        |        | 01 |    | rd    | $\perp$     | 1110011 |         | I csrrs  |           |
| csr       |             |     | rs1        |        | 01 |    | rd    | $\perp$     | 1110011 |         | I csrrc  |           |
| csr       |             |     | zimm       |        | 10 |    | rd    | $\perp$     | 1110011 |         | I csrrwi |           |
| csr       |             |     |            | zimm   |    | 11 |       | rd          | $\perp$ | 1110011 |          | I csrrsi  |
|           | csr         |     |            | zimm   |    | 11 | 1     | rd          |         | 1110011 |          | I csrrci  |

## Types of RISCV ISA

#### **RISC-V Instruction Set:**

The RISC-V instruction set is a collection of instructions that define the operations a RISC-V processor can perform.

These instructions are designed to be simple, efficient, and easily extensible, allowing for a high degree of customization and optimization.

### **Instruction Types:**

- 1. R-Type (Register Type): Used for register-register arithmetic and logical operations.
- 2. I- Type (Instruction Type): Used for immediate arithmetic, load instructions, and register-immediate operations.
- 3. S-Type (Store Type): Used for store instructions.
- 4. U-Type (Upper Immediate Type): Used for upper immediate instructions
- 5. B-Type (Branch Type): Used for conditional branch instructions.
- 6. J-Type (Jump Type): Used for jump instructions like JAL.
- 7. F-Type (Floating-Point) Instructions
- 8. A-Type (Atomic) Instructions
- 9. C-Type (Compressed) Instructions

# Registers

- Total Registers: 32 general-purpose registers, additional special-purpose and control registers.
  - General Purpose Registers: x0 to x31, with specific roles for some registers.
  - Special Purpose Registers: Includes PC, SP, GP, TP.
    - Program Counter (PC): Holds the address of the current instruction being executed.
    - Instruction Register (IR): Holds the current instruction being executed (in some implementations).
    - Stack Pointer (SP): Points to the top of the stack.
    - Global Pointer (GP): Points to the global data region.
    - Thread Pointer (TP): Points to the thread-local storage.
  - Control and Status Registers: Includes MSR, MEPC, MCAUSE, MSTATUS, MTVEC.
    - Machine Status Register (MSR): Controls machine-level status and configuration.
    - Machine Exception Program Counter (MEPC): Holds the address of the instruction where an exception occurred.
    - Machine Cause Register (MCAUSE): Contains information about the cause of the last exception.
    - Machine Status Register (MSTATUS): Holds the status of the machine, including interrupts and mode.
    - Machine Trap Vector Base Address Register (MTVEC): Base address for the trap vector.
  - Floating-Point Registers: If included, f0 to f31 for floating-point operations.

| Symbolic name | Description                                                        | Saved by                                                                                                                                                                                                                                                                                            |
|---------------|--------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|               | 32 integer registers                                               |                                                                                                                                                                                                                                                                                                     |
| Zero          | Always zero                                                        |                                                                                                                                                                                                                                                                                                     |
| ra            | Return address                                                     | Caller                                                                                                                                                                                                                                                                                              |
| sp            | Stack pointer                                                      | Callee                                                                                                                                                                                                                                                                                              |
| gp            | Global pointer                                                     |                                                                                                                                                                                                                                                                                                     |
| tp            | Thread pointer                                                     |                                                                                                                                                                                                                                                                                                     |
| t0            | Temporary / alternate return address                               | Caller                                                                                                                                                                                                                                                                                              |
| t1–2          | Temporary                                                          | Caller                                                                                                                                                                                                                                                                                              |
| s0/fp         | Saved register / frame pointer                                     | Callee                                                                                                                                                                                                                                                                                              |
| s1            | Saved register                                                     | Callee                                                                                                                                                                                                                                                                                              |
| a0-1          | Function argument / return value                                   | Caller                                                                                                                                                                                                                                                                                              |
| a2-7          | Function argument                                                  | Caller                                                                                                                                                                                                                                                                                              |
| s2-11         | Saved register                                                     | Callee                                                                                                                                                                                                                                                                                              |
| t3–6          | Temporary                                                          | Caller                                                                                                                                                                                                                                                                                              |
|               | name  Zero  ra  sp  gp  tp  t0  t1–2  s0/fp  s1  a0–1  a2–7  s2–11 | Terro Always zero  ra Return address  sp Stack pointer  gp Global pointer  tp Thread pointer  t0 Temporary / alternate return address  t1–2 Temporary  s0/fp Saved register / frame pointer  s1 Saved register  a0–1 Function argument / return value  a2–7 Function argument  s2–11 Saved register |

## **RISC-V Instruction Format**

| 31 | 27          | 26   | 25   | 24 | 20        | 19      | 15 | 14  | 12  | 11   | 7       | 6     | 0  |
|----|-------------|------|------|----|-----------|---------|----|-----|-----|------|---------|-------|----|
|    | func        | t7   |      | rs | 32        | rs1     |    | fun | ct3 |      | rd      | opcod | le |
|    | in          | nm[: | 11:0 | )] |           | rs1     |    | fun | ct3 |      | rd      | opcod | le |
| ir | nm[1        | 1:5] |      | rs | <b>2</b>  | rs1     |    | fun | ct3 | imn  | n[4:0]  | opcod | le |
| im | m[12        | 10:5 | 5]   | rs | <u> 2</u> | rs1     |    | fun | ct3 | imm[ | 4:1 11] | opcod | le |
|    |             |      |      | im | m[31      | :12]    |    |     |     |      | rd      | opcod | le |
|    | imm[20 10:1 |      |      |    |           | 11 19:1 | 2] |     |     |      | rd      | opcod | le |

R-type I-type S-type B-type U-type J-type

### RISC-V Instructions and Formats

### **RV32I Base Integer Instructions**

| Inst  | Name                    | FMT | Opcode  | funct3 | funct7         | Description (C)      | Note         |
|-------|-------------------------|-----|---------|--------|----------------|----------------------|--------------|
| add   | ADD                     | R   | 0110011 | 0x0    | 0x00           | rd = rs1 + rs2       |              |
| sub   | SUB                     | R   | 0110011 | 0x0    | 0x20           | rd = rs1 - rs2       |              |
| xor   | XOR                     | R   | 0110011 | 0x4    | 0x00           | rd = rs1 ^ rs2       |              |
| or    | OR                      | R   | 0110011 | 0x6    | 0x00           | rd = rs1   rs2       |              |
| and   | AND                     | R   | 0110011 | 0x7    | 0x00           | rd = rs1 & rs2       |              |
| sll   | Shift Left Logical      | R   | 0110011 | 0x1    | 0x00           | rd = rs1 << rs2      |              |
| srl   | Shift Right Logical     | R   | 0110011 | 0x5    | 0x00           | rd = rs1 >> rs2      |              |
| sra   | Shift Right Arith*      | R   | 0110011 | 0x5    | 0x20           | rd = rs1 >> rs2      | msb-extends  |
| slt   | Set Less Than           | R   | 0110011 | 0x2    | 0x00           | rd = (rs1 < rs2)?1:0 |              |
| sltu  | Set Less Than (U)       | R   | 0110011 | 0x3    | 0x00           | rd = (rs1 < rs2)?1:0 | zero-extends |
| addi  | ADD Immediate           | I   | 0010011 | 0x0    |                | rd = rs1 + imm       |              |
| xori  | XOR Immediate           | I   | 0010011 | 0x4    |                | rd = rs1 ^ imm       |              |
| ori   | OR Immediate            | I   | 0010011 | 0x6    |                | rd = rs1   imm       |              |
| andi  | AND Immediate           | I   | 0010011 | 0x7    |                | rd = rs1 & imm       |              |
| slli  | Shift Left Logical Imm  | I   | 0010011 | 0x1    | imm[5:11]=0x00 | rd = rs1 << imm[0:4] |              |
| srli  | Shift Right Logical Imm | I   | 0010011 | 0x5    | imm[5:11]=0x00 | rd = rs1 >> imm[0:4] |              |
| srai  | Shift Right Arith Imm   | I   | 0010011 | 0x5    | imm[5:11]=0x20 | rd = rs1 >> imm[0:4] | msb-extends  |
| slti  | Set Less Than Imm       | I   | 0010011 | 0x2    |                | rd = (rs1 < imm)?1:0 |              |
| sltiu | Set Less Than Imm (U)   | I   | 0010011 | 0x3    |                | rd = (rs1 < imm)?1:0 | zero-extends |

| sb     | Store Byte          | S | 0100011 | 0x0 |         | M[rs1+imm][0:7] = rs2[0:7]   |              |
|--------|---------------------|---|---------|-----|---------|------------------------------|--------------|
| sh     | Store Half          | S | 0100011 | 0x1 |         | M[rs1+imm][0:15] = rs2[0:15] |              |
| SW     | Store Word          | S | 0100011 | 0x2 |         | M[rs1+imm][0:31] = rs2[0:31] |              |
| beq    | Branch ==           | В | 1100011 | 0x0 |         | if(rs1 == rs2) PC += imm     |              |
| bne    | Branch !=           | В | 1100011 | 0x1 |         | if(rs1 != rs2) PC += imm     |              |
| blt    | Branch <            | В | 1100011 | 0x4 |         | if(rs1 < rs2) PC += imm      |              |
| bge    | Branch ≥            | В | 1100011 | 0x5 |         | if(rs1 >= rs2) PC += imm     |              |
| bltu   | Branch < (U)        | В | 1100011 | 0x6 |         | if(rs1 < rs2) PC += imm      | zero-extends |
| bgeu   | Branch $\geq$ (U)   | В | 1100011 | 0x7 |         | if(rs1 >= rs2) PC += imm     | zero-extends |
| jal    | Jump And Link       | J | 1101111 |     |         | rd = PC+4; PC += imm         |              |
| jalr   | Jump And Link Reg   | I | 1100111 | 0x0 |         | rd = PC+4; PC = rs1 + imm    |              |
| lui    | Load Upper Imm      | U | 0110111 |     |         | rd = imm << 12               |              |
| auipc  | Add Upper Imm to PC | U | 0010111 |     |         | rd = PC + (imm << 12)        |              |
| ecall  | Environment Call    | I | 1110011 | 0x0 | imm=0x0 | Transfer control to OS       |              |
| ebreak | Environment Break   | I | 1110011 | 0x0 | imm=0x1 | Transfer control to debugger |              |

### **RV32M Multiply Extension**

| Inst  | Name             | FMT | Opcode  | funct3 | funct7 | Description (C)         |
|-------|------------------|-----|---------|--------|--------|-------------------------|
| mul   | MUL              | R   | 0110011 | 0x0    | 0x01   | rd = (rs1 * rs2)[31:0]  |
| mulh  | MUL High         | R   | 0110011 | 0x1    | 0x01   | rd = (rs1 * rs2)[63:32] |
| mulsu | MUL High (S) (U) | R   | 0110011 | 0x2    | 0x01   | rd = (rs1 * rs2)[63:32] |
| mulu  | MUL High (U)     | R   | 0110011 | 0x3    | 0x01   | rd = (rs1 * rs2)[63:32] |
| div   | DIV              | R   | 0110011 | 0x4    | 0x01   | rd = rs1 / rs2          |
| divu  | DIV (U)          | R   | 0110011 | 0x5    | 0x01   | rd = rs1 / rs2          |
| rem   | Remainder        | R   | 0110011 | 0x6    | 0x01   | rd = rs1 % rs2          |
| remu  | Remainder (U)    | R   | 0110011 | 0x7    | 0x01   | rd = rs1 % rs2          |

### **RV32A Atomic Extension**

| :    | 31     | 27  | 26 | 25 | 24  | 20 1   | 9       | 15 1 | 14 12  | 11            | 7 6    | 0 |
|------|--------|-----|----|----|-----|--------|---------|------|--------|---------------|--------|---|
|      | funct5 |     | aq | rl | r   | s2     | rs1     |      | funct3 | rd            | opcode |   |
|      | 5      |     | 1  | 1  |     | 5      | 5       |      | 3      | 5             | 7      |   |
| [mat | N      | ama |    |    | EMT | Opendo | funct 2 | funo | 45 Doc | parintian (C) |        |   |

| Inst      | Name              | FMT | Opcode  | funct3 | funct5 | Description (C)                         |
|-----------|-------------------|-----|---------|--------|--------|-----------------------------------------|
| lr.w      | Load Reserved     | R   | 0101111 | 0x2    | 0x02   | rd = M[rs1], reserve M[rs1]             |
| SC.W      | Store Conditional | R   | 0101111 | 0x2    | 0x03   | if (reserved) { M[rs1] = rs2; rd = 0 }  |
|           |                   |     |         |        |        | else { rd = 1 }                         |
| amoswap.w | Atomic Swap       | R   | 0101111 | 0x2    | 0x01   | rd = M[rs1]; swap(rd, rs2); M[rs1] = rd |
| amoadd.w  | Atomic ADD        | R   | 0101111 | 0x2    | 0x00   | rd = M[rs1] + rs2; M[rs1] = rd          |
| amoand.w  | Atomic AND        | R   | 0101111 | 0x2    | 0x0C   | rd = M[rs1] & rs2; M[rs1] = rd          |
| amoor.w   | Atomic OR         | R   | 0101111 | 0x2    | 0x0A   | rd = M[rs1]   rs2; M[rs1] = rd          |
| amoxor.w  | Atomix XOR        | R   | 0101111 | 0x2    | 0x04   | rd = M[rs1] ^ rs2; M[rs1] = rd          |
| amomax.w  | Atomic MAX        | R   | 0101111 | 0x2    | 0x14   | rd = max(M[rs1], rs2); M[rs1] = rd      |
| amomin.w  | Atomic MIN        | R   | 0101111 | 0x2    | 0x10   | rd = min(M[rs1], rs2); M[rs1] = rd      |

## RISC-V Instruction Format: 1: R-Type

31 25 24 20 19 15 14 12 11 7 6 0 func7 rs2 rs1 func3 rd opcode

| Field   | No. O f bits | Function                                                                           |
|---------|--------------|------------------------------------------------------------------------------------|
| opcode: | R1, A, B     | Basic operation of the instruction, and this abbreviation is its traditional name. |
| oprand: | R2, C, D     | The register destination operand. It gets the result of the operation              |
| funct3: | X, R1, R2    | An additional opcode field.                                                        |
| rs1:    |              | The first register source operand.                                                 |
| rs2:    |              | The second register source operand.                                                |
| func7   |              | An additional opcode field.                                                        |

#### Assembly

#### Field Values

#### **Machine Code**

|                            |          | funct7 | rs2    | rs1    | funct3 | rd     | op     | fund  |
|----------------------------|----------|--------|--------|--------|--------|--------|--------|-------|
| add s2, s3,<br>add x18,x19 |          | 0      | 20     | 19     | 0      | 18     | 51     | 0000  |
| sub t0, t1,<br>sub x5, x6  | t2<br>x7 | 32     | 7      | 6      | 0      | 5      | 51     | 0100  |
| sub x5, x6                 | , 2.1    | 7 bits | 5 bits | 5 bits | 3 bits | 5 bits | 7 bits | 7 bit |

| funct7   | rs2    | rs1    | funct3 | rd      | op        |              |
|----------|--------|--------|--------|---------|-----------|--------------|
| 0000,000 | 1,0100 | 1001,1 | 000,   | 10010   | 011,0011, | (0x01498933) |
| 0100,000 | 00111  | 0011,0 | 000,   | ,0010,1 | 011,0011  | (0x407302B3) |
| 7 bits   | 5 bits | 5 bits | 3 bits | 5 bits  | 7 bits    |              |

# Immediate: I-Type

31 20 19 15 14 12 11 7 6 0 imm[11:0] rs1 func3 rd opcode

| Field   | No. O f bits | Function                                                                           |
|---------|--------------|------------------------------------------------------------------------------------|
| opcode: | R1, A, B     | Basic operation of the instruction, and this abbreviation is its traditional name. |
| rd:     | R2, C, D     | The register destination operand. It gets the result of the operation              |
| funct3: | X, R1, R2    | An additional opcode field.                                                        |
| rs1:    |              | The first register source operand.                                                 |
| imm     |              | The second register source operand.                                                |

| Assembly |                            |                     | Field  | Value  | S      |        | Machine Code        |        |        |        |          |              |
|----------|----------------------------|---------------------|--------|--------|--------|--------|---------------------|--------|--------|--------|----------|--------------|
| 12802000 |                            | imm <sub>11:0</sub> | rs1    | funct3 | rd     | ор     | imm <sub>11:0</sub> | rs1    | funct3 | rd     | op       |              |
|          | s0, s1, 12<br>x8, x9, 12   | 12                  | 9      | 0      | 8      | 19     | 0000 0000 1100      | 01001  | 000    | 01000  | 001 0011 | (0x00C48413) |
|          | s2, t1, -14<br>x18,x6, -14 | -14                 | 6      | 0      | 18     | 19     | 1111 1111 0010      | 00110  | 000    | 10010  | 001 0011 | (0xFF230913) |
| lw<br>lw | t2, -6(s3)<br>x7, -6(x19)  | -6                  | 19     | 2      | 7      | 3      | 1111 1111 1010      | 10011  | 010    | 00111  | 000 0011 | (0xFFA9A383) |
| lh<br>lh | s1, 27(zero)<br>x9, 27(x0) | 27                  | 0      | 1      | 9      | 3      | 0000 0001 1011      | 00000  | 001    | 01001  | 000 0011 | (0x01B01483) |
| 1b       | s4, 0x1F(s4)               | 0x1F                | 20     | 0      | 20     | 3      | 0000 0001 1111      | 10100  | 000    | 10100  | 000 0011 | (0x01FA0A03) |
| lb       | x20,0x1F(x20)              | 12 bits             | 5 bits | 3 bits | 5 bits | 7 bits | 12 bits             | 5 bits | 3 bits | 5 bits | 7 bits   |              |

# **Store: S-Type**

31 25 24 20 19 15 14 12 11 7 6 0 imm[11:5] rs2 rs1 func3 imm[4:0] opcode

| i         | No. O f bits | Function                                                                           |
|-----------|--------------|------------------------------------------------------------------------------------|
| opcode:   | R1, A, B     | Basic operation of the instruction, and this abbreviation is its traditional name. |
| rd:       | R2, C, D     | The register destination operand. It gets the result of the operation              |
| imm[4:0]: | X, R1, R2    | An additional opcode field.                                                        |
| rs1:      |              | The first register source operand.                                                 |
| rs2:      |              | The second register source operand.                                                |
| imm[11:5] |              | An additional opcode field.                                                        |

**#x1 based address SW x2, 0(x1) # Memory[x1 + 0] = x2** 

# **Branch: B-Type**

20 12 11 31 19 15 14 7 6 25 24 0 imm[11:5] imm[4:0] opcode func3 rs2 rs1

| i         | No. O f bits | Function                                                                           |
|-----------|--------------|------------------------------------------------------------------------------------|
| opcode:   | R1, A, B     | Basic operation of the instruction, and this abbreviation is its traditional name. |
| rd:       | R2, C, D     | The register destination operand. It gets the result of the operation              |
| imm[4:0]: | X, R1, R2    | An additional opcode field.                                                        |
| rs1:      |              | The first register source operand.                                                 |
| rs2:      |              | The second register source operand.                                                |
| imm[11:5] |              | An additional opcode field.                                                        |

## Jump: J-Type

- Unconditional jump with an optional link to store the return address.
  - opcode (7 bits): Operation code that specifies the jump instruction (e.g., JAL).
  - <sup>3</sup> rd (5 bits): Destination register for the return address.
  - immediate (20 bits): Jump target offset, which is used to calculate the jump address relative to the current program counter (PC).

# Topics

- 1. Basic Processor Architecture
- 2. Different Types of Processor Architectures
- 3. RISC-V Processor Architecture
- 4. RISC-V Instruction Set Architecture
- 5. Programming RISC-V using assembly language





# RISCV GCC Assembler

\_start:

ld s3 0x001121

Ld rs2 0x0022233

add rd, rs1, rs2

St rd 0x0000001

# Assembly or C/C++

- Write Efficient Code
- Secure Application
- Multi-Threaded and Complex Program to run multiple devices (OS)
- Real-Time Applications for Real world Problems

## **Programming RISC-V**

- Problem
- Write it in your own words
- Make Pseudo Code
- Create Control and Data-flow Graph
- Program (C/C++, ASM)
- Debug
- Profile
- Optimize/Fine Tune
- Execute
- Test



## Hazards

- Data Hazards: Instructions are waiting for data from other instructions.
- Control Hazards: Changes in instruction flow cause delays.
- Structural Hazards: Limited hardware resources cause delays.



```
// example.c
int global var = 10;
int main() {
   int local var = 5;
   int result = global var +
local var;
   return result;
riscv32-unknown-elf-gcc example.o -o example
```

 The compiler generates an object file in ELF format. This object file contains machine code, data, and metadata, organized into different sections like .text (code), .data (initialized data), and .bss (uninitialized data).

- Instruction Section: Contains the compiled machine code instructions (text section).
- Data Section: Contains initialized data (data section).
- The linker combines the code and data sections, resolves symbols, and sets up memory addresses.
- The linker script defines how different sections are mapped into the memory of the microcontroller.
- It specifies memory regions and assigns addresses to different sections of the code and data.

```
    MEMORY

    ROM (rx) : ORIGIN = 0x08000000, LENGTH = 512K
    RAM (rwx): ORIGIN = 0x20000000, LENGTH = 64K
• }

    SECTIONS

    .text:{
      *(.text)
    } > ROM
    .data : {
      *(.data)
    } > RAM
```

- Next step involves using a programmer or debugger tool to flash the firmware into the RISCV System.
  - Instruction Memory: The code from the .text section is loaded into the system instruction memory.
  - Data Memory: The initialized data from the .data section is loaded into the system data memory.

## Linker Script: Program and Data Memory Allocation

The high addresses are the top of the figure and the low addresses are the bottom.

The stack pointer (sp) starts at BFFF FFF0 hex and grows down toward the Static data. The text (program code) starts at 0001 0000hex and includes the statically-linked libraries.

The Static data starts immediately above the text region; in this example, we assume that address is 1000 0000hex. Dynamic data, allocated in C by malloc(), is just above the Static data. Called the heap, it grows upward toward the stack. It includes the dynamically-linked libraries.



# Testing and Executing the Code

**RIPES** 

https://ripes.me/

https://github.com/mortbopet/Ripes/releases/download/v2.2.6/Ripes-v2.2.6-win-x86\_64.zip

Next:

**RISCV Micro Controller** 

**RISCV Simulator and Emulators** 

RISCV Single Board Computer

