1a) Illustrate the concept of efficient and optimized usage of structure in ARM C
Compiler with respect to arrangement and size.
Every data type has alignment requirements (mandated by the processor architecture, not by the language). A processor's processing word length matches its data bus width; on a 32-bit machine, the processing word size is 4 bytes.
If a 4-byte integer is allocated at an address X that is a multiple of 4, the processor needs only one memory cycle to read the entire integer. If the integer is instead allocated at an address that is not a multiple of 4, it spans two rows of the memory banks, as shown in figure 3.3, and requires two memory read cycles to fetch.
Load and store instructions are only guaranteed to load and store values whose addresses are aligned to the size of the access width.
Therefore ARM compilers will automatically align the start address of a structure to a multiple of the
largest access width used within the structure (usually four or eight bytes) and align entries within
structures to their access width by inserting padding.
Example:
struct {
    char a;
    int b;
    char c;
    short d;
};
For a little-endian memory system the compiler will lay this out adding padding to ensure that the next
object is aligned to the size of that object:
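(Layout reconstructed assuming a 4-byte int and a 2-byte short.)
offset 0       char a
offsets 1-3    padding
offsets 4-7    int b
offset 8       char c
offset 9       padding
offsets 10-11  short d
Total size: 12 bytes.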
To improve the memory usage, you should reorder the elements:
struct {
    char a;
    char c;
    short d;
    int b;
};
This reduces the structure size from 12 bytes to 8 bytes, with the following new layout:
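(Layout reconstructed under the same assumptions.)
offset 0       char a
offset 1       char c
offsets 2-3    short d
offsets 4-7    int b
Total size: 8 bytes.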
The following rules generate a structure with the elements packed for maximum efficiency:
a) Place all 8-bit elements at the start of the structure.
b) Place all 16-bit elements next, then 32-bit, then 64-bit.
c) Place all arrays and larger elements at the end of the structure.
d) If the structure is too big for a single instruction to access all the elements, then group the elements
into substructures. The compiler can maintain pointers to the individual substructures.
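A minimal sketch of rule (d), with hypothetical type and field names; grouping related fields into substructures lets the compiler hold a base pointer to each group in a register:

struct position { short x, y, z, pad; };
struct motion   { int vx, vy, vz, flags; };

struct particle {
    struct position pos;
    struct motion   mov;
};

void reset_position(struct particle *p)
{
    struct position *pp = &p->pos;  /* base pointer kept in a register */
    pp->x = 0;
    pp->y = 0;
    pp->z = 0;
}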
Summary
For Efficient Structure Arrangement we need to consider the following points:
Lay structures out in order of increasing element size. Start the structure with the smallest elements
and finish with the largest.
Avoid very large structures. Instead use a hierarchy of smaller structures.
For portability, manually add padding (that would otherwise appear implicitly) into API structures so that the layout of the structure does not depend on the compiler; a sketch follows this list.
Beware of using enum types in API structures. The size of an enum type is compiler dependent.
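A minimal sketch of the manual-padding point above, with hypothetical field names; the explicit pad byte occupies the space the compiler would otherwise insert implicitly:

/* The layout is now fixed by the source, not by the compiler. */
struct api_message {
    char  type;        /* offset 0 */
    char  pad0;        /* explicit padding (would otherwise be implicit) */
    short length;      /* offsets 2-3 */
    int   payload_id;  /* offsets 4-7 */
};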
1b) Design for implementation an ARM C compiler oriented C program to print the list of all even numbers between 0 and 100.
#include <stdio.h>

int main(void) {
    unsigned int i;
    /* step by 2 so only even values between 0 and 100 are printed */
    for (i = 0; i <= 100; i += 2) {
        printf("%u\n", i);
    }
    return 0;
}
1c) Illustrate the concept of how registers are allocated to optimize the program.
Register allocation is a critical optimization technique in ARM C programming that assigns variables to
processor registers to improve execution speed. Efficient register allocation reduces memory access
latency, minimizes spilling to memory, and enhances overall performance.
Concept Illustration:
1. Basic Principle: The compiler tries to assign frequently used and live variables to the limited set of ARM
registers (r0 to r12, with some reserved). Variables that are actively used within a small scope or loop
are prioritized for register assignment.
2. Allocation Strategy:
• Prioritize Variables in Hot Loops: Variables that are used inside inner loops are given registers to avoid
repeated memory loads/stores.
• Limit the Number of Active Variables: As a general rule, limit the number of live local variables in a function to about 12, to match the available register count and avoid spilling.
3. Example – Loop Optimization:
Suppose you want to sum even numbers between 0 and 100:
#include <stdio.h>

int main(void) {
    int sum = 0;
    unsigned int i;
    for (i = 0; i <= 100; i += 2) {
        sum += i;  /* i and sum stay in registers across iterations */
    }
    printf("Sum: %d\n", sum);
    return 0;
}
• Register Allocation:
The compiler allocates i and sum to registers (r0, r1) to avoid reading and writing to memory during
each iteration.
4. Spilling (if necessary): If more variables are needed than available registers, some variables temporarily spill to memory. The compiler attempts to minimize spilling by:
• Reusing registers when variables go out of scope.
• Prioritizing variables used within loops or critical sections.
5. Impact: Using registers for loop counters and accumulators:
• Eliminates repeated memory loads/stores
• Reduces instruction count
• Achieves faster execution
Summary: Register allocation assigns the most frequently used variables within a scope to the limited ARM registers, reducing memory accesses and instruction overhead and thereby maximizing performance. Efficient register use is crucial in embedded systems where resources are limited.
3a) Discuss the concept of Exception, Exception handling and Vector Table
An exception is an unexpected event during program execution that causes the processor to halt normal operations and handle the situation. The following events can cause an exception:
a) Reset
b) Undefined instruction
c) Software interrupt
d) Prefetch abort
e) Data abort
f) Interrupt request
When an exception occurs, the processor switches to a specific mode, saves its state, jumps to a
handler routine to manage the event, and then resumes normal execution after the issue is addressed.
Exceptions are crucial for system stability and error management.
Exception Handling
Exception handling involves specialized software routines called exception handlers that determine the cause of the exception and execute the necessary response. When an exception occurs, the ARM core automatically switches to a specific processor mode associated with that exception. During this process, the core saves the current program state (the cpsr, Current Program Status Register) into the saved program status register (spsr) banked for that mode, and saves the address of the interrupted instruction (the pc) into a link register (lr). It then loads the program counter (pc) with the address of the corresponding exception handler from the vector table. After servicing the exception, the handler restores the processor's state and resumes normal operation.
Vector Table
The vector table is a table of addresses that the ARM core branches to when an exception is raised.
These addresses contain branch instructions. The memory map address 0x00000000 is reserved for the
vector table, a set of 32-bit words. On some processors the vector table can be optionally located at a
higher address in memory (starting at the offset 0xffff0000).
The branch instruction can be any of the following forms:
B <address>
LDR pc, [pc, #offset]
LDR pc, [pc, #-0xff0]
MOV pc, #immediate
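A minimal C sketch of how a handler can be installed through the vector table, assuming classic ARM vectors at 0x00000000 that hold instructions rather than plain addresses; the constant 0xe59ff000 is the encoding of the LDR pc, [pc, #offset] form above:

#include <stdint.h>

#define IRQ_VECTOR 0x18u  /* address of the IRQ entry in the vector table */

/* Write "LDR pc, [pc, #offset]" at the IRQ vector. The pc reads as
   vector + 8, so offset = (location holding the handler address) -
   vector - 8, and it must fit the instruction's 12-bit immediate. */
static void install_irq_vector(uint32_t handler_addr_location)
{
    uint32_t offset = handler_addr_location - IRQ_VECTOR - 8u;
    *(volatile uint32_t *)IRQ_VECTOR = 0xe59ff000u | offset;
}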
3b) Illustrate the concept of Interrupt Latency and strategy to reduce it.
Interrupt latency is the time interval from an external interrupt request signal being raised to the first fetch of an instruction of the corresponding interrupt service routine (ISR).
Interrupt latency depends on a combination of hardware and software.
The system designer must balance the design to handle multiple simultaneous interrupt sources while minimizing interrupt latency.
If the interrupts are not handled in a timely manner, then the system will exhibit slow response times.
Software handlers have two main methods to minimize interrupt latency.
1) Nested interrupt handler,
2) Prioritization.
Nested interrupt handler
A nested interrupt handler allows other interrupts to occur even while it is servicing an existing interrupt.
This is achieved by re-enabling the interrupts as soon as the interrupt source has been serviced but before the interrupt handling is complete.
Once a nested interrupt has been serviced, control is relinquished to the original interrupt service routine. Figure 4.3 shows a three-level nested interrupt.
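A minimal C sketch of this idea; read_interrupt_source, acknowledge_interrupt, and service are hypothetical stubs standing in for a real interrupt controller, and enable_irq is sketched under 3c:

#include <stdint.h>

/* Hypothetical controller interface - illustrative stubs only. */
static uint32_t read_interrupt_source(void) { return 0; }
static void acknowledge_interrupt(uint32_t src) { (void)src; }
static void service(uint32_t src) { (void)src; }

extern void enable_irq(void);  /* clears the IRQ mask bit; see 3c */

void nested_irq_handler(void)
{
    uint32_t src = read_interrupt_source();
    acknowledge_interrupt(src);  /* service/clear the source first   */
    enable_irq();                /* re-enable IRQ before completion  */
    service(src);                /* longer work can now be preempted */
}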
Prioritization
We can program the interrupt controller to ignore interrupts of the same or lower priority than the interrupt we are presently handling, so only a higher-priority task can interrupt our handler. We then re-enable the interrupts. The processor spends time in the lower-priority interrupts until a higher-priority interrupt occurs, so higher-priority interrupts have a lower average interrupt latency than lower-priority ones.
Prioritization reduces latency by speeding up the completion time of the critical, time-sensitive interrupts.
3c) Discuss Enabling and disabling of IRQ and FIQ via programming CPSR.
The ARM processor core has a simple procedure to manually enable and disable interrupts by
modifying the cpsr when the processor is in a privileged mode.
The procedure uses three ARM instructions:
1) The instruction MRS copies the contents of the cpsr into register r1.
2) The instruction BIC clears the IRQ or FIQ mask bit.
3) The instruction MSR then copies the updated contents of register r1 back into the cpsr, enabling the interrupt request.
Table 4.5 shows how IRQ and FIQ interrupts are enabled.
The postfix _c identifies that the bit field being updated is the control field, bits [7:0] of the cpsr.
Table 4.6 shows procedure to disable or mask an interrupt request.
To enable and disable both the IRQ and FIQ exceptions, the immediate value in the data processing BIC or ORR instruction has to be changed to 0xc0, as the sketch below shows.
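A minimal sketch in C with GCC-style inline assembly, assuming a privileged mode; bit 7 (0x80) masks IRQ and bit 6 (0x40) masks FIQ:

/* Enable IRQ: read the cpsr, clear the I bit (bit 7), and write the
   control field back (the compiler typically emits BIC for & ~mask). */
void enable_irq(void)
{
    unsigned int tmp;
    __asm__ volatile ("MRS %0, cpsr" : "=r"(tmp));
    __asm__ volatile ("MSR cpsr_c, %0" : : "r"(tmp & ~0x80u));
}

/* Disable IRQ: set the I bit so further IRQ requests are masked. */
void disable_irq(void)
{
    unsigned int tmp;
    __asm__ volatile ("MRS %0, cpsr" : "=r"(tmp));
    __asm__ volatile ("MSR cpsr_c, %0" : : "r"(tmp | 0x80u));
}

/* Use 0x40u for FIQ alone, or 0xc0u to affect IRQ and FIQ together. */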
The interrupt request is enabled or disabled only once the MSR instruction has completed the execution stage of the pipeline. Interrupts can still be raised or masked before the MSR completes this stage.
5a) Explain the basic architecture of cache memory.
The basic architecture of cache memory consists of three main parts for each cache line: a directory
store, a data section, and status information.
1. Directory Store (Cache-Tag):
The directory store, often referred to as the cache-tag, is a dedicated storage area within each cache
line that holds a portion of the main memory address, known as the tag. This tag serves to identify the
origin of the data stored in the cache line. When the processor requests data, the cache controller
compares the tag portion of the requested address with the stored cache-tag to determine if the
required data is already present in the cache (a cache hit) or not (a cache miss). This tag comparison is the fundamental lookup step that keeps cache accesses consistent with main memory.
2. Data Section:
The data section contains the actual data fetched from main memory and stored in the cache line.
When the processor accesses a specific memory location, it retrieves the entire cache line from this
data section, which includes multiple words (e.g., four 32-bit words in a line). Loading an entire line at
once exploits the principle of locality of reference, improving access times for subsequent data
requests within the same line. This organization ensures faster data retrieval compared to accessing
main memory directly.
3. Status Bits:
The cache line maintains several status bits that indicate its current state and integrity:
• Valid Bit: This bit indicates whether the cache line contains valid, usable data. If set to '1', the data in
this line is current and can be used by the processor. If it is '0', the data is invalid—possibly because it
has been invalidated or not yet initialized. The valid bit prevents the processor from using stale or
uninitialized data.
• Dirty Bit: This bit indicates whether the data in the cache line has been modified (written to) but not
yet written back to main memory. If the dirty bit is '1', it means the cache contains updated data that
must be written back to main memory before the cache line can be replaced or invalidated. This
ensures data consistency between the cache and main memory during cache line eviction or
replacement.
4. Cache Lines:
A cache is composed of multiple cache lines, each capable of storing a block of data (often multiple
words). Each line is identified by its position within the cache, organized to allow efficient lookup and
management. When a data request occurs, the cache controller identifies the appropriate line using
address fields, and then it compares tags and status bits to determine if the data can be used directly
or needs to be fetched from main memory.
5. Address Fields (Tag, Set Index, Data Index):
The address of a memory request is divided into several fields, each serving a specific purpose within
the cache architecture:
• Tag: The tag is a subset of the address used to identify the specific block in main memory that the
cache line may contain. During a cache lookup, the cache controller compares the address's tag with
the stored cache-tag to verify if the data corresponds to the requested address.
• Set Index: This field determines which set (or group) within the cache to examine. In set-associative or
direct-mapped caches, the set index narrows down the search to a specific subset of cache lines, thus
improving lookup efficiency.
• Data Index: The data index specifies the particular word, byte, or sub-word within the cache line. It
enables the cache controller to select the exact piece of data requested by the processor from within
the cache line.
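A minimal sketch of the address-field split in C, assuming a hypothetical cache with 16-byte lines (four 32-bit words) and 256 sets; the field widths follow from those sizes:

#include <stdint.h>

#define LINE_BYTES 16u   /* 4 words per line -> 4 data-index bits */
#define NUM_SETS   256u  /* 256 sets         -> 8 set-index bits  */

/* Split a 32-bit address into data index, set index, and tag. */
static void split_address(uint32_t addr,
                          uint32_t *tag, uint32_t *set, uint32_t *idx)
{
    *idx = addr % LINE_BYTES;               /* byte within the line */
    *set = (addr / LINE_BYTES) % NUM_SETS;  /* which set to examine */
    *tag = addr / (LINE_BYTES * NUM_SETS);  /* remaining high bits  */
}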
5b) With a neat block diagram explain associative cache (set-associative cache).
An associative cache, specifically a set-associative cache, is a type of cache memory designed to reduce
conflicts and improve hit rates compared to direct-mapped caches.
Key points about set-associative cache:
- The cache is divided into multiple sets.
- Each set contains a fixed number of cache lines, called "ways" (e.g., 4-way, 8-way).
- A memory address is divided into three fields: tag, set index, and data (or word) index.
- The set index determines which set in the cache might contain the data.
- Within that set, the cache checks multiple lines (ways) simultaneously to find a matching tag (using
hardware such as Content Addressable Memory, CAM).
- If a matching tag is found (a hit), the data is retrieved from that cache line.
- If no match (a miss), the cache replaces one of the lines in the set, usually using a replacement policy
like least recently used (LRU).
Advantages:
- Reduced conflict misses compared to direct-mapped caches.
- More flexible placement of data since a memory location can reside in any line within the set.
In Figure 12.8, the cache maps main memory blocks to any of four cache lines (ways) within a set. The set
index points to the group of lines, and the tag comparison determines the exact line containing the data.
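A minimal C sketch of a 4-way set-associative lookup; the sizes and structure names are illustrative assumptions, not a specific ARM cache:

#include <stdbool.h>
#include <stdint.h>

#define WAYS 4u
#define SETS 256u

struct cache_line {
    uint32_t tag;    /* cache-tag from the directory store */
    bool     valid;  /* valid status bit                   */
    uint8_t  data[16];
};

static struct cache_line cache[SETS][WAYS];

/* Return true on a hit. The loop checks each way of the selected set;
   in hardware all tag comparisons happen in parallel (CAM). */
static bool lookup(uint32_t tag, uint32_t set)
{
    for (uint32_t way = 0; way < WAYS; way++) {
        if (cache[set][way].valid && cache[set][way].tag == tag)
            return true;
    }
    return false;
}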