Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
en:multiasm:paarm:chapter_5_8 [2025/12/04 13:40] eriks.klavinsen:multiasm:paarm:chapter_5_8 [2025/12/04 13:50] (current) – [Bare-Metal program code] eriks.klavins
Line 59: Line 59:
  
 As for the second level, the code can be made bare-metal and does not depend on an OS. This type of code is much more complex because it heavily relies on knowledge of hardware, its address space, and related details. On the other hand, this type of code in hardware will execute much faster than the same code in the OS. That’s because the OS schedules multiple tasks, and the program can be halted for some time while other programs run. These types of codes are present on all devices with processors, whether or not the device uses an OS. The bootloader is a bare-metal program that prepares the hardware for the OS. If there is no OS, the program is designed to perform a specific task on the hardware. As for the second level, the code can be made bare-metal and does not depend on an OS. This type of code is much more complex because it heavily relies on knowledge of hardware, its address space, and related details. On the other hand, this type of code in hardware will execute much faster than the same code in the OS. That’s because the OS schedules multiple tasks, and the program can be halted for some time while other programs run. These types of codes are present on all devices with processors, whether or not the device uses an OS. The bootloader is a bare-metal program that prepares the hardware for the OS. If there is no OS, the program is designed to perform a specific task on the hardware.
-The special program runs immediately after a system reset. Taking an example of the bootloader in the Raspberry Pi. Before the bootloader loads into the memory, the very special startup binary code sets the CPU state, initialises memory and sets SP to a high address. Caches and Memory management units (MMU) are turned off by default to save energy in case something fails, but the startup script can be edited to turn them on. At the end of the startup code, the Program Counter register is set to a specified address where the main program starts (i.e., the bootloader). The bootloader checks the hardware, initialises and prepares it to work with the operating system. In this case, the bootloader and the operating system can be replaced with a bare-metal program. Program code runs directly on the processor; there is no system-call layer or stack setup unless the program code provides it. 
  
 +The special program runs immediately after a system reset. Taking an example of the bootloader in the Raspberry Pi. Before the bootloader loads into the memory, the very special startup binary code sets the CPU state, initialises memory and sets SP to a high address. Caches and Memory management units (MMUs) are turned off by default to save energy in case something fails, but the startup script can be edited to enable them. At the end of the startup code, the Program Counter register is set to a specified address where the main program starts (i.e., the bootloader). The bootloader checks the hardware, initialises and prepares it to work with the operating system. In this case, the bootloader and the operating system can be replaced with a bare-metal program. Program code runs directly on the processor; there is no system-call layer or stack setup unless the program code provides it.
 +
 +Example code to blink the LED connected to Raspberry Pi GPIO17 (pin 11 on the physical board). Theoretically, the example code should work on older Raspberry Pi versions, but since the board now contains a dedicated chip, it becomes much harder because the PCIe must be initialised, the address space must be known, and much more. Unfortunately, there is no comprehensive documentation for Raspberry Pi 5, including its address space and other details.
 +
 +<codeblock code_label>
 +<caption>Bare metal program code</caption>
 +<code>
 + .equ PERIPH_BASE,   0x40000000  @ RP1 peripheral base
 + .equ GPIO_BASE,   PERIPH_BASE + 0x0D0000  @ GPIO controller base
 + .equ GPFSEL0,     0x00  @ Function select register 0
 + .equ GPFSEL1,     0x04  @ Function select register 1
 + .equ GPSET0,      0x1C  @ Pin output set register
 + .equ GPCLR0,      0x28  @ Pin output clear register
 +
 +@ the next section defines that ongoing code is program code
 +    .section .text
 +    .global _start
 +_start:
 +    @ Initialise stack pointer
 +    LDR     X0, =stack_top
 +    MOV     SP, X0
 +
 +    @ Configure GPIO17 as output
 +    LDR     X1, =GPIO_BASE
 +    LDR     W2, [X1, #GPFSEL1]     @ Each pin takes 3 bits
 +    BIC     W2, W2, #(0x7 << 21)   @ Clear bits for GPIO17
 +    ORR     W2, W2, #(0x1 << 21)   @ Set bits to 001 (GPIO17 is output)
 +    STR     W2, [X1, #GPFSEL1]
 +
 +blink_loop:
 +    @ Set GPIO17 high
 +    MOV     W3, #(1 << 17)
 +    STR     W3, [X1, #GPSET0]
 +
 +    @ Simple delay loop
 +    MOV     X4, #0x200000  @ set value
 +delay1:
 +    SUBS    X4, X4, #1     @ decrease register value
 +    B.NE    delay1         @ if not equal to zero, repeat
 +
 +    @ Set GPIO17 low
 +    STR     W3, [X1, #GPCLR0]
 +
 +    @ Delay again
 +    MOV     X4, #0x200000
 +delay2:
 +    SUBS    X4, X4, #1
 +    B.NE    delay2
 +
 +    B       blink_loop             @ repeat 
 +
 +@Reserve space for stack
 +    .align 16
 +stack_top:
 +    .space 4096
 +
 +</code>
 +</codeblock>
 +This code must be loaded with the bootloader as “kernel8.img” at address ''0x80000''. These commands will compile and prepare the program to replace the OS:
 +  - ''as blink.s -o blink.o''
 +  - ''ld blink.o -Ttext=0x80000 -o blink.elf''
 +  - ''objcopy -O binary blink.elf kernel8.img''
 +
 +After that, the code will continue to execute until the power is switched off, the processor is RESET, or an unexpected exception occurs. Here, the code is responsible for everything, including setting up the stack pointer, enabling caches, handling interrupts, and working with devices directly at hardware addresses. This requires reviewing all hardware-related documentation. But these programs tend to be much faster and more robust than OS related programs. This is a typical way to design programs such as device firmware, RTOS, or bootloader.
 +
 +The same code can be adjusted to work as a kernel module or a driver. In such a case, the code will require much more editing and investigation into OS-related documentation. The kernel must know how often this program should execute, and it may also need to work with mutexes. This is better implemented in C, as it involves multiple runtime libraries from the Linux OS.
  
en/multiasm/paarm/chapter_5_8.1764855654.txt.gz · Last modified: 2025/12/04 13:40 by eriks.klavins
CC Attribution-Share Alike 4.0 International
www.chimeric.de Valid CSS Driven by DokuWiki do yourself a favour and use a real browser - get firefox!! Recent changes RSS feed Valid XHTML 1.0