| CS | 340 | #4: Welcome | to ECE | | | |---------------|-------------------------|----------------------------------------------------------------------------|-------------------|-----------------|-------------| | Comput | er Systems | #4: Welcome<br>Aug. 31, 2023 · | G Carl Evans | | | | Every ( | CPU has a | et Architectur<br>a set of commar<br>Architecture or | nds it under | | | | 1. | | | | | | | 2. | | | | | | | | | the function of the: | | | – one could | | CPU R with th | e hardwa<br>⁄ired to th | s<br>are on-dye modure performing to<br>the ADD circuit;<br>all CPU operat | the CPU ope<br>). | erations (ex: t | they're | | Three l | key ideas | to know about | CPU registe | ers: | | | 1. | [Size]: | | | | | | 2. | [Speed]: | | | | | | 3. | [Limited | <b>ւ]։</b> | | | | • **x64** (ex: Intel, AMD): • **ARMv8** (ex: Apple M1/M2, Cell Phones): **CPU Register Names**The 16 general purpose x64 CPU registers have names based on how many bits you're working with: | | 64-bits | 32-bits | 16-bits | 8-bits | |---|---------|---------|---------|--------| | 0 | %rax | %eax | %ax | %al | | 1 | %rbx | %ebx | %bx | %b1 | | 2 | %rcx | %ecx | %cd | %c1 | | 3 | %rdx | %edx | %dx | %d1 | | 4 | %rsi | %esi | %si | %sil | | 5 | %rdi | %edi | %di | %dil | | 6 | %dbp | %ebp | %bp | %dp1 | | 7 | %rsp | %esp | %sp | %spl | | | | | | | ## **Instruction Sets** Every ISA defines a set of instructions that a CPU can execute: | Move: | MOV, XCHG, PUSH, POP, | | | |-------------------------|-------------------------------|--|--| | Arithmetic (int): | ADD, SUB, MUL, DIV, NEG, CMP, | | | | Logic: | AND, OR, XOR, SHR, SHL, | | | | Control Flow: | JMP, LOOP, CALL, RET, | | | | <b>Synchronization:</b> | LOCK, | | | | Floating Point: | FADD, FSUB, FMUL, FDIV, FABS, | | | | ARM processors have significantly fewer | instructions and are known | |------------------------------------------|-----------------------------| | as | while x64 processors have a | | greater set of instructions and known as | | **Q:** Advantages of RISC / CISC? ## **CPU Instruction in a Real Program** ``` 04.c 1 #include <stdio.h> 2 3 int main() { 4 int a = 0; 5 a = a + 3; 6 a = a - 2; 7 a = a * 4; 8 a = a / 2; 9 a = a * 5; 10 printf("Hi"); 11 a = a * 479; 12 return a; 13 } ``` To compile a program without optimizations and references back to the original code, the "debug" flag is required: ``` $ gcc -g 04.c ``` Then, we can dump the output object in a human readable format: This result of this command shows **EVERY** operation that the CPU will execute when running the program! The operations that correspond to the main() function are organized to the right $(\Rightarrow)$ . One Special Register: | | 04.c | gcc -g 04.c<br>objdump -d ./a.out | | |----|-------------------------|------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------| | 3 | <pre>int main() {</pre> | f3 0f 1e fa<br>55<br>48 89 e5<br>48 83 ec 10 | endbr64 push %rbp mov %rsp,%rbp sub \$0x10,%rsp | | 4 | int a = 0; | c7 45 fc 00 00 00 00 | movl \$0x0,-0x4(%rbp) | | 5 | a = a + 3; | 83 45 fc 03 | addl \$0x3,-0x4(%rbp) | | 6 | a = a - 2; | 83 6d fc 02 | sub1 \$0x2,-0x4(%rbp) | | 7 | a = a * 4; | c1 65 fc 02 | shll \$0x2,-0x4(%rbp) | | 8 | a = a / 2; | 8b 45 fc<br>89 c2<br>c1 ea 1f<br>01 d0<br>d1 f8<br>89 45 fc | mov -0x4(%rbp),%eax<br>mov %eax,%edx<br>shr \$0x1f,%edx<br>add %edx,%eax<br>sar %eax<br>mov %eax,-0x4(%rbp) | | 9 | a = a * 5; | 8b 55 fc<br>89 d0<br>c1 e0 02<br>01 d0<br>89 45 fc | mov -0x4(%rbp), %edx<br>mov %edx, %eax<br>sh1 \$0x2, %eax<br>add %edx, %eax<br>mov %eax, -0x4(%rbp) | | 10 | printf("Hi"); | 48 8d 3d f0 0d 00 00<br># 20<br>b8 00 00 00 00<br>e8 42 fe ff ff | lea 0xdf0(%rip),%rdi<br>004 <_IO_stdin_used+0x4><br>mov \$0x0,%eax<br>callq 1060 <printf@plt></printf@plt> | | 11 | a = a * 479; | 8b 45 fc<br>69 c0 df 01 00 00<br>89 45 fc | mov -0x4(%rbp),%eax<br>imul \$0x1df,%eax,%eax<br>mov %eax,-0x4(%rbp) | ## **Operation Timings** Q: Do all operations take the same amount of time on the CPU? Q: What are the CPU timings for various operations?