Subject description - AE4M36PAP

Summary of Study | Summary of Branches | All Subject Groups | All Subjects | List of Roles | Explanatory Notes               Instructions
AE4M36PAP Advanced Computer Architectures
Roles:  Extent of teaching:2P+2C
Department:13136 Language of teaching:EN
Guarantors:  Completion:Z,ZK
Lecturers:  Credits:6
Tutors:  Semester:Z

Web page:

https://cw.fel.cvut.cz/wiki/courses/a4m36pap

Anotation:

This course extends knowledge of modern computer architecture. Mainly the architecture of nowadays processors utilizing instruction and/or thread level parallelism and advanced pipelining is in the center of our attention. A special emphasis will be devoted to the implementation of parallelism in hardware, parallel program design, and advanced instruction scheduling and execution.

Study targets:

The course increases the knowledge in the field of modern computer architecture.

Course outlines:

1. Introduction
(Control flow computers and Data flow computers. Flynn's taxonomy. Parallel processing. Amdahl's and Gustavson's law. Metrics.)
2. From scalar processors to superscalar processors - basic organization of superscalar processor
(Static, dynamic and hybrid scheduling of the instruction flow)
3. Superscalar techniques I - Register data flow
(Register renaming - Tomasulo's algorithm. Data speculation. Precise exception support.)
4. Superscalar techniques II - Instruction flow, speculation
(Prediction and Predictors, Branch misprediction recovery)
5. Superscalar techniques III - Memory data flow; VLIW and EPIC
(Load bypassing and Load forwarding. Load speculation. Data parallelism, SIMD and vector instructions in ISA; Loop-unrolling and software pipelining.)
6. Memory subsystem
(Non-blocking cache, Victim cache, Virtual memory and cache)
7. Multiprocessor systems and Memory coherence
(Distributed memory systems and Shared memory systems. Symmetric multiprocessor systems (SMP). Coherence in SMP)
8. Multiprocessor systems and Memory consistency
(Rules for memory operations, Sequential consistency and other consistency models.)
9. Programming of parallel systems I - Data consistency on multiprocessor system
(Parallel computing concepts, Programming issues, Parallel programming paradigms, Message Passing Interface and Open Multi-Processing)
10. Programming of parallel systems II
(Synchronization)
11. I/O subsytem
(PCIe, HyperTransport, QuickPathInterconnect)
12. MPP and clusters, interconnection networks
13. Architecture perspectives

Exercises outline:

1. Introduction, semester project assignment
2. Instruction set and compilation
3. Pipelined processor simulator
4. Optimization algorithms for pipelined processor
5. Superscalar processor simulator
6. Intro of semester projects presentation
7.-11.  Semester projects processing
12.-14.  Semester projects presentation and defense

Literature:

1. Hennesy, J. L., Patterson, D. A.: Computer Architecture : A Quantitative Approach, Third Edition, San Francisco, Morgan Kaufmann Publishers, Inc., 2002
2. Shen, J.P., Lipasti, M.H.: Modern Processor Design : Fundamentals of Superscalar Processors, First Edition, New York, McGraw-Hill Inc., 2004
3. Grama A., Gupta, A. et al.: Introduction to Parallel Computing, Second Edition, Addison Wesley, 2003

Requirements:

Subject is included into these academic programs:

Program Branch Role Recommended semester


Page updated 28.3.2024 07:52:32, semester: Z/2024-5, Z,L/2023-4, Send comments about the content to the Administrators of the Academic Programs Proposal and Realization: I. Halaška (K336), J. Novák (K336)