Skip to main content

Unit 4.1.2 Outline Week 4

  • 4.1 Opening Remarks

    • 4.1.1 Launch

    • 4.1.2 Outline Week 4

    • 4.1.3 What you will learn

  • 4.2 OpenMP

    • 4.2.1 Of cores and threads

    • 4.2.2 Basics

    • 4.2.3 Hello World!

  • 4.3 Multithreading Matrix Multiplication

    • 4.3.1 Lots of loops to parallelize

    • 4.3.2 Parallelizing the first loop around the micro-kernel

    • 4.3.3 Parallelizing the second loop around the micro-kernel

    • 4.3.4 Parallelizing the third loop around the micro-kernel

    • 4.3.5 Parallelizing the fourth loop around the micro-kernel

    • 4.3.6 Parallelizing the fifth loop around the micro-kernel

    • 4.3.7 Discussion

  • 4.4 Parallelizing More

    • 4.4.1 Speedup, efficiency, etc.

    • 4.4.2 Ahmdahl's law

    • 4.4.3 Parallelizing the packing

    • 4.4.4 Parallelizing multiple loops

  • 4.5 Enrichments

    • 4.5.1 Casting computation in terms of matrix-matrix multiplication

    • 4.5.2 Family values

    • 4.5.3 Matrix-matrix multiplication for Machine Learning

    • 4.5.4 Matrix-matrix multiplication on GPUs

    • 4.5.5 Matrix-matrix multiplication on distributed memory architectures

  • 4.6 Wrap Up

    • 4.6.1 Additional exercises

    • 4.6.2 Summary