References
==========

1. https://docs.nvidia.com/deeplearning/performance/dl-performance-gpu-background/index.html
2. https://www.nvidia.com/content/PDF/fermi_white_papers/NVIDIA_Fermi_Compute_Architecture_Whitepaper.pdf
3. https://www.sciencedirect.com/science/article/abs/pii/B978012800979600010X
4. https://developer.download.nvidia.com/CUDA/training/StreamsAndConcurrencyWebinar.pdf
5. https://mpitutorial.com

Contributors
************

1. Joseph John, Staff Scientist, NCI

*ChatGPT was used to enhance and generate text in this document.*