The use of accelerators such as GPUs (and similar devices) should be mentioned here as well, since they can be considered part of the distributed-memory category.
* PGAS languages (UPC, Coarray Fortran), implemented by the compiler
* Auto-parallelization and vectorisation by the compiler (SM)
* The Pthreads library (SM)

