Please disable Adblockers and enable JavaScript for domain CEWebS.cs.univie.ac.at! We have NO ADS, but they may interfere with some of our course material.

Tutorien

Systemsoftware

Linux/Unix

SSH

Programmierung

Matrix Memory Order

C

C++

Fortran

Python

Mixed Language Programming

Compiler

Makefiles

Libraries

BLAS

LAPACK

PAPI -- Performance Messung

Parallele Programmierung

Optimieren

«http://www.intel.com/content/www/us/en/architecture-and-technology/64-ia-32-architectures-optimization-manual.html» 

Matrix Multiplikation Optimieren

Schleifen vertauschen

«http://en.wikipedia.org/wiki/Loop_interchange» 

Blocken

«http://en.wikipedia.org/wiki/Loop_tiling» 

Schleifen ausrollen

«http://en.wikipedia.org/wiki/Loop_unwinding» 

Matrizen transponieren

Blöcke packen

Memory aligning

«http://en.wikipedia.org/wiki/Data_structure_alignment» 

SIMD Operationen (SSE, AVX)

«http://en.wikipedia.org/wiki/Simd» 
«http://software.intel.com/sites/products/documentation/doclib/iss/2013/compiler/cpp-lin/index.htm#GUID-7478B278-2240-44D8-B396-1DC508E3656E.htm» 

Autotuning

«http://www1.icsi.berkeley.edu/~bilmes/phipac/» 
«http://portal.acm.org/citation.cfm?id=1356053» 
«http://suif.stanford.edu/papers/lam-asplos91.pdf» 

How To

«http://wiki.cs.utexas.edu/rvdg/HowToOptimizeGemm/» 

Gute Beschreibung wie GOTO BLAS funktioniert

«http://www.cs.utexas.edu/~flame/pubs/GotoTOMS_revision.pdf» 
Letzte Änderung: 25.04.2016, 13:13 | 232 Worte