Released mCemm, a GEMM (General Matrix Multiply) kernel generator for Apple Metal. It generates optimized Metal shaders with configurable tile sizes, warp sizes, data types (f16/f32), transpose modes (NN/NT/TN/TT), activations (ReLU/GELU/SiLU), bias, and more: https://github.com/MetalLikeCuda/mCemm

Fixed installation in https://github.com/Okerew/osxiec and https://github.com/MetalLikeCuda/gpumkat

Released https://github.com/MetalLikeCuda/awesome-mlc, a community-maintained list of Metal tools, frameworks, libraries, and resources.
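
To make the mCemm option list concrete, here is a rough CPU-side sketch in Python of the math that a GEMM with a transpose mode, bias, and activation typically computes. It is not mCemm's API or its generated shader code: the name `gemm_reference`, its parameters, the per-column bias, and the erf-based GELU are all assumptions made for illustration. A reference like this can be handy for checking GPU output on small matrices.

```python
import math

# Hypothetical reference helpers, not part of mCemm.
def relu(x):
    return max(0.0, x)

def silu(x):
    return x / (1.0 + math.exp(-x))

def gelu(x):
    # Exact erf form (an assumption; kernels often use a tanh approximation).
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))

ACTIVATIONS = {"none": lambda x: x, "relu": relu, "gelu": gelu, "silu": silu}

def transpose(m):
    return [list(row) for row in zip(*m)]

def gemm_reference(a, b, mode="NN", bias=None, activation="none"):
    """Compute act(op(A) @ op(B) + bias) on nested lists.

    mode: two letters, one per operand; "T" means that operand is passed
    in transposed form and is transposed back before multiplying.
    bias: optional list with one value per output column (assumed layout).
    """
    if mode[0] == "T":
        a = transpose(a)
    if mode[1] == "T":
        b = transpose(b)
    m, k, n = len(a), len(b), len(b[0])
    if any(len(row) != k for row in a):
        raise ValueError("inner dimensions do not match")
    act = ACTIVATIONS[activation]
    out = []
    for i in range(m):
        row = []
        for j in range(n):
            acc = sum(a[i][p] * b[p][j] for p in range(k))
            if bias is not None:
                acc += bias[j]
            row.append(act(acc))
        out.append(row)
    return out
```

For example, `gemm_reference(a, b_t, mode="NT", bias=bias, activation="relu")` multiplies `a` by the transpose of `b_t`, adds `bias` to each row, and clamps negatives to zero.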