Deep learning, ML, various models
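DeepGEMM is a library focused on efficient FP8 general matrix multiplication (GEMM). It supports both normal GEMMs and mixture-of-experts (MoE) grouped GEMMs, and can dynamically optimize resource allocation to improve compute efficiency. The library is written in CUDA and uses a lightweight just-in-time (JIT) module that compiles kernels at runtime, so no kernels need to be precompiled at installation.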
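To make "MoE grouped GEMM" concrete, here is a minimal reference sketch in plain Python. It is not DeepGEMM's API and uses no FP8, CUDA, or JIT. The function names `matmul` and `grouped_gemm`, and the layout in which tokens are stored contiguously by expert, are assumptions made for illustration only. It only shows the computation a grouped GEMM performs: each group of rows is multiplied by its own expert weight matrix.

```python
from typing import List

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain dense GEMM: (m x k) @ (k x n) -> (m x n)."""
    k = len(b)
    n = len(b[0])
    assert all(len(row) == k for row in a), "inner dimensions must match"
    return [[sum(row[p] * b[p][j] for p in range(k)) for j in range(n)] for row in a]


def grouped_gemm(tokens: Matrix, group_sizes: List[int], weights: List[Matrix]) -> Matrix:
    """Grouped GEMM over an (assumed) contiguous token layout.

    Rows of `tokens` are sorted by expert; `group_sizes[i]` rows belong to
    expert i and are multiplied by `weights[i]`.
    """
    assert len(group_sizes) == len(weights), "one weight matrix per group"
    assert sum(group_sizes) == len(tokens), "group sizes must cover all rows"
    out: Matrix = []
    start = 0
    for size, w in zip(group_sizes, weights):
        # Each expert only sees its own slice of rows.
        out.extend(matmul(tokens[start:start + size], w))
        start += size
    return out


# Usage: three tokens, the first two routed to expert 0, the last to expert 1.
tokens = [[1, 2], [3, 4], [5, 6]]
weights = [
    [[1, 0], [0, 1]],  # expert 0: identity
    [[2, 0], [0, 2]],  # expert 1: scale by 2
]
result = grouped_gemm(tokens, [2, 1], weights)
```

A GPU library would fuse these per-expert products into a single kernel launch instead of looping on the host; the loop here is only meant to show which rows meet which weights.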