Questions tagged [1.3b]

1 question
1 vote
0 answers

DeepSpeed: no operator matches operands error

When I try to use the DeepSpeed example to fine-tune an OPT 1.3b model on my local machine, I get an unexpected error, which is related to the following code snippet: template <typename T> __global__ void moe_res_matmul(T* residual, T* coef, T* mlp_out, int…
coderLMN
  • 3,076
  • 1
  • 21
  • 26