Questions tagged [1.3b]
1 question
1 vote
0 answers
DeepSpeed: no operator matches operands error
When I try to use the DeepSpeed example to fine-tune an OPT 1.3b model on my local machine, I get an unexpected error, which is related to the following code snippet:
template <typename T>
__global__ void moe_res_matmul(T* residual, T* coef, T* mlp_out, int…

coderLMN
- 3,076
- 1
- 21
- 26