
Microsoft DeBERTa surpasses puny humans in SuperGLUE reading comprehension test
by Surur
3 weeks
There has been massive progress recently in training neural networks with millions of parameters. Microsoft has now updated its DeBERTa (Decoding-enhanced BERT with disentangled attention) model by training a larger version that consi...