Hey. I'm starting to try to implement M2M in vLLM and noticed that all the currently supported models are causal language models (decoder-only), while M2M is a sequence-to-sequence model with an encoder-decoder architecture. Is it still possible to implement a vLLM version of M2M, or should I just give up 🥲 Appreciate any help with this : )
I mean specifically M2M-100, the translation model by Meta AI (arXiv paper; 418M-params model card on HF; 1.2B-params model card on HF). Sorry for not being clear enough previously. I'm mostly concerned that encoder-decoder models include a decoder with a separate cross-attention over the encoder outputs, so I'm wondering if it's still feasible.
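For what it's worth, the cross-attention concern can be sketched at the shape level. The following is a hypothetical NumPy illustration (not vLLM or M2M-100 code, and `enc_out`/`dec_cache` are made-up names): a decoder-only model runs one attention per step over its growing KV cache, while a seq2seq decoder additionally runs cross-attention whose keys/values come from the encoder and stay fixed during decoding — which is the extra state a vLLM port would have to manage.

```python
# Hypothetical shape-level sketch (random values, no learned weights):
# contrasts the attention paths of decoder-only vs. encoder-decoder decoding.
import numpy as np

def attention(q, k, v):
    # Scaled dot-product attention: softmax(q k^T / sqrt(d)) v
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v

rng = np.random.default_rng(0)
d = 8
enc_out = rng.standard_normal((5, d))    # fixed encoder states (source tokens)
dec_cache = rng.standard_normal((3, d))  # growing decoder KV cache (target prefix)
q = rng.standard_normal((1, d))          # query for the current decode step

# Decoder-only model: a single self-attention over the causal KV cache.
self_attn = attention(q, dec_cache, dec_cache)

# Encoder-decoder model: self-attention PLUS cross-attention whose K/V are
# the encoder outputs and never grow as decoding proceeds.
cross_attn = attention(q, enc_out, enc_out)

print(self_attn.shape, cross_attn.shape)  # → (1, 8) (1, 8)
```

So the decoding loop itself is still autoregressive; the difference is purely the second, fixed-size attention context per layer.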