Liyuan Liu (Lucas)
Liyuan Liu
Home
Goal
Publications
Selected
Full List
Honors
Highlights
All Honors
Activities
Services
Experience
Contact
Blog
CV
Reasoning Model
Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Recent advances in language modeling have demonstrated the effectiveness of State Space Models (SSMs) for efficient sequence modeling. While hybrid architectures such as Samba and the decoder-decoder architecture, YOCO, have shown promising …
Cite
×