Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN
Preprint   Open access

Pengxiang Li, Lu Yin, John Collomose and Shiwei Liu
18/12/2024

Abstract

Computer Science - Artificial Intelligence; Computer Science - Learning
URL
https://doi.org/10.48550/arXiv.2412.13795
Preprint (Author's original), CC BY 4.0, Open
