Training Language Models via Neural Cellular Automata
arXiv:2603.10055v1. Abstract: Pre-training is crucial for large language models (LLMs), as it is when most representations and capabilities are acquired. However, natural …
Dan Lee, Seungwook Han, Akarsh Kumar, Pulkit Agrawal