
Posted in r/singularity by u/rationalkat • 91 points and 44 comments
![[Meta] MoCha: Towards Movie-Grade Talking Character Synthesis](https://lazysoci.al/api/v3/image_proxy?url=https%3A%2F%2Fexternal-preview.redd.it%2FNWFzN3JqNXJ0N3NlMZv2AQVK9UqjwfLOAOKWrnZlbYsog5cKrFk9ZDKbMyQx.png%3Foverlay-align%3Dbottom%2Cleft%26crop%3D976%3A510.994764398%2Csmart%26overlay-height%3D15p%26overlay%3D%252Fwatermark%252Ft5_2qh8m.png%253Fs%253D98f09e1133f8405990faedd67a27814bb6b5cdb9%26width%3D976%26height%3D510.994764398%26auto%3Dwebp%26s%3D71443bcf1a43d556aab975a778204dfc85086f09&format=webp)
Posted in r/singularity by u/rationalkat • 91 points and 44 comments
The original was posted on /r/singularity by /u/rationalkat on 2025-04-01 12:16:32+00:00.
Memory Layers at Scale
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, sparsely activated memory layers complement compute-heavy dense feed-forward layers, providing dedicated capacity to store and retrieve information cheaply. This work t...
The original was posted on /r/singularity by /u/rationalkat on 2025-01-03 15:59:11+00:00.
Apollo: An Exploration of Video Understanding in Large Multimodal Models.
Despite the rapid integration of video perception capabilities into Large Multimodal Models (LMMs), the underlying mechanisms driving their video understanding remain poorly understood. Consequently, many design decisions in this domain are made without proper justification or analysis. The high com...
The original was posted on /r/singularity by /u/rationalkat on 2024-12-16 11:27:27+00:00.
Coconut (Chain of Continuous Thought): Training Large Language Models to Reason in a Continuous Latent Space
Large language models (LLMs) are restricted to reason in the "language space", where they typically express the reasoning process with a chain-of-thought (CoT) to solve a complex reasoning problem. However, we argue that language space may not always be optimal for reasoning. For example, most word ...
The original was posted on /r/singularity by /u/rationalkat on 2024-12-10 12:32:20+00:00.
Time to replace the weekly discussion thread? It's 19 days old
The original was posted on /r/singularity by /u/torb on 2024-09-01 12:51:57+00:00.
Not that Meta. Can we please get a 2024 predictions thread?
The original was posted on /r/singularity by /u/DungeonsAndDradis on 2023-12-28 18:21:34.
Can we have a predictions thread for 2024?
Here was the 2023 predictions thread: