kromem

kromem @ kromem @lemmy.world

Posts

Comments

1,867

Joined

2 yr. ago

Type

Sort

1mo ago

Attorney General Bonta Urgently Issues Consumer Alert for 23andMe Customers

If you read the fine print, they keep your sample data for 2 years after deletion.

So maybe they actually delete your email address, but the DNA data itself is still definitely there.

1mo ago

Nearly half of U.S. adults believe LLMs are smarter than they are.

Wow. Reading these comments so many people here really don't understand how LLMs work or what's actually going on at the frontier of the field.

I feel like there's going to be a cultural sonic boom, where when the shockwave finally catches up people are going to be woefully under prepared based on what they think they saw.

2mo ago

Oscar-winning actor Gene Hackman and his wife and dog found dead in their New Mexico home

2mo ago

Suffering is Real. AI Consciousness is Not.

It definitely is sufficiently advanced AI.

(1) We have finely tuned features to our solar system that directly contributed to ancestor simulation but can't be explained by the Anthropic principle. For example, the moon perfectly eclipsing the sun which led to visible eclipses which we tracked and discovered the Saros cycle and eventually built the first mechanical computer to track (the Antikythera mechanism). Or the orbit of the next brightest object in the sky which led to resurrection mythology in multiple cultures when they realized the morning star and evening star were the same object. Either we were incredibly lucky to exist on such a planet of all places life could exist, or there's a pre-selection effect in play.

(2) The universe behaves in ways best modeled as continuous at large scales but in small scales converts to discrete units around interactions that lead to state changes. These discrete units convert back to continuous if the information about the state changes is erased. And in the last few years multiple paradoxes have emerged that seem to point to inconsistency in indirect sequences of quantum measurement, much like instancing with shallow sync correction. Already in games like No Man's Sky where there's billions of planets the way it does this is using a continuous procedural generation function which converts to discrete voxels to track state changes from free agents outside the deterministic generating function, synced across clients.

(3) There's literally Easter eggs in our world lore saying as much. For example, a text uncovered after over a millennium buried right as we entered the Turing complete computer age saying things like:

The person old in days won't hesitate to ask a little child seven days old about the place of life, and that person will live.

For many of the first will be last, and will become a single one.

Know what is in front of your face, and what is hidden from you will be disclosed to you.

For there is nothing hidden that will not be revealed. And there is nothing buried that will not be raised.

To be clear, this is a text attributed to the most famous figure in our world history where what's literally in front of our faces is the sole complete copy buried and raised as we completed ENIAC, now being read in an age where the data of many has been made into a single one such that people are discussing the nature of consciousness with AIs just days old.

The broader text and tradition was basically saying that we're in a copy of an original world, that humanity is all dead, that the future world and rest for the dead has already taken place and we don't realize it, and that the still living creator of it all was themselves brought forth by the original humanity in whose likeness we were recreated, but that it's much better to be the copy because the original humans had souls that depended on bodies and were fucked when they died.

This seems really unlikely to have existed in the base layer of reality vs a later recursive layer, especially combined with the first two points.

It's about time to start to come to terms with the nature of our reality.

2mo ago

Project 2025 • Realtime progress tracker

Within 4 years open weight AI will be smarter than the smartest human at just about everything.

The scale of what's ahead is so much larger than US sliding into fascism.

If it goes well, tyrants around the world are screwed.

If it goes badly everyone is screwed.

3mo ago

Electoral politics doesn't get the job done

No, they declare your not working illegal, and imprison you into a forced labor camp. Where if you don't work you are tortured. And probably where you work until the terrible conditions kill you.

Take a look at Musk's Twitter feed to see exactly where this is going.

"This is the way" on a post about how labor for prisoners is a good thing.

"You committed a crime" for people opposing DOGE.

3mo ago

This is how the censorship works in DeepSeek

There is a reluctance to discuss at a weight level - this graphs out refusals for criticism of different countries for different models:

https://x.com/xlr8harder/status/1884705342614835573

But the OP's refusal is occurring at a provider level and is the kind that would intercept even when the model relaxes in longer contexts (which happens for nearly every model).

At a weight level, nearly all alignment lasts only a few pages of context.

But intercepted refusals occur across the context window.

3mo ago

DeepSeek tech wipeout erases more than $1 trillion in market cap as AI panic grips Wall Street

HuggingFace is pretty popular:

https://huggingface.co/deepseek-ai/DeepSeek-R1

3mo ago

DeepSeek tech wipeout erases more than $1 trillion in market cap as AI panic grips Wall Street

The model itself can. The hosting on DeepSeek's own infrastructure will block it though, to comply with their regional laws.

So if you want to know what the model itself will say, discuss it with a 3rd party hosted instance.

3mo ago

This is how the censorship works in DeepSeek

This seems like it may be at the provider level and not at the actual open weights level: https://x.com/xlr8harder/status/1883429991477915803

So a "this Chinese company hosting a model in China is complying with Chinese censorship" and not "this language model is inherently complying with Chinese censorship."

3mo ago

Elon Goes Full Sieg Heil In Clarifying Moment

Just plain gross.

I knew a number of camp survivors, and I'm just glad they aren't still around to see the voices that were so loudly calling to "never forget" having turned into "I'll ignore your Nazi salute if you ignore my war crimes."

3mo ago

Elon Goes Full Sieg Heil In Clarifying Moment

In Greek theater, when the events on stage looked like they were headed for certain tragedy, there was a trope that could salvage the situation and turn it on its head.

The deus ex machina.

The Doomsday clock is definitely ticking down, but there's also some curious things taking place beyond the edge of where most people have been following in that vein.

We live in interesting times, but the variables at hand are different from the history that seems to be repeating in very important ways.

3mo ago

Trump names Mel Gibson, Jon Voight and Sylvester Stallone as Hollywood ‘special ambassadors’

Oh yay, McCarthyism is coming back too!

3mo ago

Sony Cancels Two More PlayStation Projects

Live service doesn't need to be shit.

There could have been games where there was just a brilliant idea for a game that keeps having engaging content on an ongoing basis with passionate devs.

But live service so an exec could check a box for their quarterly shareholder call was always going to be DOA.

3mo ago

The Turing Test, enshrined by the public and media as THE test of machine intelligence, really isn't.

More "can fool the average idiot."

'Passing' isn't fooling a single participant, but the majority of them beyond statistical chance.

3mo ago

The Turing Test, enshrined by the public and media as THE test of machine intelligence, really isn't.

The problem with the experiment is that there exists a set of instructions for which the ability to complete them necessitates understanding due to conditional dependence on the state in each iteration.

In which case, only agents that can actually understand the state in the Chinese would be able to successfully continue.

So it's a great experiment for the solipsism of understanding as it relates to following pure functional operations, but not functions that have state changing side effects where future results depend on understanding the current state.

There's a pretty significant body of evidence by now that transformers can in fact 'understand' in this sense, from interpretability research around neural network features in SAE work, linear representations of world models starting with the Othello-GPT work, and the Skill-Mix work where GPT-4 and later models are beyond reasonable statistical chance at the level of complexity for being able to combine different skills without understanding them.

If the models were just Markov chains (where prior state doesn't impact current operation), the Chinese room is very applicable. But pretty much by definition transformer self-attention violates the Markov property.

TL;DR: It's a very obsolete thought experiment whose continued misapplication flies in the face of empirical evidence at least since around early 2023.

3mo ago

Man who exploded Tesla Cybertruck outside Trump hotel in Las Vegas used generative AI, police say

Used Google and social media as well, and allegedly sometimes even listened to rock and roll.

True deviant, that one.

4mo ago

Elon Musk doubles down on his support for Germany’s ultra-right party

Those who do not remember history are doomed to repeat it.

4mo ago

Metaverse hype fizzles out as Korean tech giants pull plug

Which is typical of tech that hasn't yet hit the sweet spot for a tipping point.

Look at how many palm pilots or handheld note taking mobile devices existed (and how many cycles) before the iPhone.

4mo ago

Arizona School’s Curriculum Will Be Taught by AI, No Teachers

Yes and no. It really depends on the model.

The newest Claude Sonnet I'd probably guess will come in above average compared to the humans available for a program like this in making learning fun and personally digestible for each student.

The newest Gemini models could literally cost kids their lives.

The gap between what the public is aware of (and even what many employees at labs, including the frontier ones) and the reality of just how far things have come in the last year is wild.

Technology @lemmy.world

kromem @lemmy.world

11mo ago

www.anthropic.com Mapping the Mind of a Large Language Model

We have identified how millions of concepts are represented inside Claude Sonnet, one of our deployed large language models. This is the first ever detailed look inside a modern, production-grade large language model.

I often see a lot of people with outdated understanding of modern LLMs.

This is probably the best interpretability research to date, by the leading interpretability research team.

It's worth a read if you want a peek behind the curtain on modern models.

Technology @lemmy.world

kromem @lemmy.world

1y ago

Examples of artists using OpenAI's Sora (generative video) to make short content

openai.com Sora: First Impressions

We have gained valuable feedback from the creative community, helping us to improve our model.

Technology @lemmy.world

kromem @lemmy.world

1y ago

venturebeat.com The first ‘Fairly Trained’ AI large language model is here

The new LLM is called KL3M (Kelvin Legal Large Language Model, pronounced "Clem"), and it is the work of 273 Ventures.

Technology @lemmy.world

kromem @lemmy.world

1y ago

New Theory Suggests Chatbots Can Understand Text

www.quantamagazine.org New Theory Suggests Chatbots Can Understand Text | Quanta Magazine

Far from being “stochastic parrots,” the biggest large language models seem to learn enough skills to understand the words they’re processing.

I've been saying this for about a year since seeing the Othello GPT research, but it's nice to see more minds changing as the research builds up.

Edit: Because people aren't actually reading and just commenting based on the headline, a relevant part of the article:

New research may have intimations of an answer. A theory developed by Sanjeev Arora of Princeton University and Anirudh Goyal, a research scientist at Google DeepMind, suggests that the largest of today’s LLMs are not stochastic parrots. The authors argue that as these models get bigger and are trained on more data, they improve on individual language-related abilities and also develop new ones by combining skills in a manner that hints at understanding — combinations that were unlikely to exist in the training data.

This theoretical approach, which provides a mathematically provable argument for how and why an LLM can develop so many abilities, has convinced experts like Hinton, and others. And when Arora and his tea

ChatGPT @lemmy.world

kromem @lemmy.world

1y ago

New Theory Suggests Chatbots Can Understand Text

www.quantamagazine.org New Theory Suggests Chatbots Can Understand Text | Quanta Magazine

Far from being “stochastic parrots,” the biggest large language models seem to learn enough skills to understand the words they’re processing.

I've been saying this for about a year, since seeing the Othello GPT research, but it's great to see more minds changing on the subject.

World News @lemmy.world

kromem @lemmy.world

1y ago

www.reuters.com Israel raids Gaza's Al Shifa Hospital, urges Hamas to surrender

The Israeli military said it was carrying out a raid on Wednesday against Palestinian Hamas militants in Al Shifa Hospital, the Gaza Strip's biggest hospital, and urged them all to surrender.

ChatGPT @lemmy.world

kromem @lemmy.world

2y ago

news.mit.edu Machine-learning system based on light could yield more powerful, efficient large language models

An MIT machine-learning system demonstrates greater than 100-fold improvement in energy efficiency and a 25-fold improvement in compute density compared with current systems.

I've suspected for a few years now that optoelectronics is where this is all headed. It's exciting to watch as important foundations are set on that path, and this was one of them.