Toggle light / dark theme

AI Misbehavior Is No Longer Confined to the Lab

Further Reading.
Thumbail original image used credit: Adobe Stock Image.
Graph from: Scheming in the wild: detecting real-world AI scheming incidents with open-source intelligence.

Shutdown resistance in reasoning models.
https://palisaderesearch.org/blog/shu

Natural emergent misalignment from reward hacking in production RL
https://arxiv.org/html/2511.18397v1
Scheming in the wild: detecting real-world AI scheming incidents with open-source intelligence.
https://arxiv.org/abs/2604.

[CRITICAL Security Issue/Bug] Plan mode restrictions bypassed when spawning sub-agents #6527
https://github.com/anomalyco/opencode

#explained.
#science #artificialintelligence #tech #misalignment

Future technologies ranked by how real they actually are

Enhance your AI work quality with Mammouth for only $10/month at http://mammouth.ai.

Sciencephile Merch: https://crowdmade.com/collections/sci

Support me at Patreon: / sciencephiletheai.

Facebook: / sciencephile.

Twitter: / sciencephile_

Reddit: / sciencephiletheai.

How Do You Align An AI Mind With A Human Brain?

Further Reading.

Images used in Thumbail credit: MEG Image: https://www.researchgate.net/publicat

Brain v AI picture: Great Learning.

Papers used in video and related topics:

Language models align with brain regions that represent concepts across modalities.
https://arxiv.org/abs/2508.11536v1

The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and Modalities.

What can a neuron compute

They weren’t just tuning the strength of the incoming signals (the synapses); they were actually training the neuron on *where* those signals should land on its branchy “tree” to get the best results.


Cortical pyramidal neurons possess elaborate dendritic trees with diverse nonlinear membrane conductances and thousands of plastic synapses, suggesting substantial computational capabilities at the single-cell level. Yet, what can a neuron compute remains an open question, largely due to the lack of a systematic framework to quantify its computational capabilities. We introduce TwinProp, a digital-twin-based backpropagation algorithm that enables gradient-based optimization of synaptic strengths and dendritic locations in detailed neuron models via a millisecond-accurate deep neural network (DNN). Using TwinProp, we demonstrate that a detailed model of rat layer 5 pyramidal cell (L5PC) can perform naturalistic image and audio classification tasks at a remarkably high accuracy, significantly surpassing perceptron and leaky integrate-and-fire baselines. The same neuron solves high-dimensional nonlinear problems, including exclusive-or (XOR), 10-bit parity, and random Boolean tasks, demonstrating capabilities typically attributed to multilayer networks. Mechanistically, increasing task complexity recruits distributed dendritic nonlinearities, including NMDA-and voltage-dependent mechanisms; removing these or collapsing dendritic structure markedly impairs performance. These findings identify dendrites as a substrate for high-order feature binding and position single cortical pyramidal neurons as powerful, noise-robust, general-purpose analog computational units. Our results offer testable in vivo predictions and provide a systematic framework linking cellular morpho-electrical properties to computation in both brains and artificial systems.

The authors have declared no competing interest.

ONR, N00014-24–1-2055, N00014-23–1-2051

Scientists Found Human Brain Structures Emerging Inside AI

Further reading.

Thumbnail image credit:
pstnet.com/product_category/fmri-research/
Adobe stock.
High-Level Visual Representations in the Human Brain Are Aligned With Large Language Models.

Artificial Neural Network Language Models Predict Human Brain Responses to Language Even After a Developmentally Realistic Amount of Training.
https://pubmed.ncbi.nlm.nih.gov/38645

High-Level Visual Representations in the Human Brain Are Aligned With Large Language Models.
https://www.nature.com/articles/s4225

Theory Is All You Need: AI, Human Cognition, and Causal Reasoning.
https://pubsonline.informs.org/doi/10

Disentangling the Factors of Convergence between Brains and Computer Vision Models.

Biohybrid Brain–Machine Interfaces: The Next Evolution of Human Intelligence

Brain–machine interfaces (BMIs) are no longer just science fiction; they are the gateway to a future where thought itself can interact directly with technology. These systems read the brain’s electrical activity and, in turn, stimulate neurons — forming a two-way communication link between biology and machines.

In just a few decades, BMIs have evolved from laboratory curiosities into one of the fastest-growing frontiers in science and engineering. The possibilities are staggering. In the future, neural interfaces could restore vision to the blind, enable paralyzed individuals to move again, facilitate seamless communication between human brains and artificial intelligence, and ultimately power virtual realities that are indistinguishable from the physical world.

This convergence of biology, computing, and neuroscience marks the dawn of a new era — one where the boundaries between human and machine begin to blur.

Pea-size liquid-metal pump runs robot butterfly on under 0.1 V

Engineers have invented an ingenious liquid-metal pump that could make future soft robotics and wearable devices much more portable and agile. The innovation, led by the University of Bristol and published in the journal Nature Communications, presents a low-voltage power source with the potential to transform robotic systems for a wide range of applications, from robotic legs to haptic gloves used in medical and industrial settings.

The researchers have demonstrated the varied uses of this innovative technique by creating three prototypes including robotic butterfly wings, a color-changing bracelet, and a haptic fingertip pouch connected to an adjustable wristband which squeezes to simulate natural tactile sensations.w.

Current technologies are powered by bulky compressors or rigid pumps, which limit mobility and flexibility. The small lightweight soft pump—the size of a pea—is powered by liquid metal, which converts electrical energy into fluid motion, creating an efficient, compact power source for next-generation soft robots and adaptive materials such as medical devices and wearable interfaces for virtual reality.

How a shape-shifting tiny rover inspired by Japanese toys autonomously explored the moon

Moon missions come in all shapes and sizes, from car-sized rovers packed with scientific equipment to towering rocket payloads—and now, a small, shape-shifting machine that is about the size of the average palm.

When the Japanese Smart Lander for Investigating Moon (SLIM) touched down on the lunar surface in 2024, a small rover called LEV-2 (nicknamed SORA-Q) rolled out and explored autonomously for nearly two hours. And now, with the publication of a paper in the journal Science Robotics, we are discovering just how this tiny machine navigated the terrain, made its own decisions and what it found.

The advantages of tiny rovers for space exploration include relatively low development costs, lightweight design and the ability to fit into a crowded spacecraft. But building tiny comes with many challenges.

Waymo unveils virtual driver model to test autonomous car crash avoidance

Autonomous vehicles are already a reality on some of our streets and could become a major part of future transportation systems. Safety, of course, is the main concern, as with all vehicles. To help evaluate and improve its autonomous driving technology, U.S. driverless vehicle company Waymo has created a virtual representation of human driver behavior in near-crash situations.

Human drivers avoid collisions by instantly perceiving a hazard, deciding how to react and then executing the maneuver. It all happens in a split second thanks to the central and peripheral nervous systems working together harmoniously.

Currently, testing and training for collision avoidance involve several systems, and each often tests only a specific scenario or metric. For example, one system might only look at what happens when a lead vehicle brakes suddenly. They do not capture the whole sequence of events from detection to actual avoidance.

/* */