DeepTrendLab — Top AI News & Research Aggregator

All AI Labs Business News Newsletters Research Safety Tools Sources

Berkeley AI Research 10 articles

Gradient-based Planning for World Models at Longer Horizons

🐻 Research Berkeley AI Research 14 min read

Gradient-based Planning for World Models at Longer Horizons

GRASP is a new gradient-based planner for learned dynamics (a “world model”) that makes long-horizon planning practical by (1) lifting the trajectory into virtual states so optimization is parallel across time, (2) adding stochasticity directly to the state iterates for exploration, and (3) reshaping gradients so actions get clean signals while we avoid brittle “state-input” gradients through high-dimensional vision models.…

🕐 6 days ago

Read →

Identifying Interactions at Scale for LLMs

🐻 Research Berkeley AI Research 7 min read

Identifying Interactions at Scale for LLMs

Understanding the behavior of complex machine learning systems, particularly Large Language Models (LLMs), is a critical challenge in modern artificial intelligence. Interpretability research aims to make the decision-making process more…

🕐 a month ago

Read →

Information-Driven Design of Imaging Systems

🐻 Research Berkeley AI Research 6 min read

Information-Driven Design of Imaging Systems

An encoder (optical system) maps objects to noiseless images, which noise corrupts into measurements. Our information estimator uses only these noisy measurements and a noise model to quantify how well…

🕐 3 months ago

Read →

RL without TD learning

🐻 Research Berkeley AI Research 9 min read

RL without TD learning

In this post, I’ll introduce a reinforcement learning (RL) algorithm based on an “alternative” paradigm: divide and conquer . Unlike traditional methods, this algorithm is not based on temporal difference…

🕐 5 months ago

Read →

What exactly does word2vec learn?

🐻 Research Berkeley AI Research

What exactly does word2vec learn?

What exactly does word2vec learn, and how? Answering this question amounts to understanding representation learning in a minimal yet interesting language modeling task. Despite the fact that word2vec is a…

🕐 7 months ago

Read →

🐻 Research Berkeley AI Research

Whole-Body Conditioned Egocentric Video Prediction

× Predicting Ego-centric Video from human Actions (PEVA) . Given past video frames and an action specifying a desired change in 3D pose, PEVA predicts the next video frame. Our…

🕐 9 months ago

Read →

Defending against Prompt Injection with Structured Queries (StruQ) and Preference Optimization (SecAlign)

🐻 Research Berkeley AI Research

Defending against Prompt Injection with Structured Queries (StruQ) and Preference Optimization (SecAlign)

Recent advances in Large Language Models (LLMs) enable exciting LLM-integrated applications. However, as LLMs have improved, so have the attacks against them. Prompt injection attack is listed as the #1…

🕐 1 year, 15 days ago

Read →

Repurposing Protein Folding Models for Generation with Latent Diffusion

🐻 Research Berkeley AI Research

Repurposing Protein Folding Models for Generation with Latent Diffusion

PLAID is a multimodal generative model that simultaneously generates protein 1D sequence and 3D structure, by learning the latent space of protein folding models. The awarding of the 2024 Nobel…

🕐 1 year, 18 days ago

Read →

Scaling Up Reinforcement Learning for Traffic Smoothing: A 100-AV Highway Deployment

🐻 Research Berkeley AI Research

Scaling Up Reinforcement Learning for Traffic Smoothing: A 100-AV Highway Deployment

Training Diffusion Models with Reinforcement Learning We deployed 100 reinforcement learning (RL)-controlled cars into rush-hour highway traffic to smooth congestion and reduce fuel consumption for everyone. Our goal is to…

🕐 1 year, 1 month ago

Read →

Virtual Personas for Language Models via an Anthology of Backstories

🐻 Research Berkeley AI Research

Virtual Personas for Language Models via an Anthology of Backstories

We introduce Anthology , a method for conditioning LLMs to representative, consistent, and diverse virtual personas by generating and utilizing naturalistic backstories with rich details of individual values and experience.…

🕐 1 year, 5 months ago

Read →