I love reading research papers, but often find myself with pockets of time when I can’t sit down and read. This podcast bridges that gap, offering bite-sized explorations of individual papers.
Note
The voices are AI-generated. Think of this as your research reading assistant: quick, consistent, and focused on the key points.
Latest Episodes
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Artificial Intelligence
Reinforcement Learning
Language Models
Reasoning
Supervised Fine-Tuning
Distillation
DeepSeek-V3: Advancements in Open-Source Large Language Models
Deep Learning
Natural Language Processing
Neural Networks
Machine Learning
Titans: Learning to Memorize at Test Time
Machine Learning
Artificial Intelligence
Neural Networks
Memory Modules
Transformer2: Self-Adaptive Large Language Models
Artificial Intelligence
Natural Language Processing
Deep Learning
Machine Learning
Adaptive Systems
Learning to Learn Optimization Algorithms with LSTM Networks
Machine Learning
Meta-Learning
Optimization Algorithms
Recurrent Neural Networks
Trust Region Policy Optimization
Reinforcement Learning
Policy Optimization
Trust Region Methods
Artificial Intelligence
Efficient Deep Learning Parallelization using SOAP Search Space and FlexFlow Framework
Deep Learning
Parallelization
Distributed Computing
Neural Networks
Optimization
Deep Retrieval: Learning Efficient Structures for Large-Scale Recommendation Systems
Machine Learning
Recommendation Systems
Information Retrieval
Deep Learning
Scaling User Modeling for Personalized Advertising at Meta
Personalized Advertising
User Modeling
Deep Learning
Neural Networks
LiNR: Revolutionizing Large-Scale Retrieval for Recommendation Systems
Machine Learning
Information Retrieval
Recommender Systems
Deep Learning
GPU-based Systems
Comprehensive Guide to Real-Time Bidding (RTB): Challenges and Opportunities
Online Advertising
Real-Time Bidding
Digital Auctions
User Response Prediction
Bidding Strategies
Dynamic Pricing
Ad Fraud Detection
Efficient Inference for Large Language Models with LLM.int8()
Artificial Intelligence
Natural Language Processing
8-bit Quantization
Transformer Models
Enhancing Language Models with a Massive Datastore
Artificial Intelligence
Language Models
Data Retrieval
Natural Language Processing
In-Context Policy Iteration: Enhancing Reinforcement Learning with Large Language Models
Reinforcement Learning
Large Language Models
AI
Policy Iteration
Optimizing Quantization of Large Language Models for Efficiency and Accuracy
Machine Learning
Natural Language Processing
Quantization
Efficiency
Model Compression
AutoPruner: End-to-End Trainable Filter Pruning for Efficient Deep Neural Networks
Deep Learning
Neural Networks
Model Compression
SparseGPT: One-shot Pruning of Large Language Models
Artificial Intelligence
Natural Language Processing
Model Compression
Efficient Compression of Large Language Models using LLM-Pruner
Artificial Intelligence
Natural Language Processing
Model Compression
ScreenAgent: A Vision Language Model-driven Computer Control Agent
Artificial Intelligence
Computer Vision
Natural Language Processing
Artificial GUI Interaction
Supervised Pretraining for In-Context Reinforcement Learning with Transformers
Reinforcement Learning
Transformers
Meta-Learning
Deep Neural Networks
Decision-Pretrained Transformer: Bridging Supervised Learning and Reinforcement Learning
Reinforcement Learning
Transformer Models
Decision-Making
How Transformers Learn In-Context Beyond Simple Functions
Artificial Intelligence
Deep Learning
Transformers
In-Context Learning
Representation Learning
In-Context Learning Capabilities of Transformers
Machine Learning
Deep Learning
Transformer Models
In-Context Learning
Spider2-V: Automated Multimodal Agents for Data Science Workflows
Artificial Intelligence
Artificial GUI Interaction
Data Science
Generalization Patterns of Transformers in In-Weights Learning and In-Context Learning
Artificial Intelligence
Deep Learning
Machine Learning
Unmasking the Lottery Ticket Hypothesis
Deep Learning
Neural Networks
Network Pruning
Machine Learning
Rethinking Scale for In-Context Learning in Large Language Models
Natural Language Processing
Large Language Models
Transformer Architecture
In-Context Learning
Model Pruning
Ferret-UI: Multimodal Large Language Model for Mobile User Interface Understanding
Artificial Intelligence
Artificial GUI Interaction
Mobile Applications
Grounded SAM: A Novel Approach to Open-Set Segmentation
Computer Vision
Open-World Visual Perception
Segmentation Models
SAM 2: Segment Anything in Images and Videos
Computer Vision
Deep Learning
Video Segmentation
SAM 2
Visual Perception
RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning
Artificial Intelligence
Reinforcement Learning
Deep Learning
Evolutionary Optimization of Model Merging Recipes
Artificial Intelligence
Machine Learning
Natural Language Processing
Speculative Execution for Efficient Inference in Large Language Models on Consumer Devices
Artificial Intelligence
Large Language Models
Systems and Performance
In-context Learning and Induction Heads
Natural Language Processing
Deep Learning
Explainable AI
AI Safety
Geometric Properties of Data Representations in Deep Neural Networks
Deep Learning
Machine Learning
Explainable AI
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
3D Vision
Computer Vision
Deep Learning
Graph Isomorphism Networks: A Theoretical Framework and Architecture
Graph Neural Networks
Machine Learning
Deep Learning
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
Deep Learning
Machine Learning
Optimization
Adding Conditional Control to Text-to-Image Diffusion Models
Generative Models
Computer Vision
Deep Learning
Multimodal AI
Segment Anything: A Paradigm Shift in Image Segmentation
Computer Vision
Deep Learning
Machine Learning
Learning Transferable Visual Models From Natural Language Supervision
Computer Vision
Natural Language Processing
Multimodal AI
Language Models are Few-Shot Learners
Natural Language Processing
Few-Shot/Meta-Learning
Deep Learning
Training Deep Reinforcement Learning Systems with Human Preferences
Reinforcement Learning
Deep Learning
AI Safety
Playing Atari with Deep Reinforcement Learning
Deep Learning
Reinforcement Learning
Artificial Intelligence
Single Path One-Shot (SPOS): Efficient Neural Architecture Search with Simplified Supernet
Deep Learning
Optimization
Machine Learning
Long-CLIP: Extending Text Length for Improved Vision-Language Modeling
Multimodal AI
Natural Language Processing
Computer Vision
𝑓VDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence
3D Vision
Deep Learning
Systems and Performance
Unraveling the Connection between In-Context Learning and Gradient Descent in Transformers
Natural Language Processing
Deep Learning
Explainable AI
Gradient Low-Rank Projection (GaLore): Revolutionizing Memory-Efficient LLM Training
Natural Language Processing
Optimization
Systems and Performance
Retrieval-Enhanced Transformers (RETRO): A Semi-Parametric Approach to Enhance Performance of Large Language Models
Natural Language Processing
Deep Learning
Systems and Performance
Foundation Models in Decision Making: Roles, Challenges, and Opportunities
Artificial Intelligence
Machine Learning
Explainable AI
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
Deep Learning
Transformers
Systems and Performance
PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel
Systems and Performance
Deep Learning
Machine Learning
Hyper Networks: A Novel Approach to Learning Weights in Deep Neural Networks
Deep Learning
Machine Learning
Neural Networks
TiTok: A Transformer-based 1D Tokenization Approach for Image Generation
Generative Models
Computer Vision
Transformers
NerfBaselines: A Framework for Standardized Evaluation of Novel View Synthesis Methods in Computer Vision
3D Vision
Computer Vision
Systems and Performance
Survey on Reinforcement Learning in Recommender Systems
Reinforcement Learning
Recommender Systems
Machine Learning
Training Large Language Models for Compiler Optimization
Natural Language Processing
Systems and Performance
AI for Science
Metadata-based Color Harmonization for Multi-camera Surround View Systems
Computer Vision
Autonomous Driving
Extrapolated View Synthesis for Urban Scene Reconstruction
3D Vision
Computer Vision
Generative Models
SafePathNet: Learning a Distribution of Trajectories for Safe and Comfortable Autonomous Driving
Autonomous Driving
AI Safety
Machine Learning
Unsupervised Occupancy Fields for Perception and Forecasting
Computer Vision
Machine Learning
Autonomous Driving
UniPAD: A Universal Pre-training Paradigm for Autonomous Driving
Autonomous Driving
Deep Learning
Computer Vision
Robustness Evaluation of HD Map Constructors under Sensor Corruptions for Autonomous Driving
Autonomous Driving
Computer Vision
AI Safety
DriveVLM: Vision-Language Models for Autonomous Driving in Urban Environments
Autonomous Driving
Computer Vision
Multimodal AI
TransAct: Transformer-based Realtime User Action Model for Recommendation at Pinterest
Recommender Systems
Transformers
Systems and Performance
ZeRO: Memory Optimizations Toward Training Trillion Parameter Models
Systems and Performance
Deep Learning
Natural Language Processing
A Better Match for Drivers and Riders: Reinforcement Learning at Lyft
Reinforcement Learning
Recommender Systems
Machine Learning
No-Transaction Band Network: A Neural Network Architecture for Efficient Deep Hedging
Deep Learning
AI for Science
Machine Learning
AutoEmb: Automated Embedding Dimensionality Search in Streaming Recommendations
Deep Learning
Recommender Systems
Optimization
About
- The voices you’ll hear are AI-generated, not real people (though names from papers might appear).
- While we strive for accuracy, these are complex topics. Our current AI systems aren’t perfect, so approach with a critical mind.
- Consider this a starting point. For deeper understanding, always refer to the original paper.
- The papers featured are ones I’m personally interested in or have been wanting to read. It’s a curated selection based on my interests.
This podcast aims to spark curiosity and make cutting-edge research more accessible. It’s perfect for those moments when you want to learn but can’t dive into a full paper.
Enjoy the exploration of ideas, and let it fuel your interest in further reading!
The (AI) Team
- Alex Askwell: Our curious and knowledgeable moderator, always ready with the right questions to guide our exploration.
- Dr. Paige Turner: Our lead researcher and paper expert, diving deep into the methods and results.
- Prof. Wyd Spectrum: Our field expert, providing broader context and critical insights.
Join them as they break down complex research into byte-sized breakthroughs!
Request a paper
If you have a paper you’d like to see discussed, please fill out this form.