Seventy3: turning papers into podcasts with NotebookML, so everyone can keep learning alongside AI.
Today's topic: Attention Is All You Need
Source: Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... & Polosukhin, I. (2017). Attention is all you need. Advances in neural information processing systems, 30.
Main Theme: This paper introduces the Transformer, a novel neural network architecture for sequence transduction tasks like machine translation. The key innovation is the exclusive reliance on attention mechanisms, eliminating the need for recurrent or convolutional layers that have been dominant in previous approaches.
Most Important Ideas/Facts:
- Problem: Existing sequence transduction models, primarily based on RNNs and CNNs, struggle with parallelization and long-range dependencies, leading to increased training time and limitations in capturing global context.
- Solution: The Transformer utilizes a self-attention mechanism to compute representations of the input and output sequences, enabling parallelization and facilitating the modeling of long-range dependencies.
- Key Components:
- Multi-head Attention: Allows the model to attend to different aspects of the input sequence simultaneously, capturing richer representations than a single attention head.
- Scaled Dot-Product Attention: An efficient attention mechanism that computes weights from the dot products of query and key vectors, scaled by 1/√d_k so that large dot products do not push the softmax into regions with vanishing gradients (see the first sketch after this list).
- Positional Encoding: Since the Transformer has no recurrence or convolution to convey order, sinusoidal positional encodings are added to the input embeddings to provide information about token positions (see the second sketch after this list).
- Advantages:
- Parallelization: The Transformer's architecture allows for significant parallelization, leading to faster training times.
- Long-Range Dependencies: Self-attention enables the model to capture dependencies between words regardless of their distance in the sequence, addressing a limitation of RNNs.
- Interpretability: Attention weights provide insights into the model's decision-making process, highlighting which parts of the input sequence are most relevant for a given prediction.
- Results: The Transformer achieves state-of-the-art results on machine translation tasks, outperforming previous models in terms of BLEU scores and training efficiency.
- On the WMT 2014 English-to-German translation task, the Transformer achieves a BLEU score of 28.4, surpassing previous best results by over 2 BLEU.
- On the WMT 2014 English-to-French translation task, the Transformer achieves a BLEU score of 41.0 after training for only 3.5 days on eight GPUs.
- Key Quotes:
- "The Transformer allows for significantly more parallelization and can reach a new state of the art in translation quality after being trained for as little as twelve hours on eight P100 GPUs."
- "Self-attention, sometimes called intra-attention is an attention mechanism relating different positions of a single sequence in order to compute a representation of the sequence."
- "Multi-head attention allows the model to jointly attend to information from different representation subspaces at different positions. With a single attention head, averaging inhibits this."
Significance: The Transformer's introduction marked a significant advancement in the field of natural language processing, establishing a new paradigm for sequence transduction tasks. Its impact can be seen in the widespread adoption of attention mechanisms and Transformer-based models in various NLP applications.
Original paper: arxiv.org