June 13, 2024

Ep. 261 - Part 1 - June 11, 2024

38 minutes

ArXiv NLP research for Tuesday, June 11, 2024.

00:20: A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Any Translation

01:41: Post-Hoc Answer Attribution for Grounded and Trustworthy Long Document Comprehension: Task, Insights, and Challenges

02:32: A Probabilistic Framework for LLM Hallucination Detection via Belief Tree Propagation

04:08: Evolving Subnetwork Training for Large Language Models

05:31: Missingness-resilient Video-enhanced Multimodal Disfluency Detection

06:37: Mitigating Boundary Ambiguity and Inherent Bias for Text Classification in the Era of Large Language Models

08:14: Crayon: Customized On-Device LLM via Instant Adapter Blending and Edge-Server Hybrid Inference

09:33: Delving into ChatGPT usage in academic writing through excess vocabulary

10:53: Paying More Attention to Source Context: Mitigating Unfaithful Translations from Large Language Model

12:12: CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation

13:26: Effectively Compress KV Heads for LLM

15:00: Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study

16:54: Reading Miscue Detection in Primary School through Automatic Speech Recognition

18:09: HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination Evaluation

20:01: DARA: Decomposition-Alignment-Reasoning Autonomous Language Agent for Question Answering over Knowledge Graphs

21:15: Efficiently Exploring Large Language Models for Document-Level Machine Translation with In-context Learning

22:35: Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors in Inference Trees

24:42: Translating speech with just images

25:35: Never Miss A Beat: An Efficient Recipe for Context Window Extension of Large Language Models with Consistent "Middle" Enhancement

26:51: Teaching Language Models to Self-Improve by Learning from Language Feedback

28:25: Merging Improves Self-Critique Against Jailbreak Attacks

29:18: Towards Human-AI Collaboration in Healthcare: Guided Deferral Systems with Large Language Models

30:11: Improving Autoformalization using Type Checking

31:37: Improving Commonsense Bias Classification by Mitigating the Influence of Demographic Terms

33:19: Decipherment-Aware Multilingual Learning in Jointly Trained Language Models

34:20: DUAL-REFLECT: Enhancing Large Language Models for Reflective Translation through Dual Learning Feedback Mechanisms

35:20: On the Hallucination in Simultaneous Machine Translation

36:07: MBBQ: A Dataset for Cross-Lingual Comparison of Stereotypes in Generative LLMs

37:42: Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway

...more

By Brad Edwards