AI Papers Podcast Daily

Responsible AI in Construction Safety: Systematic Evaluation of Large Language Models and Prompt Engineering



This research examines how well large language models (LLMs) such as GPT-3.5 and GPT-4 can be used to improve safety in the construction industry. Construction is dangerous work, and these models could help keep workers safe by providing safety information and identifying hazards. The researchers tested both models on questions from real safety certification exams and found that each scored above the passing grade. GPT-4 outperformed GPT-3.5, suggesting that larger models trained on more data perform better. The study also looked at how different ways of phrasing questions, known as "prompt engineering," affect the models' answers. It found no single best prompting strategy: the most effective approach depends on the specific model and the type of safety information needed. Although these models show promise for improving construction safety, they still make mistakes. They sometimes give wrong answers, struggle with math problems, or have trouble recalling information. Human experts are therefore still needed to make sure the AI is used safely and correctly.

https://arxiv.org/pdf/2411.08320


By AIPPD