Why Johnny can’t prompt: how non-AI experts try (and fail) to design LLM prompts

Zamfirescu-Pereira, J.D.; Y. Wong, Richmond; Hartmann, Richmond; Yang, Qian

Why Johnny can’t prompt: how non-AI experts try (and fail) to design LLM prompts

Fecha

2023

Autores

Zamfirescu-Pereira, J.D.

Y. Wong, Richmond

Hartmann, Richmond

Yang, Qian

Abrir versión en línea

Resumen

Pre-trained large language models (“LLMs”) like GPT-3 can engage in fluent, multi-turn instruction-taking out-of-the-box, making them attractive materials for designing natural language interactions. Using natural language to steer LLM outputs (“prompting”) has emerged as an important design technique potentially accessible to non-AI-experts. Crafting effective prompts can be challenging, however, and prompt-based interactions are brittle. Here, we explore whether non-AI-experts can successfully engage in “end-user prompt engineering” using a design probe—a prototype LLM-based chatbot design tool supporting development and systematic evaluation of prompting strategies. Ultimately, our probe participants explored prompt designs opportunistically, not systematically, and struggled in ways echoing end-user programming systems and interactive machine learning systems. Expectations stemming from human-to-human instructional experiences, and a tendency to overgeneralize, were barriers to effective prompt design. These findings have implications for non-AI-expert-facing LLM-based tool design and for improving LLM-and-prompt literacy among programmers and the public, and present opportunities for further research.

Palabras clave

Modelos de lenguaje, Lenguaje natural, Expertos en IA

URI

https://hdl.handle.net/20.500.12010/38628

Colecciones

Tecnologías con Inteligencia Artificial

Página completa del ítem

Why Johnny can’t prompt: how non-AI experts try (and fail) to design LLM prompts

Fecha

Fecha

Autores

Director de trabajo de grado

Título de la revista

ISSN de la revista

Título del volumen

Editor

Resumen

Descripción

Palabras clave

Citación

URI

Colecciones

Aprobación

Revisión

Complementado por

Referenciado por