8 months ago

Abstract

Computer end users have spent billions of hours completing daily tasks like tabular data processing and project timeline scheduling. Most of these tasks are repetitive and error-prone, yet most end users lack the skill to automate these burdensome works. With the advent of large language models (LLMs), directing software with natural language user requests become a reachable goal. In this work, we propose a SheetCopilot agent that takes natural language task and control spreadsheet to fulfill the requirements. We propose a set of atomic actions as an abstraction of spreadsheet software functionalities. We further design a state machine-based task planning framework for LLMs to robustly interact with spreadsheets. We curate a representative dataset containing 221 spreadsheet control tasks and establish a fully automated evaluation pipeline for rigorously benchmarking the ability of LLMs in software control tasks. Our SheetCopilot correctly completes 44.3% of tasks for a single generation, outperforming the strong code generation baseline by a wide margin. Our project page:https://sheetcopilot.github.io/.

Source PDF

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

8 months ago

Hongxin Li1,2, Jingran Su3,4, Yuntao Chen†3, Qing Li†4, and Zhaoxiang Zhang†1,2,3,5

Abstract

Source PDF

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

8 months ago

Hongxin Li1,2, Jingran Su3,4, Yuntao Chen†3, Qing Li†4, and Zhaoxiang Zhang†1,2,3,5

Abstract

Source PDF

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

SheetCopilot: Bringing Software Productivity to the Next Level through Large Language Models

Hongxin Li*1,2, Jingran Su*3,4, Yuntao Chen†3, Qing Li†4, and Zhaoxiang Zhang†1,2,3,5

Abstract

Build AI with AI

HyperAI Newsletters

Command Palette

SheetCopilot: Bringing Software Productivity to the Next Level through Large Language Models

Hongxin Li*1,2, Jingran Su*3,4, Yuntao Chen†3, Qing Li†4, and Zhaoxiang Zhang†1,2,3,5

Abstract

Build AI with AI

HyperAI Newsletters

Command Palette

SheetCopilot: Bringing Software Productivity to the Next Level through Large Language Models

Hongxin Li*1,2, Jingran Su*3,4, Yuntao Chen†3, Qing Li†4, and Zhaoxiang Zhang†1,2,3,5

Abstract

Build AI with AI

HyperAI Newsletters

Hongxin Li1,2, Jingran Su3,4, Yuntao Chen†3, Qing Li†4, and Zhaoxiang Zhang†1,2,3,5

Hongxin Li1,2, Jingran Su3,4, Yuntao Chen†3, Qing Li†4, and Zhaoxiang Zhang†1,2,3,5

Hongxin Li1,2, Jingran Su3,4, Yuntao Chen†3, Qing Li†4, and Zhaoxiang Zhang†1,2,3,5