ChatGPT fails when it comes to accounting, finds major study

Microsoft-backed OpenAI has launched its newest AI chatbot product, GPT-4 which uses machine learning to generate natural language text, passed the bar exam with a score in the 90th percentile, passed 13 of 15 advanced placement (AP) exams and got a nearly perfect score on the GRE Verbal test.

Students scored an overall average of 76.7 per cent, compared to ChatGPT`s score of 47.4 per cent. Source: Reuters

AI chatbot ChatGPT is still no match for humans when it comes to accounting and while it is a game changer in several fields, the researchers say the AI still has work to do in the realm of accounting.

Microsoft-backed OpenAI has launched its newest AI chatbot product, GPT-4 which uses machine learning to generate natural language text, passed the bar exam with a score in the 90th percentile, passed 13 of 15 advanced placement (AP) exams and got a nearly perfect score on the GRE Verbal test.

"It`s not perfect; you`re not going to be using it for everything," said Jessica Wood, currently a freshman at Brigham Young University (BYU) in the US. "Trying to learn solely by using ChatGPT is a fool`s errand."

Researchers at BYU and 186 other universities wanted to know how OpenAI`s tech would fare on accounting exams. They put the original version, ChatGPT, to the test.

"We`re trying to focus on what we can do with this technology now that we couldn`t do before to improve the teaching process for faculty and the learning process for students. Testing it out was eye-opening," said lead study author David Wood, a BYU professor of accounting.

Although ChatGPT`s performance was impressive, the students performed better.

Students scored an overall average of 76.7 per cent, compared to ChatGPT`s score of 47.4 per cent.

On a 11.3 per cent of questions, ChatGPT scored higher than the student average, doing particularly well on AIS and auditing.

But the AI bot did worse on tax, financial, and managerial assessments, possibly because ChatGPT struggled with the mathematical processes required for the latter type, said the study published in the journal Issues in Accounting Education.

When it came to question type, ChatGPT did better on true/false questions and multiple-choice questions, but struggled with short-answer questions.

In general, higher-order questions were harder for ChatGPT to answer.

"ChatGPT doesn`t always recognise when it is doing math and makes nonsensical errors such as adding two numbers in a subtraction problem, or dividing numbers incorrectly," the study found.

ChatGPT often provides explanations for its answers, even if they are incorrect. Other times, ChatGPT`s descriptions are accurate, but it will then proceed to select the wrong multiple-choice answer.

"ChatGPT sometimes makes up facts. For example, when providing a reference, it generates a real-looking reference that is completely fabricated. The work and sometimes the authors do not even exist," the findings showed.

That said, authors fully expect GPT-4 to improve exponentially on the accounting questions posed in their study.

Get Latest Business News, Stock Market Updates and Videos; Check your tax outgo through Income Tax Calculator and save money through our Personal Finance coverage. Check Business Breaking News Live on Zee Business Twitter and Facebook. Subscribe on YouTube.

ChatGPT fails when it comes to accounting, finds major study

Microsoft-backed OpenAI has launched its newest AI chatbot product, GPT-4 which uses machine learning to generate natural language text, passed the bar exam with a score in the 90th percentile, passed 13 of 15 advanced placement (AP) exams and got a nearly perfect score on the GRE Verbal test.

RECOMMENDED STORIES

Power of Rs 15,000 SIP: How long it will take to achieve Rs 7 crore corpus? See calculations to know

PPF For Regular Income: How to get Rs 85,000 a month tax-free income from Public Provident Fund?

Katra-Srinagar Vande Bharat Train: Northern Railway announces train timings; check fare, route and other key details

Power of Rs 5,500 SIP: In how many years, Rs 5,500 monthly step up SIP can generate over Rs 10 crore retirement corpus

Latest SBI Senior Citizens FD Rates: What will you get on maturity if you invest Rs 9,89,898, Rs 8,78,787, and Rs 6,56,565 in Amrit Vrishti, 1-, 3-, and 5-year FDs?

Largecap PSU Stock for 65% Gain in New Year: Anil Singhvi picks PSU bank for long term; know reasons and target prices

Largecap, Midcap, Smallcap Stocks To Buy: Analysts recommend buying 3 stocks for 2 weeks; note down targets