Generative AI Performance in Elite Japanese University Entrance Exams

Apr 27, 2026, 07:53

Introduction

The AI company LifePrompt Inc. reported on April 27 that OpenAI's ChatGPT 5.2 Thinking model scored higher than the top human candidates in the 2026 entrance exams for the University of Tokyo and Kyoto University.

Main Body

To test the AI, the company converted exam questions into images. To ensure the essay answers were graded fairly, educators from the Kawai Juku preparatory school performed the evaluations. At the University of Tokyo, the model scored 503 out of 550 points in the Natural Sciences III medical track, beating the top human score of 453 by 50 points, and achieved a perfect score in mathematics. In the Humanities and Social Sciences exam, the AI scored 452 out of 550, which was higher than the top successful applicant's score of 434. Similarly, at Kyoto University, the model outperformed the top human scores in both the Faculty of Law and the Faculty of Medicine. However, the AI's performance varied depending on the subject. While it achieved a 90% accuracy rate in English, it only scored 25% on World History essay questions. These results show a major improvement in AI capabilities. Previous versions tested by LifePrompt in 2024 failed to pass, while the 2025 model was the first to reach the minimum passing score. Experts have different opinions on what these results mean for human intelligence and education. Satoshi Endo, head of LifePrompt, asserted that the rapid development of AI means businesses must change their long-term strategies over the next twenty years. On the other hand, Satoshi Kurihara, a professor at Keio University, criticized the comparison between humans and AI. He argued that because AI can absorb massive amounts of data, it is like a calculator. Consequently, he emphasized that universities should rethink exams that focus on memory and calculation rather than the ability to create original value.

Conclusion

In summary, while generative AI has outperformed humans in standardized tests and knowledge-based questions, it still faces challenges in specific areas of qualitative essay writing.

Vocabulary Learning

accuracy (n.)

correctness / the quality of being free from errors準確度

Example:The model achieved a 90% accuracy rate in English.

evaluations (n.)

assessment / the act of judging or measuring the quality of something評估

Example:The teachers conducted thorough evaluations of the students' essays.

outperformed (v.)

exceeded / performed better than others超越

Example:The AI outperformed the top human candidates in the exams.

rapid (adj.)

quick / happening at a fast pace快速

Example:The rapid development of AI has changed many industries.

rethink (v.)

reconsider / think about again with a new perspective重新思考

Example:Universities should rethink exams that focus on memory.

Sentence Learning

In the Humanities and Social Sciences exam, the AI scored 452 out of 550, which was higher than the top successful applicant's score of 434.

Relative Clause: The clause 'which was higher than the top successful applicant's score of 434' adds additional information about the AI's score, indicating it was better than the highest applicant's result.關係子句: 此子句為關於 AI 分數的額外資訊，說明其分數高於最高成功申請者的分數。

The essay answers were graded fairly by educators from the Kawai Juku preparatory school.

Passive Voice: The verb phrase 'were graded' is in passive voice, showing that the essay answers received the action performed by the educators.被動語態: 這句使用被動語態，主語『the essay answers』是動作的接受者，由『educators』執行。

While it achieved a 90% accuracy rate in English, it only scored 25% on World History essay questions.

Contrastive Conjunction: The word 'While' introduces a contrast between two different outcomes, showing that despite good performance in English, the AI performed poorly in World History.對比連詞: 這句使用『While』連接兩個對照的子句，表示雖然在英語上取得 90% 的準確率，但在世界歷史題目上僅 25%。

He argued that because AI can absorb massive amounts of data, it is like a calculator.

Causal Clause: The clause 'because AI can absorb massive amounts of data' explains the reason for the comparison, indicating that the AI's data absorption capacity makes it similar to a calculator.原因子句: 此句使用『because』引導原因子句，說明 AI 能吸收大量資料的原因，進而被比作計算機。

On the other hand, Satoshi Kurihara, a professor at Keio University, criticized the comparison between humans and AI.

Contrastive Conjunction: The phrase 'On the other hand' signals an opposing viewpoint, highlighting that another expert criticized the human-AI comparison.對比連詞: 這句使用『On the other hand』表示對立觀點，強調另一位教授的批評。