ロボットは大学入学試験に合格できるのか？:Can a robot pass a university entrance exam?/新井紀子

Meet Todai Robot, an AI project that performed in the top 20 percent of students on the entrance exam for the University of Tokyo — without actually understanding a thing. While it’s not matriculating anytime soon, Todai Robot’s success raises alarming questions for the future of human education. How can we help kids learn the things that humans can do better than AI?

Todai Robot を紹介します。この AI プロジェクトは、実際には何も理解していなくても、東京大学の入学試験で上位 20% の成績を収めた学生です。

すぐに入学できるわけではありませんが、Todai Robot の成功は人間の教育の将来に憂慮すべき問題を引き起こしています。人間が AI よりも優れたことができることを子供たちが学ぶのをどのように支援できるでしょうか?

タイトル	Can a robot pass a university entrance exam?
アップロード	2017年9月14日
キャスト	新井紀子

ロボットは大学入学試験に合格できるのか？:Can a robot pass a university entrance exam?の要約
- 書き起こし

ロボットは大学入学試験に合格できるのか？:Can a robot pass a university entrance exam?の要約

AIと人間の関係

AI研究者たちは、AIが奪うのは単純労働だけで、人間が新しい仕事を見つけられると言ってきました。しかし、実際にどれだけの人が新しい仕事を見つけられるかが問題です。

東大ロボットプロジェクト

私は東京大学の入学試験に合格するAIを作る「東大ロボットプロジェクト」を始めました。このロボットはリモートサーバーに頭脳があり、歴史のエッセイを書くなどのタスクを行います。

AIの動作原理

現代のAIは、検索と最適化が得意で、事実ベースの質問に答える際に統計的な手法を用います。しかし、意味を理解しているわけではありません。例として、ワトソンや東大ロボットがどのように答えを見つけるかを説明しました。

東大入試とAIの限界

東大入試には、全国共通テストと大学独自の筆記試験があります。東大ロボットはエッセイを書いたり、数学の問題を解くことができますが、意味を理解していないために完全な合格はできません。

人間の理解力とAIの比較

AIが多くのタスクで人間を上回ることができますが、意味の理解という点では人間が優れています。しかし、多くの学生が知識を暗記するだけで、理解していない現状が問題です。

教育の必要性

AIが進化する中で、人間がどのように共存するかを考えるために、教育の改革が必要です。意味を理解する教育を強化しなければなりません。

書き起こし

Today, I’m going to talk about AI and us.

AI researchers have always said that we humans do not need to worry, because only menial jobs will be taken over by machines. Is that really true? They have also said that AI will create new jobs, so those who lose their jobs will find a new one. Of course. But the real question is: How many of those who may lose their jobs to AI will be able to land a new one, especially when AI is smart enough to learn better than most of us?

Let me ask you a question: How many of you think that AI will pass the entrance examination of a top university by 2020? Oh, so many. OK. So some of you may say, “Of course, yes!” Now singularity is the issue. And some others may say, “Maybe, because AI already won against a top Go player.” And others may say, “No, never. Uh-uh.” That means we do not know the answer yet, right?

So that was the reason why I started Todai Robot Project, making an AI which passes the entrance examination of the University of Tokyo, the top university in Japan. This is our Todai Robot. And, of course, the brain of the robot is working in the remote server. It is now writing a 600-word essay on maritime trade in the 17th century. How does that sound?

Why did I take the entrance exam as its benchmark? Because I thought we had to study the performance of AI in comparison to humans, especially on the skills and expertise which are believed to be acquired only by humans and only through education. To enter Todai, the University of Tokyo, you have to pass two different types of exams. The first one is a national standardized test in multiple-choice style. You have to take seven subjects and achieve a high score — I would say like an 85 percent or more accuracy rate — to be allowed to take the second stage written test prepared by Todai.

So let me first explain how modern AI works, taking the “Jeopardy!” challenge as an example. Here is a typical “Jeopardy!” question: “Mozart’s last symphony shares its name with this planet.” Interestingly, a “Jeopardy!” question always asks, always ends with “this” something: “this” planet, “this” country, “this” rock musician, and so on. In other words, “Jeopardy!” doesn’t ask many different types of questions, but a single type, which we call “factoid questions.”

By the way, do you know the answer? If you do not know the answer and if you want to know the answer, what would you do? You Google, right? Of course. Why not? But you have to pick appropriate keywords like “Mozart,” “last” and “symphony” to search. The machine basically does the same. Then this Wikipedia page will be ranked top. Then the machine reads the page. No, uh-uh. Unfortunately, none of the modern AIs, including Watson, Siri and Todai Robot, is able to read. But they are very good at searching and optimizing. It will recognize that the keywords “Mozart,” “last” and “symphony” are appearing heavily around here. So if it can find a word which is a planet and which is co-occurring with these keywords, that must be the answer. This is how Watson finds the answer “Jupiter,” in this case.

Our Todai Robot works similarly, but a bit smarter in answering history yes-no questions, like, “‘Charlemagne repelled the Magyars.’ Is this sentence true or false?” Our robot starts producing a factoid question, like: “Charlemagne repelled [this person type]” by itself. Then, “Avars” but not “Magyars” is ranked top. This sentence is likely to be false. Our robot does not read, does not understand, but it is statistically correct in many cases.

For the second stage written test, it is required to write a 600-word essay like this one: [Discuss the rise and fall of the maritime trade in East and Southeast Asia in the 17th century …] and as I have shown earlier, our robot took the sentences from the textbooks and Wikipedia, combined them together, and optimized it to produce an essay without understanding a thing. (Laughter) But surprisingly, it wrote a better essay than most of the students. (Laughter)

How about mathematics? A fully automatic math-solving machine has been a dream since the birth of the word “artificial intelligence,” but it has stayed at the level of arithmetic for a long, long time. Last year, we finally succeeded in developing a system which solved pre-university-level problems from end to end, like this one. This is the original problem written in Japanese, and we had to teach it 2,000 mathematical axioms and 8,000 Japanese words to make it accept the problems written in natural language. And it is now translating the original problems into machine-readable formulas. Weird, but it is now ready to solve it, I think. Go and solve it. Yes! It is now executing symbolic computation. Even more weird, but probably this is the most fun part for the machine. (Laughter) Now it outputs a perfect answer, though its proof is impossible to read, even for mathematicians. Anyway, last year our robot was among the top one percent in the second stage written exam in mathematics. (Applause) Thank you.

So, did it enter Todai? No, not as I expected. Why? Because it doesn’t understand any meaning. Let me show you a typical error it made in the English test. [Nate: We’re almost at the bookstore. Just a few more minutes. Sunil: Wait. ______ . Nate: Thank you! That always happens …] Two people are talking. For us, who can understand the situation — [1. “We walked for a long time.” 2. “We’re almost there.” 3. “Your shoes look expensive.” 4. “Your shoelace is untied.”] it is obvious number four is the correct answer, right? But Todai Robot chose number two, even after learning 15 billion English sentences using deep learning technologies.

OK, so now you might understand what I said: modern AIs do not read, do not understand. They only disguise as if they do. This is the distribution graph of half a million students who took the same exam as Todai Robot. Now our Todai Robot is among the top 20 percent, and it was capable to pass more than 60 percent of the universities in Japan — but not Todai. But see how it is beyond the volume zone of to-be white-collar workers.

You might think I was delighted. After all, my robot was surpassing students everywhere. Instead, I was alarmed. How on earth could this unintelligent machine outperform students — our children? Right? I decided to investigate what was going on in the human world. I took hundreds of sentences from high school textbooks and made easy multiple-choice quizzes, and asked thousands of high school students to answer. Here is an example: [Buddhism spread to … , Christianity to … and Oceania, and Islam to …] Of course, the original problems are written in Japanese, their mother tongue. [ ______ has spread to Oceania. 1. Hinduism 2. Christianity 3. Islam 4. Buddhism ] Obviously, Christianity is the answer, isn’t it? It’s written! And Todai Robot chose the correct answer, too. But one-third of junior high school students failed to answer this question.

Do you think it is only the case in Japan? I do not think so, because Japan is always ranked among the top in OECD PISA tests, measuring 15-year-old students’ performance in mathematics, science and reading every three years. We have been believing that everybody can learn and learn well, as long as we provide good learning materials free on the web so that they can access through the internet. But such wonderful materials may benefit only those who can read well, and the percentage of those who can read well may be much less than we expected.

How we humans will coexist with AI is something we have to think about carefully, based on solid evidence. At the same time, we have to think in a hurry because time is running out. Thank you. (Applause)

Chris Anderson: Noriko, thank you.

Noriko Arai: Thank you.

CA: In your talk, you so beautifully give us a sense of how AIs think, what they can do amazingly and what they can’t do. But — do I read you right, that you think we really need quite an urgent revolution in education to help kids do the things that humans can do better than AIs?

NA: Yes, yes, yes. Because we humans can understand the meaning. That is something which is very, very lacking in AI. But most of the students just pack the knowledge without understanding the meaning of the knowledge, so that is not knowledge, that is just memorizing, and AI can do the same thing. So we have to think about a new type of education.

CA: A shift from knowledge, rote knowledge, to meaning.

NA: Mm-hmm.

CA: Well, there’s a challenge for the educators. Thank you so much.

NA: Thank you very much. Thank you.

(Applause)