Sep 6, 2023
Google translate consistently omits sentences with special characters from Japanese-Korean results
When translating Japanese sentences into Korean, Google translator outright omits certain sentences. I've figured out that such sentences include certain special characters, usually ellipsis. For example:
「ごもっともです。……実は私が待っているのは、今日あたり江を下ってくると聞いている洛陽船でございます」
(From Sangukushi by Eiji Yoshikawa)
Regardless of the platform - Android translate app, web-based translator, Chrome translator function - the sentence after ellipsis is simply not displayed.
「정말입니다. …
Translation result with ellipsis.
「잘 됐습니다. 실은 제가 기다리고 있는 것은 오늘 당강을 내려오겠다고 듣고 있는 낙양선입니다」
Translation result after manual removal of ellipsis. The bold part is what was disappeared from the result with ellipses. The translator also somehow changed the sentence before the ellipsis - incorrectly - but that's not the issue I'm trying to tackle here.
On the other hand, translation to other languages such as English works just fine even with ellipses.
"That makes sense. ...Actually, what I'm waiting for is the Luoyang ship that I heard is coming down the river today."
The issue is not limited to ellipses, but related to other characters, too. Here's an example of translation that omits sentence after Asian quotation brackets:
少女は一本マッチを取り出して――「シュッ!」と、こすると、マッチがメラメラもえだしました!
(From The Little Match Girl by Hans Christian Andersen, translated by Okubo Yu)
소녀는 한 개 매치를 꺼내 ――「슈!
Translation result with Asian quotation brackets.
소녀는 한 개 매치를 꺼내 ――"슈!"라고, 문지르면, 매치가 멜라멜라도 냈습니다!
Translation result with brackets replaced with standard quotation marks. The bold part is what was disappeared from the result with brackets.
The girl took out a match and rubbed it, and the match started glowing!
English translation result. Somehow it omitted only the part with brackets.
This behavior is highly erratic and sometimes the translator fully translates a sentence with special characters just fine. I suspect that the system has fundamental issue with how it handles multiple Asian special characters. This isn't much of an issue when translating ordinary documents and pages which uses those characters sparingly, but a grievous problem when translating literary works as sentences or even paragraphs can go missing.
Community content may not be verified or up-to-date. Learn more.
All Replies (3)