创新实践:深入解析 Genius Scan

When you use your smartphone to photograph a landscape, the device’s orientation is detected, and the final picture is well-oriented. In other words, regardless of how you hold your phone, whether upside down or sideways, causing the camera sensor to be oriented differently, the resulting photo will automatically adjust to ensure that the sky is at the top.
当您使用智能手机拍摄风景时,设备会检测其方向,最终的照片也会正确定向。换句话说,无论您如何握持手机,无论是倒置还是侧握,导致相机传感器方向不同,生成的照片都会自动调整,以确保天空在顶部。
But, when you use your smartphone to scan a document, it’s often laid flat, and you position your device above it. With your device parallel to the ground, the sensors can’t detect the device’s rotation when you turn your smartphone from portrait to landscape. This results in your scanned document appearing sideways or upside down. Historically, you’d need to rotate the image manually, a typical process with our competitors and even standard photo apps.
但是,当您使用智能手机扫描文档时,文档通常是平放的,而您将设备置于其上方。当设备与地面平行时,传感器无法在您将智能手机从竖屏切换到横屏时检测到设备的旋转。这会导致扫描出的文档出现侧向或倒置的情况。过去,您需要手动旋转图像,这是我们的竞争对手甚至标准照片应用中常见的处理流程。
Recognizing the frustration caused by this common scanning hiccup, we embarked on a mission to simplify this process for our users. Our goal was clear: develop an algorithm that could automatically determine the correct orientation of a document solely from the image’s content, eliminating the need for manual adjustments without slowing down the app with new algorithms, as everything we create is embedded into Genius Scan and doesn’t rely on external servers for privacy reasons.
认识到这种常见的扫描小故障带来的挫败感,我们开始了一项为用户简化此流程的任务。我们的目标很明确:开发一种算法,仅从图像内容就能自动确定文档的正确方向,从而消除手动调整的需要,同时不会因为新算法而拖慢应用速度,因为我们创建的所有内容都嵌入到Genius Scan中,并且出于隐私原因不依赖外部服务器。
The Quest for a Solution
寻找解决方案
As we delved into the challenge, we first explored existing options but found them lacking for our needs. Some solutions relied on text recognition to determine the document’s orientation, but this method wasn’t foolproof. It didn’t work well with all document types, especially those without printed text, and was too slow to be us...