Intelligent Glasses

We are developing a novel concept of real-time wearable translating glasses, called intelligent glasses, which perform multilingual translation in real time. They are useful to tourists who do not know the local language.

Imagine a foreigner using this wearable translation system during a trip abroad. The system can tell the wearer which building is a hotel, a restaurant, a bank, or a supermarket; which bus line to take; how to reach a destination, based on translated road signs; and which dish suits his taste, based on a translated menu. All of this can make the journey much more pleasant.

The system consists of a small head-mounted camera, a wearable computer, and a head-mounted display. Together, these three components capture text in the scene, translate it, and present the result to the wearer. By plugging in different Optical Character Recognition (OCR) systems and translation systems, the glasses can be flexibly configured to translate between different languages.
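The configurable capture, recognize, translate, and display flow described above might be sketched as follows. This is only an illustration under stated assumptions, not the project's implementation: the names `TranslationPipeline`, `build_pipeline`, `OCR_ENGINES`, and `TRANSLATORS` are hypothetical, the camera frame is modelled as raw bytes, and the OCR engine and translator are trivial stand-ins for real systems.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple

# Hypothetical interfaces: an OCR engine maps a camera frame to text,
# and a translator maps source-language text to target-language text.
OcrEngine = Callable[[bytes], str]
Translator = Callable[[str], str]


@dataclass
class TranslationPipeline:
    """Chains one OCR engine and one translator (both swappable)."""
    ocr: OcrEngine
    translator: Translator

    def process(self, frame: bytes) -> str:
        recognized = self.ocr(frame)         # text recognition step
        return self.translator(recognized)   # translation step


def stub_ocr(frame: bytes) -> str:
    # Stand-in for a real OCR system: the "frame" already holds UTF-8 text.
    return frame.decode("utf-8").strip()


# Stand-in for a real translation system: a tiny phrasebook lookup.
_ZH_EN = {"酒店": "hotel", "餐馆": "restaurant", "银行": "bank", "超市": "supermarket"}


def stub_zh_to_en(text: str) -> str:
    # Unknown phrases are passed through unchanged.
    return _ZH_EN.get(text, text)


# Registries that make the language pair a configuration choice.
OCR_ENGINES: Dict[str, OcrEngine] = {"zh": stub_ocr}
TRANSLATORS: Dict[Tuple[str, str], Translator] = {("zh", "en"): stub_zh_to_en}


def build_pipeline(source: str, target: str) -> TranslationPipeline:
    """Select the OCR engine and translator for a language pair."""
    return TranslationPipeline(OCR_ENGINES[source], TRANSLATORS[(source, target)])


if __name__ == "__main__":
    pipeline = build_pipeline("zh", "en")
    # The string returned here is what would be sent to the head-mounted display.
    print(pipeline.process("银行".encode("utf-8")))
```

In this sketch, supporting another language pair only requires registering a different OCR engine and translator; the pipeline itself stays unchanged, which is the kind of flexible configuration the paragraph describes.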

In summary, the main contribution of this work is a novel wearable device that uses digital camera and wearable computer technologies to enhance vision. The intelligent glasses have the following features:

(1) Translation: the glasses can translate the text in the camera’s field of view into different languages;

(2) Real-time: text extraction, recognition, translation, and display generally take about three seconds in total, fast enough for real-time use;

(3) Wearable: the device is light and easy to use, and thus highly wearable.

The most important and difficult problem in this project is detecting and extracting text from images. We propose a new text detection method, the character intrinsic characteristic-based (CIC-based) text detection algorithm. It performs well even in complex environments; for example, it can extract the correct text string even when the digital camera captures a very blurry image.

Key Investigators: Yangsheng Xu, Xi Shi
Related contents
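  We are developing a pair of glasses that can perform intelligent translation in real time, which is very useful to a tourist who does not know the local language. Imagine a foreigner using the wearable intelligent translating glasses during a trip. The glasses can tell him whether a building is a hotel, a restaurant, a bank, or a self-service market; they can translate bus route information and help him understand road signs and restaurant menus. All of this will make his journey more enjoyable.
  The system consists of a miniature camera, a wearable computer, and a head-mounted display. They capture image information, translate it, and finally display the resulting text. With different optical character recognition (OCR) systems and translation systems, we can configure the system flexibly to translate between different languages.
  These smart glasses have the following features:
  (1) Translation: the glasses can translate text that appears in the camera's field of view;
  (2) Speed: the process of text extraction, recognition, translation, and display takes about three seconds;
  (3) Wearable: the device is light and can be conveniently carried on the body.
  The difficulty of the project lies in accurately extracting text from images. A novel text extraction algorithm based on character features solves this problem well. Even when the camera captures a very blurry image, it can accurately extract the correct text string.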