A Non-Document Text and Display Reader for Visually Impaired Persons
适合视障人士的非文档文本和显示阅读器
基本信息
- 批准号:7446299
- 负责人:
- 金额:$ 42.18万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2008
- 资助国家:美国
- 起止时间:2008-04-01 至 2011-03-31
- 项目状态:已结题
- 来源:
- 关键词:Access to InformationAcousticsAddressAlgorithmsAmericanApplications GrantsAuditoryBlindnessCategoriesCellular PhoneCodeComputer Vision SystemsComputer softwareCosmeticsCountCustomDailyDataDatabasesDevelopmentDevicesEconomicsElectronicsEmploymentEvaluationFeasibility StudiesFigs - dietaryFundingGoalsHandHome environmentHumanImageImage AnalysisImpairmentLavandulaMainstreamingMarketingMemoryModificationPaperPrintingProcessPublic HealthRangeReaderReadingResearchResolutionRunningSchoolsSelf-Help DevicesSpeechStandards of Weights and MeasuresSurveysSystemTelephoneTestingTextTrainingUniversitiesVisionVisitVisualVisual impairmentVisually Impaired PersonsVoiceWorkplacebaseblindconsumer productcostdesigndesireimage processingmicrowave electromagnetic radiationoptical character recognitionpreventprogramsrehabilitation technologyresponsetime intervaltooltraffickingvisual information
项目摘要
DESCRIPTION (provided by applicant): The goal of this project is to develop a computer vision system based on standard camera cell phones to give blind and visually impaired persons the ability to read appliance displays and similar forms of non-document visual information. This ability is increasingly necessary to use everyday appliances such as microwave ovens and DVD players, and to perform many daily activities such as counting paper money. No access to this information is currently afforded by conventional text reading systems such as optical character recognition (OCR), which is intended for reading printed documents. Our proposed software runs on a standard, off-the- shelf camera phone and uses computer vision algorithms to analyze images taken by the user, to detect and read the text within each image, and to then read it aloud using synthesized speech. Preliminary feasibility studies indicate that current cellular phones easily exceed the minimum processing power required for these tasks. Initially, the software will read out three categories of symbols: LED/LCD appliance displays, product or user-defined barcodes, and denominations of paper money. Ultimately these functions will be integrated with other capabilities being developed under separate funding, such as reading a broad range of printed text (including signs), recognizing objects, and analyzing photographs and graphics, etc., all available as free or low-cost software downloads for any cell phone user. Our specific goals are to (1) gather a database of real images taken by blind and visually impaired persons of a variety of LED/LCD appliance displays, barcodes and US paper currency; (2) develop algorithms to process the images and extract the desired information; (3) implement the algorithms on a camera phone; and (4) conduct user testing to establish design parameters and optimize the human interface. PUBLIC HEALTH RELEVANCE: For blind and visually impaired persons, one of the most serious barriers to employment, economic self sufficiency and independence is insufficient access to the ever-increasing variety of devices and appliances in the home, workplace, school or university that incorporate visual LED/LCD displays, and to other types of text and symbolic information hitherto unaddressed by rehabilitation technology. The proposed research would result in an assistive technology system (with zero or minimal cost to users) to provide increased access to such display and non- document text information for the approximately 10 million Americans with significant vision impairments or blindness.
描述(由申请人提供):该项目的目标是开发一种基于标准拍照手机的计算机视觉系统,使盲人和视障人士能够阅读设备显示屏和类似形式的非文档视觉信息。这种能力对于使用微波炉和 DVD 播放器等日常用品以及执行许多日常活动(例如数纸币)越来越有必要。目前,传统的文本阅读系统(例如用于阅读印刷文档的光学字符识别 (OCR))无法提供对此信息的访问。我们提出的软件在标准的现成拍照手机上运行,并使用计算机视觉算法来分析用户拍摄的图像,检测和读取每个图像中的文本,然后使用合成语音大声朗读。初步可行性研究表明,当前的蜂窝电话很容易超出这些任务所需的最低处理能力。最初,该软件将读出三类符号:LED/LCD 设备显示屏、产品或用户定义的条形码以及纸币面额。最终,这些功能将与单独资助下开发的其他功能集成,例如阅读各种印刷文本(包括标志)、识别物体以及分析照片和图形等,所有这些功能都可以作为免费或低成本软件提供任何手机用户均可下载。我们的具体目标是 (1) 收集盲人和视障人士拍摄的各种 LED/LCD 电器显示器、条形码和美国纸币的真实图像数据库; (2)开发算法来处理图像并提取所需信息; (3) 在拍照手机上实现算法; (4) 进行用户测试以建立设计参数并优化人机界面。公共卫生相关性:对于盲人和视障人士来说,就业、经济自给自足和独立的最严重障碍之一是无法充分获得家庭、工作场所、学校或大学中不断增加的各种包含视觉功能的设备和电器。 LED/LCD 显示器,以及迄今为止康复技术尚未解决的其他类型的文本和符号信息。拟议的研究将产生一个辅助技术系统(用户成本为零或最低),为大约 1000 万患有严重视力障碍或失明的美国人提供更多对此类显示和非文档文本信息的访问。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
JAMES M COUGHLAN其他文献
JAMES M COUGHLAN的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('JAMES M COUGHLAN', 18)}}的其他基金
Leveraging Maps and Computer Vision to Support Indoor Navigation for Blind Travelers
利用地图和计算机视觉支持盲人旅行者的室内导航
- 批准号:
9934891 - 财政年份:2019
- 资助金额:
$ 42.18万 - 项目类别:
Leveraging Maps and Computer Vision to Support Indoor Navigation for Blind Travelers
利用地图和计算机视觉支持盲人旅行者的室内导航
- 批准号:
10220178 - 财政年份:2018
- 资助金额:
$ 42.18万 - 项目类别:
Leveraging Maps and Computer Vision to Support Indoor Navigation for Blind Travelers
利用地图和计算机视觉支持盲人旅行者的室内导航
- 批准号:
9899994 - 财政年份:2018
- 资助金额:
$ 42.18万 - 项目类别:
Enabling Audio-Haptic Interaction with Physical Objects for the Visually Impaired
为视障人士提供与物理对象的音频触觉交互
- 批准号:
9238777 - 财政年份:2016
- 资助金额:
$ 42.18万 - 项目类别:
Point and Listen: Augmented Reality Interfaces for the Visually Impaired
指向并聆听:为视障人士提供的增强现实界面
- 批准号:
10540115 - 财政年份:2016
- 资助金额:
$ 42.18万 - 项目类别:
Point and Listen: Augmented Reality Interfaces for the Visually Impaired
指向并聆听:为视障人士提供的增强现实界面
- 批准号:
10839155 - 财政年份:2016
- 资助金额:
$ 42.18万 - 项目类别:
Video-based Speech Enhancement for Vision and Hearing Impairment
针对视力和听力障碍的基于视频的语音增强
- 批准号:
8659442 - 财政年份:2013
- 资助金额:
$ 42.18万 - 项目类别:
A Cell Phone-based Sign Reader for Blind & Visually Impaired Persons
基于手机的盲人标志阅读器
- 批准号:
7373002 - 财政年份:2009
- 资助金额:
$ 42.18万 - 项目类别:
A Cell Phone-based Sign Reader for Blind & Visually Impaired Persons
基于手机的盲人标志阅读器
- 批准号:
7911722 - 财政年份:2009
- 资助金额:
$ 42.18万 - 项目类别:
Providing Access to Appliance Displays for Visually Impaired Users
为视障用户提供对设备显示屏的访问
- 批准号:
8916115 - 财政年份:2008
- 资助金额:
$ 42.18万 - 项目类别:
相似国自然基金
鼓泡床密相区温度、颗粒浓度与气泡分布的二维同步声学双参数成像
- 批准号:62301355
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
声学拓扑安德森绝缘体拓扑特性研究
- 批准号:12304486
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
轨道模式依赖的声学拓扑态及其应用研究
- 批准号:12304492
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
基于深度学习的右心声学造影PFO-RLS和P-RLS智能诊断模型的构建
- 批准号:82302198
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
声学和弹性分层介质反散射问题的理论与数值算法
- 批准号:12371422
- 批准年份:2023
- 资助金额:43.5 万元
- 项目类别:面上项目
相似海外基金
Functional and structural characterization of human auditory cortex using high resolution MRI
使用高分辨率 MRI 表征人类听觉皮层的功能和结构
- 批准号:
10728782 - 财政年份:2023
- 资助金额:
$ 42.18万 - 项目类别:
Randomized controlled trial of a novel digital health solution to enable remote fetal monitoring in high risk pregnancies
新型数字健康解决方案的随机对照试验,可在高风险妊娠中实现远程胎儿监测
- 批准号:
10490447 - 财政年份:2021
- 资助金额:
$ 42.18万 - 项目类别:
Randomized controlled trial of a novel digital health solution to enable remote fetal monitoring in high risk pregnancies
新型数字健康解决方案的随机对照试验,可在高风险妊娠中实现远程胎儿监测
- 批准号:
10242470 - 财政年份:2021
- 资助金额:
$ 42.18万 - 项目类别:
Randomized controlled trial of a novel digital health solution to enable remote fetal monitoring in high risk pregnancies
新型数字健康解决方案的随机对照试验,可在高风险妊娠中实现远程胎儿监测
- 批准号:
10682538 - 财政年份:2021
- 资助金额:
$ 42.18万 - 项目类别:
A Non-Document Text and Display Reader for Visually Impaired Persons
适合视障人士的非文档文本和显示阅读器
- 批准号:
7799708 - 财政年份:2008
- 资助金额:
$ 42.18万 - 项目类别: