Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Feedback-Driven Refinement of Mandarin Speech Recognition Result based on Lattice Modification and Rescoring

Xiangdong Wang, Yang Yang, Hong Liu, Yueliang Qian, Duan Jia

Source Title: International Journal of Advanced Pervasive and Ubiquitous Computing (IJAPUC) 9(2)

DOI: 10.4018/IJAPUC.2017040104

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

In real world applications of speech recognition, recognition errors are inevitable, and manual correction is necessary. This paper presents an approach for the refinement of Mandarin speech recognition result by exploiting user feedback. An interface incorporating character-based candidate lists and feedback-driven updating of the candidate lists is introduced. For dynamic updating of candidate lists, a novel method based on lattice modification and rescoring is proposed. By adding words with similar pronunciations to the candidates next to the corrected character into the lattice and then performing rescoring on the modified lattice, the proposed method can improve the accuracy of the candidate lists even if the correct characters are not in the original lattice, with much lower computational cost than that of the speech re-recognition methods. Experimental results show that the proposed method can reduce 24.03% of user inputs and improve average candidate rank by 25.31%.

Article Preview

Top

1. Introduction

In recent years, considerable progress has been made in automatic speech recognition (ASR) technology, and applications such as speech assistants and speech input systems are becoming popular. However, in the state-of-art systems, recognition errors remain inevitable, due to environmental noise, accent, specific domain or topic, etc. In many cases, only a few errors can change the meaning of the sentence completely, which greatly affect the user's experience and the feasibility of the ASR technology.

To improve the feasibility of ASR systems, some researchers try to incorporate human-computer interaction technologies into ASR systems and allow the user to provide feedbacks (such as verification and correction) of the recognition results through human friendly interfaces. Many interaction methods for user feedback have been proposed, such as multi-modal interaction combining keyboard, re-speaking and handwriting (Oviatt, Cohen, et al., 2000) and candidate list (also known as alternative list) selection (Ogata and Goto, 2005; Nanjo, Akita and Kawahara, 2006; Cardinal, Boulianne, et al. 2007; Vertanen and Kristensson, 2011). In recent years, word candidate list has become the most popular interface for user feedback. In the interface, a candidate list is provided for each word in the recognition result, and when the 1-best result (namely, the top-1 candidate) is not correct, the error may be corrected by selecting candidate words in the candidate list. This correction method is user friendly and can greatly improve the efficiency of error correction.

For the generation of candidate lists, word confusion network (CN) [Xue and Zhao, 2005] extracted from the N-best lattice is widely used for languages such as English [Vertanen and Kristensson, 2011] and Japanese [Ogata and Goto, 2005]. For an utterance, a sequence of candidate lists can be obtained directly from the CN, with each candidate list providing alternative words (if any) besides the top-1 word. However, for the Chinese language, the word CN is not the best choice. In Chinese, words are formed by characters and most characters can be words by themselves while they are also included in multi-character words. Therefore, in candidate lists obtained from the word CN, a character may be repeatedly included in different candidate words in a candidate list or even in different candidate lists. This makes the candidate lists redundant and sometimes confusing to the user. To solve this problem, in our earlier work, candidate lists based on Chinese characters is introduced and a method for generation of the candidate lists is proposed [Li, Wang, et al., 2009]. In the candidate lists generated, each candidate is a Chinese character, and characters competing for each other is organized in one candidate list. This makes the interface present more information with limited candidates and be much friendlier to Chinese users.

Complete Article List

Search this Journal:

Reset

Open Access Articles: Forthcoming

Volume 11: 4 Issues (2019)

Volume 10: 4 Issues (2018)

Volume 9: 4 Issues (2017)

Volume 8: 4 Issues (2016)

Volume 7: 4 Issues (2015)

Volume 6: 4 Issues (2014)

Volume 5: 4 Issues (2013)

Volume 4: 4 Issues (2012)

Volume 3: 4 Issues (2011)

Volume 2: 4 Issues (2010)

Volume 1: 4 Issues (2009)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

Feedback-Driven Refinement of Mandarin Speech Recognition Result based on Lattice Modification and Rescoring

Abstract

1. Introduction

Complete Article List