Modified K-Nearest Neighbour Using Proposed Similarity Fuzzy Measure for Missing Data Imputation on Medical Datasets (MKNNMBI)

B. Mathura Bai, Mangathayaru N., Padmaja Rani B.

Source Title: International Journal of Fuzzy System Applications (IJFSA) 11(3)

DOI: 10.4018/IJFSA.306278

Abstract

Early disease diagnosis is a pressing problem in the health sector, the medical domain, and disease management. During analysis, data quality can be achieved only if the data are complete. Missing values reduce the efficiency of data analysis tasks. Researchers have proposed various imputation methods, but there has always been a need for a better one. The objective of this paper is to propose an imputation method, called Modified k-Nearest Neighbour for imputation of missing data (MKNNMBI), that uses a proposed similarity fuzzy measure to impute missing values by finding the k most similar instances. The proposed method outperformed the existing imputation methods MV EM, MV BPCA, MV Ignore, MV KMeans, MV FKMeans, MV KNN, MV MC, MV WKNNimpute, MV SVDimpute, MV SVMimpute, and CBC-IM-FUZZY. These imputation methods were studied on different benchmark datasets and tested with different classifiers, such as C4.5, SVM, kNN, and NB; the proposed method was found to yield accurate imputation and to improve classification accuracy.

1. Introduction

Decision making must be accurate, especially in the medical and health sectors. A critical decision-making system needs complete information; when information is missing, the system degrades because decisions are misinterpreted. To handle missing values, (Khan et al., 2013) proposed a medical decision system. Nowadays, sophisticated applications are widely used in all fields, and they collect huge quantities of data on a daily basis. Storing, analysing, and mining such big data requires computational intelligence techniques and data science analysis tools. The authors of (Fernandez-Delgado et al., 2014) conducted exhaustive experiments on different datasets with various classifiers, using data analysis tools such as R, Weka, C, and Matlab. The performance of such tools is affected by various issues, and data analysts need to pay more attention to these challenges to achieve better analysis. The challenges that commonly occur during data analysis and machine learning tasks are explored in (Zhang et al., 2003) (Bai et al., 2015) (Zhu & Li, 2016) (Li & Ren, 2015). One of the most important of these challenges is missing values, which pose a hidden and unpredictable problem that needs to be addressed. Missing values (Allison, 2001) are inevitable in real-world data collected from different application domains. These applications use data mining and machine learning techniques to either impute or ignore such values. Possible causes of missing values include faulty devices, human error, inaccurate or inconsistent entries, inadequate measurements, and unanswered sensitive questions in surveys. Such missing values lead to biased decisions and reduce prediction accuracy. Incorrect data analysis or decisions may have severe consequences in the medical domain and health sector (Gomila & Clark, 2020) (Stiglic et al., 2019), in various financial applications, and elsewhere.
There are two possible ways to deal with missing values. The first is to ignore instances that have missing data. The second, called parameter estimation, is to replace the missing data with approximate values so that correct decisions can be made. Ignoring instances with missing values is a simple and often-used method, but it reduces the amount of data and thus affects the learning process. The Missing Values-Ignore method degrades the performance of the prediction model and leads to inaccurate decisions (Stiglic et al., 2019). Parameter estimation methods, also called model-based imputation methods, such as the EM algorithm, are sensitive to outliers. The best alternative would be to impute the missing values using a Machine Learning (ML) based method (Lakshminarayan et al., 1999) (Little & Rubin, 2019). This process is treated as a data cleaning task in the pre-processing phase of data analysis and machine learning. The process of inferring a missing value from the existing data is called missing data imputation (Myrtveit et al., 2001). Most data mining and machine learning algorithms need a complete dataset for knowledge extraction, pattern recognition, and decision making. Researchers have proposed various missing data imputation methods for data analysis tasks such as classification, regression, and clustering.
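
To make the contrast between ignoring incomplete instances and imputing them concrete, the following is a minimal sketch in Python. It is not the MKNNMBI method: the proposed similarity fuzzy measure is not given in this preview, so a plain Euclidean distance over the features observed in both instances is used as a stand-in. The function names (drop_incomplete, partial_distance, knn_impute) and the toy data are hypothetical and chosen only for illustration.

```python
import math


def drop_incomplete(rows):
    """'Ignore' strategy: keep only the rows that have no missing (None) value."""
    return [row for row in rows if all(v is not None for v in row)]


def partial_distance(a, b):
    """Euclidean distance over the features observed in both rows.

    Stand-in for a similarity measure (an assumption, not the paper's fuzzy
    measure). The squared sum is rescaled by total/shared feature count so
    rows compared on fewer features are not favoured. Returns infinity when
    the two rows share no observed feature.
    """
    shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    if not shared:
        return math.inf
    squared = sum((x - y) ** 2 for x, y in shared)
    return math.sqrt(squared * len(a) / len(shared))


def knn_impute(rows, k=2):
    """Fill each missing value with the mean of that feature over the k
    nearest rows in which the feature is observed."""
    imputed = [list(row) for row in rows]
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if value is not None:
                continue
            donors = []
            for m, other in enumerate(rows):
                if m == i or other[j] is None:
                    continue
                d = partial_distance(row, other)
                if d != math.inf:
                    donors.append((d, other[j]))
            donors.sort(key=lambda pair: pair[0])
            nearest = donors[:k]
            if nearest:  # leave the gap if no usable neighbour exists
                imputed[i][j] = sum(v for _, v in nearest) / len(nearest)
    return imputed


# Toy data (hypothetical): two loose groups of instances, None marks a gap.
data = [
    [1.0, 2.0, 3.0],
    [1.1, None, 3.1],
    [5.0, 6.0, 7.0],
    [0.9, 2.2, 2.9],
    [5.2, 6.1, None],
    [4.9, 5.9, 6.8],
]

complete_only = drop_incomplete(data)
filled = knn_impute(data, k=2)
print(len(data), len(complete_only))
print(filled[1], filled[4])
```

On this toy data, ignoring incomplete instances keeps four of the six rows, whereas the k-nearest-neighbour sketch keeps all six and fills the two gaps with values of roughly 2.1 and 6.9, taken from the two most similar instances in each case. Replacing partial_distance with a different similarity measure changes which neighbours are selected without altering the rest of the procedure.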
