Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Big Data and Healthcare Data: A Survey

Bikash Kanti Sarkar

Source Title: International Journal of Knowledge-Based Organizations (IJKBO) 7(4)

DOI: 10.4018/IJKBO.2017100104

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Big data and its analytics yield a lot of opportunities to make great progresses in many fields, ranging from economic and business activities to public administration, from national security to scientific researches and so on. However, the most noticeable point is that healthcare data has been recently identified as a prime example of big data. Undoubtedly, efficient use of healthcare resources has become a key factor in improving overall healthcare system. But for managing healthcare data and obtaining potential results, we need integration and sharing of data that ultimately demand the concept of distributed system. The paper in its first phase gives an overview on big data and healthcare data from different aspects. A review on the state-of-the-art distributed file system (Hadoop) is conducted in this stage too. The primary aim of this phase is to provide an overall picture on big data as well as healthcare data for non-expert readers. In the next phase, a cloud-based e-health system is proposed for the expert audiences. The expected promising characteristics as well as the managerial implications of the model are highlighted in the analysis section.

Article Preview

Top

Introduction

Truly speaking, the concept of ‘big data’ is not new; however, the way it is defined is constantly changing. In practice, a data set is called ‘big’ if it ranges from a few terabytes (1TB = 2⁴⁰ bytes) to many petabytes (1PB = 2⁵⁰ bytes) but the term ‘big data’ technically implies that the generation rate is unprecedented. According to Philip and Zhang (2014), the speed of data-growth has already exceeded Moore’s law. In this context, IBM has reported that 90% of the data created in the world has been produced in the last two years (IBM, 2011). More than 267 million transactions are produced per day over 6 thousand stores of Wal-Mart. Till 2011, almost 3 terabytes of data are collected by the US Library of Congress. According to Bakshi and Kapil (2012), the size of digital data in 2011 was roughly 1.8 zettabytes (1.8x10²¹ bytes) and the supporting network infrastructure has to manage 50 times more information by year 2020. Obviously, the real concern is storing and processing of data. In other words, data sets are growing so rapidly that storing and processing them using traditional data management tools or applications (e.g., DBMS) become very difficult. However, the notable matter is that huge amount of data with variation significantly assist the researchers to frame effective predictive models by finding useful patterns through qualitative data analysis. In fact, decisions that were previously based on guesswork or on painstakingly constructed models of reality, can now be made based on the available data itself. Certainly, we were lacking it earlier.

Big data phenomenon is an emerging area of research at the present date. The field indeed belongs to Data Science and Data Engineering. It is interesting to note that a study on the evolution of big data (a research and scientific topic) claims that the term was found in 1970s but it has been published in 2008 (Gali & Henk, 2012). For defining big data, there exist various forms of V’s such as 3V’s, 4V’s, 5 V’s, etc. For example, Laney (2011) used volume, velocity and variety, known as 3V’s to characterize the concept of big data, whereas people often extend another ‘V’ as per their special requirements. According to Zikopoulos and Chris (2011), the fourth ‘V’ can be value, variability or virtual. Thus, big data is conceptually described by the features, like volume, velocity, variety, etc. (i.e., in terms of V’s) and most of us agree that big data should have four standard characteristics (called as 4V’s namely volume, variety, velocity and veracity) as suggested by IBM (Kevin, 2013). More details on big data definition are unfolded in the immediate next section.

At the present date, the concept of big data yields a lot of progressive opportunities in many fields such as Information Technology, customer care service, online transaction, web-data applications, risk management, astronomy, healthcare system, etc.

In healthcare system, the information stored in health database has been increased over the last ten years, leading it to be considered ‘big data’. According to Raghupathi(2010), this industry has historically generated huge amount of data driven by record-keeping and patient-care. A number of surveys and journal articles describe that the massive quantity of health data holds the promise of supporting a wide range of medical and healthcare services, including clinical decision support, sensor based health condition and food safety monitoring, disease surveillance and population health management (Dembosky, 2012; Feldman, Martin, & Skotnes, 2012; Fernandes, O’Çonnor, & Weaver, 2012). For evidence, Cancer is still a major challenge in research and medical-world despite enormous advances in technology. The reason is, analysis of cancer requires petabytes of data to be gathered from various dispersed clinical and analytical databases with high dimensional scaling and interpretation to identify the state of the disease and the survival potential of the patient (Dutta & Bilbao, 2012). Further, the use of information technology on healthcare data can reduce the cost of healthcare while improving its quality by emphasizing more on preventive and personalized care and basing of continuous monitoring (CCC, 2011). In this context, an estimation of savings of $300 Billion every year in the US alone is reported in a study (James, Michael, Brad, Jacques, Richard, Charles, & Hung, 2012). According to the report prepared by IBM (2013), e-health system supported in U.S. reduces a big amount of healthcare spending and improves healthcare process to a great extent. In this regard, two evidences collected from this report are included in the Appendix.

Complete Article List

Search this Journal:

Reset

Volume 14: 1 Issue (2024): Forthcoming, Available for Pre-Order

Volume 13: 1 Issue (2023)

Volume 12: 4 Issues (2022): 3 Released, 1 Forthcoming

Volume 11: 4 Issues (2021)

Volume 10: 4 Issues (2020)

Volume 9: 4 Issues (2019)

Volume 8: 4 Issues (2018)

Volume 7: 4 Issues (2017)

Volume 6: 4 Issues (2016)

Volume 5: 4 Issues (2015)

Volume 4: 4 Issues (2014)

Volume 3: 4 Issues (2013)

Volume 2: 4 Issues (2012)

Volume 1: 4 Issues (2011)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

Big Data and Healthcare Data: A Survey

Abstract

Introduction

Complete Article List