Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Big Data: Techniques, Tools, and Technologies – NoSQL Database

Vinod Kumar, Ramjeevan Singh Thakur

Source Title: Pattern and Data Analysis in Healthcare Settings

DOI: 10.4018/978-1-5225-0536-5.ch009

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

With every passing day, data generation is increasing exponentially, its volume, variety, velocity are making it quite challenging to analyze, interpret, visualize for gaining the greater insights from the available data. Billions of networked sensors are being embedded in devices such as smart phones, automobiles, social media sites, laptop, PC's and industrial machines etc. that operates, generate and communicate data. Thus, the data obtained from various resources exists in structured, semi-structured and unstructured form. The traditional database system is not suitable to handle these data formats. Therefore, new tools and techniques are developed to work with these data. NoSQL is one of them. Currently, many NoSQL database are available in the market, each one of them specially designed to solve specific type of data handling problems, most of the NoSQL databases are developed with special attention to problem of business organizations and enterprises. The chapter focuses various aspects of NoSQL as tool for handling the big data.

Chapter Preview

Top

Introduction

Big Data analysis involves making “sense” (Oracle White Paper, 2013) out of huge amount of varied data that are in its raw format. Today, technologies of storage devices have attained a great height in storage capacity. The organizations are capable to store the data of big data Category. But only storage of big data is not enough for any organization. The benefit lies in getting the valuable insights from the stored data to help in making planning, decisions and other organization’s strategies. The valuable information can only be obtained by analysis of big data (Katal A., et al, 2013). Moreover, the data comes from variety of resources such as smart phones, sensor embedded machines, social media sights, IT log Files, Web servers, email servers etc. Due to the large volume, Variety, Velocity, Variability and Complexity (Katal A., et al, 2013) of Big Data, the analysis task becomes very challenging and difficult. Thus, it requires advanced tools, techniques and specialized data analysis skills over the traditional tools and methods.

It has been obviously noticed that Relational Data Base Management Systems (RDBMS) can no longer handle all the data management issues posed by many currently running application software. It is mostly because of:

1.
The large, and constantly increasing, amount of data needed to be stored by many companies/enterprises/diverse organizations.
2.
The tremendous query workload needed to access and analyze these data.
3.
The need of flexibility in the database schema.

With the arrival of Web 2.0 service, the amount of data managed by large-scale web services has grown exponentially, posing new challenges and infrastructure requirements. This has led to new programming paradigms and architectural choices, such as map-reduce and NoSQL databases/data stores, which make up two of the key peculiarities of the specialized extremely, distributed systems well-known as Big Data architectures. The basic computer infrastructures generally encounter complexity requirements, resulting from the need for efficiency and speed in processing over vast surfacing data sets. This is made possible by taking benefits from the features of new technologies, such as the automatic scaling and replica provisioning of Cloud environments. Although performance is the main issue for the considered applications. Precisely, NoSQL (Olivier Cure, et al. 2011) databases are the next generation databases mostly dealt with some of the points: being non-relational, distributed, open-source and horizontally scalable.

Top

Hadoop Distributed File System (Hdfs)

•
Hadoop: Hadoop (White, 2010) is an Apache Open Source software framework for working with big data. It was created by Doug cutting and Michael J. Cafarella. Doug who was working at yahoo at that time, it named after his son’s toy elephant. It was derived from Google technology and put to practice by yahoo and others. But big data is too varied and complex for a one-size-fits-all solution. While Hadoop has surely captured the greatest name, recognition, it is just one of three classes of technologies well suited to storing and managing big data. Hadoop is software framework which means it includes a number of components that were specifically designed to solve large-scale distributed data storage, analysis and retrieval tasks.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

Big Data: Techniques, Tools, and Technologies – NoSQL Database

Abstract

Introduction

Hadoop Distributed File System (Hdfs)

Complete Chapter List