Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
11 views4 pages

Class 12 Introduction To Big Data Analytics

Uploaded by

LEO MERCY
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views4 pages

Class 12 Introduction To Big Data Analytics

Uploaded by

LEO MERCY
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

CLASS XII ARTIFICIAL INTELLIGENCE (843)

Introduction to Big Data and Data Analytics


Q. 1. What is Big Data, and how does it differ from small data?
Ans. Big Data refers to vast, complex datasets that traditional database systems cannot handle due to their size,
speed, or structure. Unlike small data, which is manageable and easily understood, Big Data requires
specialized tools and techniques for analysis. It often includes transactional data, machine-generated data, and
social media data, making it harder to analyze using conventional methods. The large scale and complexity of
Big Data make it a significant resource for businesses and researchers looking to extract valuable insights.

Q. 2. What are the main types of Big Data, and how do they differ from each other?
Ans. The main types of Big Data are:
a. Structured Data b. Semi Structured Data c. Unstructured Data
On the basis of different criterias difference between these types of data is as follows -

Q. 3. What Describe the advantages and disadvantages of using Big Data.


Ans. Advantages:
 Enhanced decision-making: Big Data enables organizations to make informed, data-driven decisions
based on vast datasets.
 Improved efficiency: By analyzing large volumes of data, businesses can optimize operations and reduce
inefficiencies.
 Better customer insights: Big Data allows for the personalization of services and targeted marketing.
 Competitive advantage: Organizations can uncover trends and predict future outcomes.
Disadvantages:
 Privacy and security concerns: The collection and analysis of personal data raise ethical issues and data
protection risks.
 Data quality issues: Ensuring the accuracy and consistency of Big Data is challenging, as it often includes
unstructured and heterogeneous data.
 Technical complexity: Big Data requires specialized infrastructure, tools, and expertise, which can be
costly and resource-intensive.
 Compliance challenges: Companies must adhere to regulations such as GDPR, which can be complex and
costly to implement.

Q.4. What are the 6V’s of Big Data, and how do they provide a more holistic view of Big Data?
Ans. The “characteristics of Big Data” refer to the defining attributes that distinguish large and complex
datasets from traditional data sources. These characteristics are commonly described using the “3Vs”
framework: Volume, Velocity, and Variety. The 6Vs framework provides a holistic view of Big Data,
emphasizing not only its volume, velocity, and variety but also its veracity, variability, and value.
 Volume: The sheer amount of data generated every day.
 Velocity: The speed at which data is produced and needs to be processed.
 Variety: The different types of data (structured, semi-structured, unstructured).
 Veracity: The accuracy and trustworthiness of the data, ensuring its suitability for analysis.
 Value: The insights and benefits derived from analyzing the data.
 Variability: The inconsistencies or unpredictability in data flows, requiring systems to adapt.
Together, these dimensions highlight the complexity of managing and analyzing Big Data effectively.

Q.5. What do you understand by Big Data Analytics? How does it emerge?
Ans. Data analytics involves analysing datasets to uncover insights, trends, and patterns. Technologies
commonly used in data analytics include statistical analysis software, data visualisation tools, and relational
database management systems (RDBMS). Big data analytics uses advanced analytic techniques against huge,
diverse datasets that include structured, semi-structured, and unstructured data from different sources and in
various sizes from terabytes to zettabytes.
Big Data Analytics emerges as a consequence of four significant global trends:
1. Moore’s Law: Moore’s Law has enabled the handling and analysis of massive datasets, driving the
evolution of Big Data Analytics.
2. Mobile Computing: Due to smartphones and mobile devices, the vast amount of real-time data can be
collected from anywhere.
3. Social Networking: Social media platforms use data sharing, interactions, and user-generated content
for collecting massive datasets, which helps to analyse the data easily.
4. Cloud Computing: Cloud computing allows organisations to access hardware and software resources
remotely via the Internet, which reduces the investments in software and hardware.

Q.6. How does Big Data Analytics work, and what are its key components?
Ans. Big Data Analytics involves collecting, storing, cleaning, processing, and analyzing large datasets to
uncover insights and trends. Key components include:
 Data collection: Gathering data from various sources (social media, sensors, transactions).
 Data storage: Using distributed storage solutions like Hadoop or cloud-based storage systems.
 Data processing: Cleaning and organizing data to make it ready for analysis.
 Data analysis: Applying techniques such as machine learning, predictive analytics, and statistical modeling
to extract insights.
 Data visualization: Presenting the findings in a visual format, such as graphs and charts, to help
stakeholders make informed decisions.

Q.7. What is Data Stream Mining, and how is it used?


Ans. A data stream is a continuous, real-time flow of data generated by various sources like sensors, satellite
images, the internet, and web traffic, etc. Mining data streams refers to the process of extracting meaningful
patterns, trends, and knowledge from a continuous flow of real-time data. For instance, a sudden spike in
searches for “election results” on a particular day might indicate that elections were recently held in a region or
highlight the level of public interest in the results.

Q.8. What are the future trends in Big Data Analytics?


Ans. Future trends include:
 Real-time analytics: With the increasing speed of data generation, real-time analytics will allow businesses
to make immediate decisions based on live data.
 AI and Machine Learning integration: These technologies will continue to enhance predictive analytics,
enabling more accurate forecasts.
 Quantum computing: This promises to accelerate data processing and enable the analysis of even larger
datasets.
 Data privacy regulations: As Big Data usage grows, more stringent regulations will be implemented to
protect user privacy and ensure ethical data practices.

Q.9. What tools are commonly used in Big Data analytics, and how do they help?
Ans. Common tools include:
 Hadoop: A framework for storing and processing large datasets using a distributed file system.
 Tableau: A data visualization tool that helps present Big Data insights through graphs and charts.
 R and Python: Programming languages widely used for statistical analysis and machine learning in Big
Data.
 Spark: A data processing engine that supports real-time analytics and machine learning.
These tools enable businesses to handle, process, analyze, and visualize Big Data effectively.

Q.10. What are some common types of Big Data Analytics?


Ans. The common types of Big Data Analytics include:
 Descriptive Analytics: Summarizes past data to identify patterns and trends.
 Diagnostic Analytics: Analyzes historical data to understand the causes behind specific outcomes.
 Predictive Analytics: Uses historical data to forecast future trends or events.
 Prescriptive Analytics: Recommends actions based on data insights to achieve desired outcomes.
What are the ethical concerns associated with Big Data?
Answer: Ethical concerns around Big Data include:
 Privacy issues: Collecting personal data raises concerns about how that data is used and whether it is adequately protected.
 Data misuse: There is a risk of using data for purposes other than what it was intended for, such as targeting vulnerable
populations for marketing or surveillance.
 Bias and discrimination: Algorithms based on biased data may result in discriminatory practices, such as denying certain groups
access to services or opportunities.

How does “Batch Processing” differ from “Stream Processing” in Big Data analytics?
Answer:
 Batch Processing involves collecting large amounts of data and processing them in blocks or batches over time. This method is
more suitable for less time-sensitive tasks like analyzing historical data.
 Stream Processing, on the other hand, processes data in real-time or near-real-time as it is generated. This is used in scenarios
where immediate analysis is required, such as monitoring live data streams from sensors or social media feeds.

What role do Machine Learning algorithms play in Big Data analytics?


Answer: Machine Learning algorithms are essential for analyzing large datasets by automatically detecting patterns and making
predictions. In Big Data analytics, they can be used for classification, clustering, regression, and anomaly detection. These algorithms
enable businesses to forecast trends, personalize customer experiences, and detect fraud, among other tasks. They can also learn
from new data over time, improving accuracy and efficiency.

What is the significance of “Cloud Computing” in Big Data Analytics?


Answer: Cloud computing provides scalable and cost-effective infrastructure for storing and processing Big Data. With cloud services,
organizations can access powerful computing resources on-demand, without the need for large upfront investments in hardware. This
allows businesses to analyze vast datasets quickly and efficiently, while also enabling collaboration across multiple locations.
Additionally, cloud-based tools offer flexibility, security, and reliability for Big Data analytics.

How can Big Data be used in the healthcare industry?


Answer: Big Data analytics in healthcare can be used to predict disease outbreaks, personalize patient care, and improve medical
research. For example, predictive analytics can help hospitals forecast patient admissions and optimize resource allocation. By
analyzing patient data, doctors can offer personalized treatments, improving outcomes. Big Data can also help detect fraud and
improve drug development by identifying trends in clinical trial data.

What challenges are associated with the “Veracity” of Big Data?


Answer: Veracity in Big Data refers to the trustworthiness, quality, and accuracy of the data. Challenges include:
 Data inconsistencies: Large volumes of data may contain errors, duplications, or missing values, making it difficult to trust the
insights derived from them.
 Data bias: Inaccurate or biased data sources can lead to misleading conclusions.
 Data cleaning issues: Ensuring that data is properly cleaned and formatted is a time-consuming process, especially when dealing
with unstructured data.

You might also like