Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
29 views6 pages

Chapter 1 Bigdata Introduction Questions Answers

The document contains multiple-choice, single-choice, and true/false questions related to big data concepts, technologies, challenges, and applications. It covers topics such as the 4 Vs of big data, computing modes, data types, and the role of various technologies in big data processing. Additionally, it addresses the implications of big data on industries and the importance of data management strategies.

Uploaded by

Mahmoud Ibrahim
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
29 views6 pages

Chapter 1 Bigdata Introduction Questions Answers

The document contains multiple-choice, single-choice, and true/false questions related to big data concepts, technologies, challenges, and applications. It covers topics such as the 4 Vs of big data, computing modes, data types, and the role of various technologies in big data processing. Additionally, it addresses the implications of big data on industries and the importance of data management strategies.

Uploaded by

Mahmoud Ibrahim
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

Multiple-Choice Questions (Select multiple correct answers):

1. Which of the following are considered "4 Vs" of big data?


○ A. Volume
○ B. Velocity
○ C. Veracity
○ D. Variety
○ E. Variability
2. What are the major computing modes of big data applications?
○ A. Batch computing
○ B. Stream computing
○ C. Graph computing
○ D. Query and analysis computing
3. What are some challenges faced by enterprises in the big data era?
○ A. Data silos
○ B. Data security
○ C. Indefinite big data requirements
○ D. High processing power requirements
4. Which countries have implemented national big data strategies?
○ A. China
○ B. United States
○ C. United Kingdom
○ D. Australia
5. In the context of big data processing, which of the following are characteristics of
distributed databases?
○ A. Scalability
○ B. Fault tolerance
○ C. Low cost
○ D. Performance improvement
6. Which of the following technologies are used for big data stream computing?
○ A. Flink
○ B. Hadoop
○ C. Spark Streaming
○ D. Storm
7. Which of the following are advantages of Kunpeng Computing?
○ A. High computing efficiency
○ B. Open ecosystem
○ C. High scalability
○ D. High cost
8. Which of the following big data applications involve unstructured data?
○ A. Audio & video analysis
○ B. Financial transaction records
○ C. Social media posts
○ D. SQL databases
9. Which of the following are examples of I/O-intensive tasks?
○ A. Web applications
○ B. Image rendering
○ C. Network I/O
○ D. Log analysis
10. What are some opportunities in the big data era?
○ A. Data mining as a core business analysis tool
○ B. Growth in the information industry
○ C. Expensive processing infrastructure
○ D. Integration with next-gen technologies like IoT
11. Which fields are highly dependent on big data analysis?
○ A. Telecom
○ B. Healthcare
○ C. Retail
○ D. Government
12. What are some of the key features of cloud-based big data solutions?
○ A. Decoupled storage and computing
○ B. Single data source management
○ C. Ultimate scalability
○ D. Low data durability
13. Which are common data types encountered in the big data era?
○ A. Structured data
○ B. Semi-structured data
○ C. Unstructured data
○ D. Manual data
14. What are some major challenges in traditional data processing methods?
○ A. Costly storage
○ B. Limited scalability
○ C. Complexity in analyzing structured data
○ D. Lack of automation
15. Which of the following are examples of semi-structured data?
○ A. JSON files
○ B. SQL databases
○ C. Emails
○ D. XML documents

Single-Choice Questions (Select one correct answer):

1. Which V of big data refers to the speed at which data is generated and processed?
○ A. Volume
○ B. Variety
○ C. Velocity
○ D. Veracity
2. Which computing mode focuses on real-time data processing?
○ A. Batch computing
○ B. Stream computing
○ C. Graph computing
○ D. Query computing
3. What is the primary function of MapReduce in big data applications?
○ A. Real-time data processing
○ B. Batch data processing
○ C. Graph data analysis
○ D. Query processing
4. What is the focus of big data technologies in modern business?
○ A. Data storage and transmission
○ B. Data mining and application
○ C. Data collection
○ D. Data security
5. What is the key challenge faced by enterprises when dealing with big data?
○ A. Lack of talent
○ B. Low cost of storage
○ C. Easy data availability
○ D. Lack of computing tools
6. What is the estimated scale of China's big data industry by 2025?
○ A. 1 trillion CNY
○ B. 3 trillion CNY
○ C. 5 trillion CNY
○ D. 2 trillion CNY
7. What is the core component of the Kunpeng Big Data Solution?
○ A. Data mining tools
○ B. High-performance computing power
○ C. Low-value density data
○ D. Relational databases
8. Which type of data processing involves relationships and connections in data?
○ A. Batch computing
○ B. Stream computing
○ C. Graph computing
○ D. Query computing
9. What is the primary goal of CPU-intensive tasks in big data applications?
○ A. Maximize I/O operations
○ B. Efficient use of CPU resources
○ C. Minimize data storage
○ D. Reduce energy consumption
10. What is the main function of a distributed database?
○ A. Centralize all data in one server
○ B. Divide data across multiple servers for efficient access
○ C. Store data in a single location
○ D. Use structured data only
11. Which technology in big data is responsible for low-latency real-time analytics?
○ A. Spark Streaming
○ B. MapReduce
○ C. Flink
○ D. Hive
12. What does the "value" aspect of the 5Vs of big data focus on?
○ A. The usefulness of data
○ B. The speed of data generation
○ C. The accuracy of data
○ D. The size of the data
13. Which sector benefits from big data for user behavior analysis and targeted advertising?
○ A. Finance
○ B. Telecom
○ C. Retail
○ D. Manufacturing
14. Which data type is the hardest to process in traditional systems?
○ A. Structured data
○ B. Semi-structured data
○ C. Unstructured data
○ D. Quantitative data
15. Which application uses big data for better energy management and CO2 reduction?
○ A. Telecom networks
○ B. Clean energy systems
○ C. Retail supply chains
○ D. Financial institutions

True/False Questions:

1. The "veracity" of big data refers to the speed at which data is generated.
○ True / False
2. Stream computing handles data in real-time without waiting for large datasets to
accumulate.
○ True / False
3. Distributed databases store data in a single location to ensure quick access.
○ True / False
4. Big data applications are primarily used in industries such as healthcare, telecom, and
government.
○ True / False
5. MapReduce is used in graph computing for real-time analysis of network structures.
○ True / False
6. Semi-structured data combines elements of both structured and unstructured data, such
as JSON files.
○ True / False
7. Kunpeng computing focuses on developing customized processors for gaming
applications only.
○ True / False
8. Batch computing is ideal for real-time data processing and analytics.
○ True / False
9. The main challenge in the big data era is the lack of proper data storage solutions.
○ True / False
10. The Kunpeng Big Data Solution is a secure and scalable solution designed for intelligent
big data construction.
○ True / False
11. Flink and Storm are technologies used in stream computing for big data processing.
○ True / False
12. I/O-intensive tasks in big data computing primarily utilize CPU resources.
○ True / False
13. The arrival of the intelligent era is marked by the rise of technologies such as cloud, big
data, and AI.
○ True / False
14. The boom in big data is driven by the increasing amount of data generated by connected
devices and systems.
○ True / False
15. Data silos in organizations result in fragmented data that cannot be easily shared or
utilized.
○ True / False
True/False Questions:

1. The "veracity" of big data refers to the speed at which data is generated.
○ False (Veracity refers to the accuracy and trust in data.)
2. Stream computing handles data in real-time without waiting for large datasets to
accumulate.
○ True
3. Distributed databases store data in a single location to ensure quick access.
○ False (Distributed databases are spread across multiple locations.)
4. Big data applications are primarily used in industries such as healthcare, telecom,
and government.
○ True
5. MapReduce is used in graph computing for real-time analysis of network
structures.
○ False (MapReduce is used in batch processing.)
6. Semi-structured data combines elements of both structured and unstructured
data, such as JSON files.
○ True
7. Kunpeng computing focuses on developing customized processors for gaming
applications only.
○ False (Kunpeng computing is used for a variety of applications.)
8. Batch computing is ideal for real-time data processing and analytics.
○ False (Batch computing processes large volumes of data at scheduled intervals.)
9. The main challenge in the big data era is the lack of proper data storage solutions.
○ False (There are other challenges like data silos, security, and talent shortages.)
10. The Kunpeng Big Data Solution is a secure and scalable solution designed for
intelligent big data construction.
○ True
11. Flink and Storm are technologies used in stream computing for big data
processing.
○ True
12. I/O-intensive tasks in big data computing primarily utilize CPU resources.
○ False (I/O-intensive tasks involve more waiting for I/O operations than CPU
usage.)
13. The arrival of the intelligent era is marked by the rise of technologies such as
cloud, big data, and AI.
○ True
14. The boom in big data is driven by the increasing amount of data generated by
connected devices and systems.
○ True
15. Data silos in organizations result in fragmented data that cannot be easily shared
or utilized.
○ True

You might also like