Multiple-Choice Questions (Select all correct answers):
1. Which of the following are among the "4 Vs" of big data?
○ A. Volume
○ B. Velocity
○ C. Veracity
○ D. Variety
○ E. Variability
2. What are the major computing modes of big data applications?
○ A. Batch computing
○ B. Stream computing
○ C. Graph computing
○ D. Query and analysis computing
3. What are some challenges faced by enterprises in the big data era?
○ A. Data silos
○ B. Data security
○ C. Indefinite big data requirements
○ D. High processing power requirements
4. Which countries have implemented national big data strategies?
○ A. China
○ B. United States
○ C. United Kingdom
○ D. Australia
5. In the context of big data processing, which of the following are characteristics of distributed databases?
○ A. Scalability
○ B. Fault tolerance
○ C. Low cost
○ D. Performance improvement
6. Which of the following technologies are used for big data stream computing?
○ A. Flink
○ B. Hadoop
○ C. Spark Streaming
○ D. Storm
7. Which of the following are advantages of Kunpeng Computing?
○ A. High computing efficiency
○ B. Open ecosystem
○ C. High scalability
○ D. High cost
8. Which of the following big data applications involve unstructured data?
○ A. Audio & video analysis
○ B. Financial transaction records
○ C. Social media posts
○ D. SQL databases
9. Which of the following are examples of I/O-intensive tasks?
○ A. Web applications
○ B. Image rendering
○ C. Network I/O
○ D. Log analysis
10. What are some opportunities in the big data era?
○ A. Data mining as a core business analysis tool
○ B. Growth in the information industry
○ C. Expensive processing infrastructure
○ D. Integration with next-gen technologies like IoT
11. Which fields are highly dependent on big data analysis?
○ A. Telecom
○ B. Healthcare
○ C. Retail
○ D. Government
12. What are some of the key features of cloud-based big data solutions?
○ A. Decoupled storage and computing
○ B. Single data source management
○ C. Ultimate scalability
○ D. Low data durability
13. Which are common data types encountered in the big data era?
○ A. Structured data
○ B. Semi-structured data
○ C. Unstructured data
○ D. Manual data
14. What are some major challenges in traditional data processing methods?
○ A. Costly storage
○ B. Limited scalability
○ C. Complexity in analyzing structured data
○ D. Lack of automation
15. Which of the following are examples of semi-structured data?
○ A. JSON files
○ B. SQL databases
○ C. Emails
○ D. XML documents
Single-Choice Questions (Select one correct answer):
1. Which V of big data refers to the speed at which data is generated and processed?
○ A. Volume
○ B. Variety
○ C. Velocity
○ D. Veracity
2. Which computing mode focuses on real-time data processing?
○ A. Batch computing
○ B. Stream computing
○ C. Graph computing
○ D. Query and analysis computing
3. What is the primary function of MapReduce in big data applications?
○ A. Real-time data processing
○ B. Batch data processing
○ C. Graph data analysis
○ D. Query processing
4. What is the focus of big data technologies in modern business?
○ A. Data storage and transmission
○ B. Data mining and application
○ C. Data collection
○ D. Data security
5. What is the key challenge faced by enterprises when dealing with big data?
○ A. Lack of talent
○ B. Low cost of storage
○ C. Easy data availability
○ D. Lack of computing tools
6. What is the estimated scale of China's big data industry by 2025?
○ A. 1 trillion CNY
○ B. 3 trillion CNY
○ C. 5 trillion CNY
○ D. 2 trillion CNY
7. What is the core component of the Kunpeng Big Data Solution?
○ A. Data mining tools
○ B. High-performance computing power
○ C. Low-value density data
○ D. Relational databases
8. Which type of data processing involves relationships and connections in data?
○ A. Batch computing
○ B. Stream computing
○ C. Graph computing
○ D. Query and analysis computing
9. What is the primary goal of CPU-intensive tasks in big data applications?
○ A. Maximize I/O operations
○ B. Efficient use of CPU resources
○ C. Minimize data storage
○ D. Reduce energy consumption
10. What is the main function of a distributed database?
○ A. Centralize all data in one server
○ B. Divide data across multiple servers for efficient access
○ C. Store data in a single location
○ D. Use structured data only
11. Which technology in big data is responsible for low-latency real-time analytics?
○ A. Spark Streaming
○ B. MapReduce
○ C. Flink
○ D. Hive
12. What does the "value" aspect of the 5Vs of big data focus on?
○ A. The usefulness of data
○ B. The speed of data generation
○ C. The accuracy of data
○ D. The size of the data
13. Which sector benefits from big data for user behavior analysis and targeted advertising?
○ A. Finance
○ B. Telecom
○ C. Retail
○ D. Manufacturing
14. Which data type is the hardest to process in traditional systems?
○ A. Structured data
○ B. Semi-structured data
○ C. Unstructured data
○ D. Quantitative data
15. Which application uses big data for better energy management and CO2 reduction?
○ A. Telecom networks
○ B. Clean energy systems
○ C. Retail supply chains
○ D. Financial institutions
True/False Questions:
1. The "veracity" of big data refers to the speed at which data is generated.
○ True / False
2. Stream computing handles data in real time without waiting for large datasets to accumulate.
○ True / False
3. Distributed databases store data in a single location to ensure quick access.
○ True / False
4. Big data applications are primarily used in industries such as healthcare, telecom, and government.
○ True / False
5. MapReduce is used in graph computing for real-time analysis of network structures.
○ True / False
6. Semi-structured data, such as JSON files, combines elements of both structured and unstructured data.
○ True / False
7. Kunpeng computing focuses on developing customized processors for gaming applications only.
○ True / False
8. Batch computing is ideal for real-time data processing and analytics.
○ True / False
9. The main challenge in the big data era is the lack of proper data storage solutions.
○ True / False
10. The Kunpeng Big Data Solution is a secure and scalable solution designed for intelligent big data construction.
○ True / False
11. Flink and Storm are technologies used in stream computing for big data processing.
○ True / False
12. I/O-intensive tasks in big data computing primarily utilize CPU resources.
○ True / False
13. The arrival of the intelligent era is marked by the rise of technologies such as cloud, big data, and AI.
○ True / False
14. The boom in big data is driven by the increasing amount of data generated by connected devices and systems.
○ True / False
15. Data silos in organizations result in fragmented data that cannot be easily shared or utilized.
○ True / False
True/False Questions (Answer Key):
1. The "veracity" of big data refers to the speed at which data is generated.
○ False (Veracity refers to the accuracy and trustworthiness of data; the speed of data generation is velocity.)
2. Stream computing handles data in real time without waiting for large datasets to accumulate.
○ True
3. Distributed databases store data in a single location to ensure quick access.
○ False (Distributed databases spread data across multiple servers or locations.)
4. Big data applications are primarily used in industries such as healthcare, telecom, and government.
○ True
5. MapReduce is used in graph computing for real-time analysis of network structures.
○ False (MapReduce is a batch processing model, not a real-time graph computing tool.)
6. Semi-structured data, such as JSON files, combines elements of both structured and unstructured data.
○ True
7. Kunpeng computing focuses on developing customized processors for gaming applications only.
○ False (Kunpeng computing targets a variety of applications, not gaming alone.)
8. Batch computing is ideal for real-time data processing and analytics.
○ False (Batch computing processes large volumes of accumulated data, typically at scheduled intervals; real-time processing is the role of stream computing.)
9. The main challenge in the big data era is the lack of proper data storage solutions.
○ False (Storage is not the main challenge; enterprises also face data silos, security risks, and talent shortages.)
10. The Kunpeng Big Data Solution is a secure and scalable solution designed for intelligent big data construction.
○ True
11. Flink and Storm are technologies used in stream computing for big data processing.
○ True
12. I/O-intensive tasks in big data computing primarily utilize CPU resources.
○ False (I/O-intensive tasks spend most of their time waiting on disk or network I/O rather than using the CPU.)
13. The arrival of the intelligent era is marked by the rise of technologies such as cloud, big data, and AI.
○ True
14. The boom in big data is driven by the increasing amount of data generated by connected devices and systems.
○ True
15. Data silos in organizations result in fragmented data that cannot be easily shared or utilized.
○ True
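
Illustration for answers 2 and 8 above (batch vs. stream computing). The following is a minimal sketch in plain Python, not taken from the course material: it counts words once over a complete dataset (batch) and then incrementally as each record arrives (stream). The function names and the sample `events` list are hypothetical and chosen only for illustration; real systems such as MapReduce, Flink, or Storm distribute this work across many machines.

```python
from collections import Counter
from typing import Iterable, Iterator


def batch_word_count(records: list[str]) -> Counter:
    """Batch mode: the whole dataset is available before processing starts."""
    counts: Counter = Counter()
    for line in records:
        counts.update(line.split())
    return counts  # a single result, produced after all data is read


def stream_word_count(records: Iterable[str]) -> Iterator[Counter]:
    """Stream mode: update and emit a running result as each record arrives."""
    counts: Counter = Counter()
    for line in records:
        counts.update(line.split())
        yield Counter(counts)  # snapshot of the result so far


# Hypothetical sample data standing in for log records.
events = ["error disk", "ok", "error net"]

batch_result = batch_word_count(events)
stream_results = list(stream_word_count(iter(events)))
```

The batch function returns one answer after reading everything, while the stream function yields an updated answer after every record; the last streamed snapshot matches the batch result.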