Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
11 views33 pages

Seminar Report Final 123

The document is a seminar report titled 'Digital Campus Big Data Using Cloud Storage Technology' submitted by Jayesh Nagaraj Patil for a B.Tech degree in Computer Engineering. It discusses the integration of cloud storage technology in digital campus environments, highlighting its advantages over traditional storage systems, and presents a construction scheme for utilizing cloud storage in managing campus big data. The report includes sections on the introduction, objectives, literature survey, and a conclusion, along with acknowledgments and a certificate of completion.

Uploaded by

patiljayesh95666
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views33 pages

Seminar Report Final 123

The document is a seminar report titled 'Digital Campus Big Data Using Cloud Storage Technology' submitted by Jayesh Nagaraj Patil for a B.Tech degree in Computer Engineering. It discusses the integration of cloud storage technology in digital campus environments, highlighting its advantages over traditional storage systems, and presents a construction scheme for utilizing cloud storage in managing campus big data. The report includes sections on the introduction, objectives, literature survey, and a conclusion, along with acknowledgments and a certificate of completion.

Uploaded by

patiljayesh95666
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 33

Digital Campus Big Data

Using Cloud Storage Technology


A Seminar I report submitted in

partial fulfillment of the

requirements for the Degree of

B.Tech.
in

Computer Engineering
Submitted by

Jayesh Nagaraj Patil

DEPARTMENT OF COMPUTER ENGINEERING


S.S.V.P.S.’s B.S. DEORE COLLEGE OF ENGINEERING,
DHULE 2022-23

I
Digital Campus Big Data
Using Cloud Storage Technology
A Seminar I report submitted in

partial fulfillment of the

requirements for the Degree of

Bachelor of Technology
in

Computer Engineering
Submitted by

Jayesh Nagaraj Patil


Guided by

Prof. Bhushan S. Parakh

DEPARTMENT OF COMPUTER ENGINEERING


S.S.V.P.S.’s B.S. DEORE COLLEGE OF ENGINEERING,
DHULE 2022-23

II
S.S.V.P.S.’s B.S. DEORE COLLEGE OF
ENGINEERING, DHULE
DEPARTMENT OF COMPUTER ENGINEERING

CERTIFICATE

This is to certify that the Seminar I entitled “Digital Campus Big Data
Using Cloud Storage Technology” has been carried out by

Jayesh Nagaraj Patil

under my guidance in partial fulfillment of the degree of Bachelor of


Technology in Computer Engineering of Dr. Babasaheb Ambedkar
Technological University, Lonere during the academic year 2022-23. To the
best of my knowledge and belief this work has not been submitted elsewhere
for the award of any other degree.

Date:

Place: Dhule

Guide

Prof. Bhushan S. Parakh

Head Principal
Prof. B. R. Mandre Dr. Hitendra D. Patil

III
ACKNOWLEDGEMENT

We would like to express our deepest appreciation to all those who provide us the
possibility to complete this paper. A special gratitude we would like to give to our guide,
Prof. Bhushan Parakh sir whose contribution in stimulating suggestion and
encouragement, helped us to coordinate our research paper.

Also we would like to thank We are gladly thankful to principal Dr. Hitendra D. Patil for
providing such a great opportunities to learn something new and innovative. We are also
thankful to our faculty who gave us a great guidance regarding Seminar report.

Jayesh Nagaraj Patil

IV
Table of Contents

ABSTRACT........................................................................................................................1
1. INTRODUCTION.........................................................................................................2
2. OBJECTIVES................................................................................................................3
3. LITERATURE SURVEY..............................................................................................4
3.1 What is Cloud Storage ?..........................................................................................4
3.1.1 Digital Campus..............................................................................................5
3.1.2 Big Data.........................................................................................................6
3.2 Traditional Storage System.....................................................................................7
3.2.1 Structure of Traditional Storage System........................................................8
3.2.2 Defects Traditional Storage System..............................................................9
3.3 Need of Cloud Storage..........................................................................................12
4. CLOUD STORAGE....................................................................................................13
4.1 Advantages............................................................................................................14
4.2 Disadvantages........................................................................................................16
4.3 Construction of Cloud Storage System.................................................................18
4.3.1 Architecture Design of Cloud Storage System.............................................19
CONCLUSION...............................................................................................................................23

BIBLIOGRAPHY............................................................................................................................24

V
Figure Index

1. Fig 3.2.1 a : Design of Traditional Storage System............................................8

2. Fig 3.2.2 a : Comparison between Old and New Data Storage System.............10

3. Fig. 4.3.1 Architectural model of cloud storage system.....................................19

4. Fig. 4.3.1 High level cloud storage architecture.................................................21

VI
ABSTRACT

Realizing campus intelligent management and service mode by combining cloud storage
technology is the focus of current research in digital campus construction work. By
analyzing the IT development trend of digital campus, the defective problems of
traditional storage system, from the architecture design and platform construction of
cloud storage system for digital campus, we propose a cloud storage construction
scheme for campus big data scenario, and discuss the application of virtualization,
streamlined configuration, storage optimization and other technologies in cloud storage.
The cloud storage technology is applied in the construction of digital campus, which
strongly accelerates the process of digital campus informatization.

Key Words: Digital campus, Cloud storage, Virtualization, Big data.

1
Chapter – 1
INTRODUCTION
Cloud Storage is a model of data storage in which the digital data is stored in logical
pools, the physical storage spans multiple servers (and often locations), and the physical
environment is typically owned and managed by a hosting company. These cloud storage
providers are responsible for keeping the data available and accessible, and the physical
environment protected and running. People and organizations buy or lease storage
capacity from the providers to store user, organization, or application data. Cloud storage
services may be accessed through a co-located cloud computer service, a web service
application programming interface (API) or by applications that utilize the API, such as
cloud desktop storage, a cloud storage gateway or Web-based content management
systems. Cloud storage is a model of computer data storage in which the digital data is
stored in logical pools, said to be on "the cloud". The physical storage spans multiple
servers (sometimes in multiple locations), and the physical environment is typically
owned and managed by a hosting company. These cloud storage providers are
responsible for keeping the data available and accessible, and the physical environment
secured, protected, and running. People and organizations buy or lease storage capacity
from the providers to store user, organization, or application data. Cloud storage services
may be accessed through a co- located cloud computing service, a web service
application programming interface (API) or by applications that use the API, such as
cloud desktop storage, a cloud storage gateway or Web-based content management
systems.

2
Chapter – 2

OBJECTIVES
Cloud storage allows you to save data and files in an off-site location that you access
either through the public internet or a dedicated private network connection. Data that
you transfer off-site for storage becomes the responsibility of a third-party cloud
provider. The provider hosts, secures, manages, and maintains the servers and associated
infrastructure and ensures you have access to the data whenever you need it. Cloud
storage delivers a cost-effective, scalable alternative to storing files on on-premise hard
drives or storage networks. Computer hard drives can only store a finite amount of data.
When users run out of storage, they need to transfer files to an external storage device.
Traditionally, organizations built and maintained storage area networks (SANs) to
archive data and files. SANs are expensive to maintain, however, because as stored data
grows, companies have to invest in adding servers and infrastructure to accommodate
increased demand. Cloud storage services provide elasticity, which means you can scale
capacity as your data volumes increase or dial down capacity if necessary. By storing
data in a cloud, your organization save by paying for storage technology and capacity as
a service, rather than investing in the capital costs of building and maintaining in-house
storage networks. You pay for only exactly the capacity you use. While your costs might
increase over time to account for higher data volumes, you don’t have to overprovision
storage networks in anticipation of increased data volume.

3
Chapter – 3

LITERATURE SURVEY

 3.1 What is Cloud Storage ?


Cloud storage is a model of computer data storage in which the digital data is
stored in logical pools, said to be on "the cloud". The physical storage spans
multiple servers, and the physical environment is typically owned and managed
by a hosting company.
The cloud is simply a network of computers. It refers to a network of computers
owned by one person or company, where other people or companies can store
their data. On your personal machine, everything is stored on one physical storage
device – your hard drive. Cloud storage refers to a virtual storage area that can
span across many different physical storage devices. When you use cloud storage,
some of your files may be on a physical server in New York while other files are
on a physical server in California. Since most users do not know where their
physical files are, using cloud storage can be thought of as a vague, untouchable
thing – much like a cloud itself! Most data you access via the internet that is not
stored on your personal computer is part of the cloud. For example, if you use an
internet- based email service like Gmail, Yahoo, or Live, you can access your
email anywhere you have an internet connection. This is because the data is
stored on servers owned by the respective e-mail providers, not your local
machine. Your email is in the cloud. Although the cloud is more like an idea than
something you can physically touch, the computers that make up the “cloud”
require physical space. The facilities that store the physical equipment used by the
cloud are called data centers. Sometimes you may also hear the term server farm.
Data centers can be anywhere in the world. They are generally in warehouses that
have complex cooling systems to keep the computers from overheating.
Cloud Storage is technology that allows you to save files in storage, and then
access those files via the Cloud. Let's break down this definition. First, storage is
the computer's ability to save files and other resources for later use. When you
restart a computer, the files that are still available after the computer turns back
on are saved and read from storage. Such storage commonly consists of a hard

4
drive, a

5
USB Flash drive, or another type of drive. Because local data drives can be
damaged or stolen, an idea was developed to use data drives over a network as
storage. This allows the drives to be secured in a data center and backed up
automatically. Initially, network storage required fast local networks (LAN), but
today we have a ubiquitous network called the Internet. The second part of Cloud
Storage, the Cloud, represents the Internet. Any service, including storage,
available over the Internet, is called Cloud service. If you use GMAIL it is email
in the Cloud, if you use an Amazon MP3 player, that's music in the Cloud.

 3.1.1 Digital Campus


A Digital Campus refers to the online offerings of a college or university where
college work is completed either partially or wholly online, often with the
assistance of the teacher, professor, or teaching assistant. Many colleges and
universities now offer such courses (or entire degree programs) either partially or
wholly online. There are an estimated 4,500 such institutions with total
enrollments approaching perhaps 2 million.
The majority of students using virtual campuses to obtain online degrees are
adults students for three main reasons:

 Flexibility – Adults with full-time jobs and families would find it impossible
to attend daily at a traditional school setting. Online classes allow students to
work at their own pace and work around their busy lives.

Some of the aspects that go under virtual campus includes various types of
learning activities such as lectures, homework, discussions, readings,
assignments. Classes are usually self paced using online documents and databases
that might be available to them. Tests and other assignments are available online
in specific programs used for online classes. Other methods used in virtual
campus are live sessions, videoconferencing, discussing and sharing various
applications. Individuals are able to access the materials any time they want
under the teacher's control and are able to access anywhere online where they're
able to access internet usage. Email is a big part of the virtual campuses and is
often used before, during and after sessions. This aids individuals in exchanging
information and or point them to the

6
right direction that would be useful in increasing and understanding various
methods available to them via documents and online sources.

 3.1.2 Big Data


Big data primarily refers to data sets that are too large or complex to be dealt with
by traditional data-processing application software. Data with many fields (rows)
offer greater statistical power, while data with higher complexity (more attributes
or columns) may lead to a higher false discovery rate. Though used sometimes
loosely partly because of a lack of formal definition, the interpretation that seems
to best describe Big data is the one associated with large body of information that
we could not comprehend when used only in smaller amounts.

Big data analysis challenges include capturing data, data storage, data analysis,
search, sharing, transfer, visualization, querying, updating, information privacy,
and data source. Big data was originally associated with three key
concepts: volume, variety, and velocity. The analysis of big data presents
challenges in sampling, and thus previously allowing for only observations and
sampling. Thus a fourth concept, veracity, refers to the quality or insightfulness of
the data. Without sufficient investment in expertise for big data veracity, then the
volume and variety of data can produce costs and risks that exceed an
organization's capacity to create and capture value from big data.

7
 3.2 Traditional Storage System
Data storage is the recording (storing) of information (data) in a storage medium.
Handwriting, phonographic recording, magnetic tape, and optical discs are all
examples of storage media. Biological molecules such as RNA and DNA are
considered by some as data storage. Recording may be accomplished with
virtually any form of energy. Electronic data storage requires electrical power to
store and retrieve data.
 Recording Media
A recording medium is a physical material that holds information. Newly
created information is distributed and can be stored in four storage media–
print, film, magnetic, and optical–and seen or heard in four information
flows–telephone, radio and TV, and the Internet as well as being observed
directly. Digital information is stored on electronic media in many
different recording formats.
 Global Capacity
A 2003 UC Berkeley report estimated that about five exabytes of new
information were produced in 2002 and that 92% of this data was stored
on hard disk drives. This was about twice the data produced in 2000. The
amount of data transmitted over telecommunication systems in 2002 was
nearly 18 exabytes—three and a half times more than was recorded on
non- volatile storage. Telephone calls constituted 98% of the
telecommunicated information in 2002. The researchers' highest estimate
for the growth rate of newly stored information (uncompressed) was more
than 30% per year.
 Digitalization
A 2011 Science (journal) article estimated that the year 2002 was the
beginning of the digital age for information storage: an age in which more
information is stored on digital storage devices than on analog storage
devices. In 1986, approximately 1% of the world's capacity to store
information was in digital format; this grew to 3% by 1993, to 25% by
2000, and to 97% by 2007. These figures correspond to less
than three compressed exabytes in 1986, and 295 compressed exabytes in
2007. The quantity of digital storage doubled roughly every three years.

8
 3.2.1 Structure of Traditional Storage System

Traditional data systems, such as relational databases and data warehouses, have
been the primary way businesses and organizations have stored and analysed their
data for the past 30 to 40 years. Although other data stores and technologies exist,
the major percentage of business data can be found in these traditional systems.
Traditional systems are designed from the ground up to work with data that has
primarily been structured data. Characteristics of structured data include the
following:

Fig 3.2.1 a : Design of Traditional Storage System.

 Clearly defined fields organized in records. Records are usually stored in


tables. Fields have names, and relationships are defined between different
fields.
 The use of Structured Query Language (SQL) for managing and accessing
the data.
 Relational and warehouse database systems that read data in 8k or 16k
block sizes. These block sizes load data into memory, and then the data
are processed by applications. When processing large volumes of data,
reading the data in these block sizes is extremely inefficient.
 Organizations today contain large volumes of information that is not
actionable or being leveraged for the information it contains.
 In a number of traditional siloed environments data scientists can spend
80% of their time looking for the right data and 20% of the time doing

9
analytics. A data-driven environment must have data scientists spending a
lot more time doing analytics.
 Schema-on-write that requires data be validated against a schema before it
can be written to disk. A significant amount of requirements analysis,
design, and effort up front can be involved in putting the data in clearly
defined structured formats. This can increase the time before business
value can be realized from the data.
 An order management system is designed to take orders. A web
application is designed for operational efficiency. A customer system is
designed to manage information on customers. Data from these systems
usually reside in separate data silos. However, bringing this information
together and correlating with other data can help establish detailed
patterns on customers.
 Every year organizations need to store more and more detailed
information for longer periods of time. Increased regulation in areas such
as health and finance are significantly increasing storage volumes.
Expensive shared storage systems often store this data because of the
critical nature of the information. Shared storage arrays provide features
such as striping (for performance) and mirroring (for availability).
Managing the volume and cost of this data growth within these traditional
systems is usually a stress point for IT organizations. Examples of data
often stored in structured form include Enterprise Resource Planning
(ERP), Customer Resource Management (CRM), financial, retail, and
customer information.

 Defects of Traditional Storage System


Traditional storage such as server hard drives, DAS, NAS, etc. are more focused
on hardware fault isolation and better stability and performance, but they also
have some defects and are not suitable for the needs of high performance
computing architectures and applications in big data scenarios. The main
drawbacks are:

1
Fig 3.2.2 a : Comparison between Old and New Data Storage System.

 Integration of technology stacks from multiple vendors. Storage systems


use different components from different vendors' technology stacks and
lack the ability to design and deliver end-to-end HPC solutions. This
dramatically increases the complexity of the system and leads to I/0
bottlenecks, making testing, quality control, and long-term maintenance
very difficult.
 Limited automation in the configuration and tuning process. Multi-vendor
HPC systems typically include a step-by-step installation process, which
is often long and difficult due to the involvement of components from
different vendors. This time-consuming installation and configuration
process can impact project duration, making the system highly unsuitable
for dynamically changing requirements. The lack of 7 integration settings
also affects the ability to interact with all the components of the system
and performance tuning.

1
 Lack of management software support. Management software is a very
critical component of the storage system, because it plays a very important
role in the smooth operation of the entire system. Users of traditional
storage systems will encounter great difficulty in capturing and recording
various parameters (such as performance) of the storage system. The lack
of sophisticated diagnostic tools makes log analysis and troubleshooting
extremely difficult.
 The unused storage capacity allocated for various applications in the
college campus network is costly to the college, including the energy
waste it creates, as unused storage consumes energy and generates heat,
increasing total energy consumption, and the space it takes up, as unused
storage will take up unnecessary space that could be allocated to other
useful components. To address such situations, thin provisioning is a
necessary storage application technology. Thin provisioning refers to
configuring storage according to actual demand.
 With this technique, logical storage is allocated to applications based on
predicted demand. The actual allocated storage is based on the current
demand of the application and is much smaller than the logical storage.
Once the application's storage needs increase, more storage is allocated to
the application from a pool of resources. With this approach, thin
provisioning provides efficient use of storage and reduces waste of
physical storage.
 In addition to upward and downward scaling, outward and inward scaling
is the key difference in the cloud. The ability to automatically add new
resources or withdraw allocated resources to meet changing needs makes
the cloud the most suitable choice for low-cost, high-yield HPC (high-
performance computing).

1
 3.2 Need of Cloud Storage

 Cloud storage is not only safe and secure, it's also cost-effective. Without
cloud storage, you're constantly spending money on local on-premises
storage, whether it's through purchase or just maintenance. Not to mention
the manpower that's required to run them.

 Cloud storage can be used for copying virtual machine images from the
cloud to on-premises locations or to import a virtual machine image from
an on-premises location to the cloud image library. In addition, cloud
storage can be used to move virtual machine images between user
accounts or between data centres.

 Storage availability and data protection is intrinsic to object storage


architecture, so depending on the application, the additional technology,
effort and cost to add availability and protection can be eliminated.

 Organizations can choose between off-premises and on-premises cloud


storage options, or a mixture of the two options, depending on relevant
decision criteria that is complementary to initial direct cost savings
potential; for instance, continuity of operations (COOP), disaster recovery
(DR), security (PII, HIPAA, SARBOX, IA/CND), and records retention
laws, regulations, and policies.

 Cloud storage can be mapped as a local drive with the WebDAV protocol.
It can function as a central file server for organizations with multiple
office locations.
 Cloud storage can be used as natural disaster proof backup, as normally
there are 2 or 3 different backup servers located in different places around
the globe.

1
Chapter – 4
CLOUD STORAGE
The cloud is simply a network of computers. It refers to a network of computers owned
by one person or company, where other people or companies can store their data. On
your personal machine, everything is stored on one physical storage device – your hard
drive. Cloud storage refers to a virtual storage area that can span across many different
physical storage devices. When you use cloud storage, some of your files may be on a
physical server in New York while other files are on a physical server in California.
Since most users do not know where their physical files are, using cloud storage can be
thought of as a vague, untouchable thing – much like a cloud itself.

Most data you access via the internet that is not stored on your personal computer is part
of the cloud. For example, if you use an internet-based email service like Gmail, Yahoo,
or Live, you can access your email anywhere you have an internet connection. This is
because the data is stored on servers owned by the respective e-mail providers, not your
local machine. Your email is in the cloud.

Although the cloud is more like an idea than something you can physically touch, the
computers that make up the “cloud” require physical space. The facilities that store the
physical equipment used by the cloud are called data centers. Sometimes you may also
hear the term server farm. Data centers can be anywhere in the world. They are generally
in warehouses that have complex cooling systems to keep the computers from
overheating.

Cloud storage is a model where data is stored, managed and backed up through a cloud
based remote access system. There are three distinctly different storage models for this
type of storage. Public cloud services provide a storage environment that is shared by
multiple tenants. This is supposed to be the simplest model of storage that is considered
to be most suitable for starters and is befitting for those dealing with unstructured data.

1
 4.1 Advantages
 Usability: All cloud storage services reviewed in this topic have desktop
folders for Mac’s and PC’s. This allows users to drag and drop files
between the cloud storage and their local storage.
 Bandwidth: You can avoid emailing files to individuals and instead send a
web link to recipients through your email.
 Accessibility: Stored files can be accessed from anywhere via Internet
connection.
 Disaster Recovery: It is highly recommended that businesses have an
emergency backup plan ready in the case of an emergency. Cloud storage
can be used as a back‐up plan by businesses by providing a second copy
of important files. These files are stored at a remote location and can be
accessed through an internet connection.
 Cost Savings: Businesses and organizations can often reduce annual
operating costs by using cloud storage; cloud storage costs about 3 cents
per gigabyte to store data internally. Users can see additional cost savings
because it does not require internal power to store information remotely.
 Syncing: Many cloud storage services offer syncing services. When you
set up the software on your local machine, it will create a special folder on
your computer and any changes made to the files in that folder will be
automatically synced to your cloud storage as long as your computer is
connected to the internet. When you access your cloud storage from a
different computer, you will see the same files that are in that folder on
your personal computer. Also, any files you upload to your cloud storage
from a different computer will be synced to that same cloud storage
folder.
 Convenience: One of the biggest benefits of using the cloud is that your
data can be accessed from multiple devices. Cloud storage services can
usually be accessed from any device with an internet 4 connection. You
can check your email, read your documents, view your online photos, or
listen to your online music from any computer, tablet, or internet-enabled
cell phone.
 Online Backups: One advantage to using a cloud service is that it can
serve as an online backup for important files in case your computer stops

1
working. If your computer crashes or you accidentally delete pictures
from

1
your hard drive, you can simply login to the cloud service and download
your files again. Some cloud storage services may prompt you to
automatically add files to the synced folder as soon as you add them to
your computer to serve as a backup.
 Cloud storage can be used for copying virtual machine images from the
cloud to on-premises locations or to import a virtual machine image from
an on-premises location to the cloud image library. In addition, cloud
storage can be used to move virtual machine images between user
accounts or between data centers.
 Cloud storage can be used as natural disaster proof backup, as normally
there are 2 or 3 different backup servers located in different places around
the globe.
 Storage availability and data protection is intrinsic to object storage
architecture, so depending on the application, the additional technology,
effort and cost to add availability and protection can be eliminated.
 Companies need only pay for the storage they actually use, typically an
average of consumption during a month, quarter, or year. [11] This does
not mean that cloud storage is less expensive, only that it incurs operating
expenses rather than capital expenses.
 Hybrid cloud storage is a term for a storage infrastructure that uses a
combination of on-premises storage resources with cloud storage. The on-
premises storage is usually managed by the organization, while the public
cloud storage provider is responsible for the management and security of
the data stored in the cloud.
 Hybrid cloud storage can be implemented by an onpremises cloud storage
gateway that presents a file system or object storage interface which the
users can access in the same way they would access a local storage
system. The cloud storage gateway transparently transfers the data to and
from the cloud storage service, providing low latency access to the data
through a local cache.

1
 4.2 Disadvantages

 Usability: Be careful when using drag/drop to move a document into the


cloud storage folder. This will permanently move your document from its
original folder to the cloud storage location. Do a copy and paste instead
of drag/drop if you want to retain the document’s original location in
addition to moving a copy onto the cloud storage folder.
 Bandwidth: Several cloud storage services have a specific bandwidth
allowance. If an organization surpasses the given allowance, the
additional charges could be significant. However, some providers allow
unlimited bandwidth. This is a factor that companies should consider
when looking at a cloud storage provider.
 Accessibility: If you have no internet connection, you have no access to
your data.
 Data Security: There are concerns with the safety and privacy of
important data stored remotely. The possibility of private data
commingling with other organizations makes some businesses uneasy. If
you want to know more about those issues that govern data security and
privacy, here is an interesting article on the recent privacy debates.
 Software: If you want to be able to manipulate your files locally through
multiple devices, you’ll need to download the service on all devices.
 Longevity: Companies are not permanent and the services and products
they provide can change. Outsourcing data storage to another company
needs careful investigation and nothing is ever certain. Contracts set in
stone can be worthless when a company ceases to exist or its
circumstances change. Companies can.
 Accessibility: Performance for outsourced storage is likely to be lower
than local storage, depending on how much a customer is willing to spend
for WAN bandwidth. Reliability and availability depend on wide area
network availability and on the level of precautions taken by the service
provider. Reliability should be based on hardware as well as various
algorithms used.
 Users with specific records-keeping requirements, such as public agencies

1
that must retain electronic records according to statute, may encounter

1
complications with using cloud computing and storage. For instance, the
U.S. Department of Defense designated the Defense Information Systems
Agency (DISA) to maintain a list of records management products that
meet all of the records retention, personally identifiable information (PII),
and security (Information Assurance; IA) requirements.
 Piracy and copyright infringement may be enabled by sites that permit
filesharing. For example, the CodexCloud ebook storage site has faced
litigation from the owners of the intellectual property uploaded and shared
there, as have the GrooveShark and YouTube sites it has been compared
to.
 The legal aspect, from a regulatory compliance standpoint, is of concern
when storing files domestically and especially internationally.
 The resources used to produce large data centers, especially those needed
to power them, is causing nations to drastically increase their energy
production. This is leads to further climate damaging implications.
 Internet Connection Required: Since data stored on the cloud is on a third
party’s computer, you will need an internet connection to access it. If you
do not have access to the internet, you will not be able access the network
of computers that store your data.
 Bandwidth Costs: Bandwidth refers to the amount of data that can be
carried over the internet from one point to another. Some internet service
providers give users a monthly bandwidth allocation. By transferring data
to and from the cloud, you will be using part of the allocation and may
have to pay overage charges if you go over that allocation.
 Privacy: One downside of the cloud is potential privacy issues. Make sure
you read the terms of service and privacy policy of any cloud storage
service closely. You want to make sure that you are not agreeing to
anything that you are uncomfortable with. For example, any cloud storage
company in the U.S. can be required to give the government access to
your files with a subpoena.

2
 4.3 Construction of Cloud Storage System.
Cloud computing has revolutionized the technology used to store information and
run applications. Cloud storage is storage based on cloud computing technology.
It does not refer to a specific storage device or technology, but rather to a large
collection of storage devices and servers that are used to hold data in a cloud
computing environment. Users of cloud storage do not use a specific storage
device, but use the cloud storage system through some kind of access service.
Storage resources in a cloud storage system are provided in the form of pools and
are allocated in real time on demand; capacity is provided on demand and storage
resources can be used according to the needs of big data applications, which are
highly scalable and elastic; and cost-benefit, which provides significant
economies of scale by paying for the use of resources. The above features of
cloud storage make it the first choice for data storage in big data scenarios.
 Cloud storage is:
1) Made up of many distributed resources, but still acts as one, either
in a federated or a cooperative storage cloud architecture.
2) Highly fault tolerant through redundancy and distribution of data.
3) Highly durable through the creation of versioned copies.
4) Typically eventually consistent with regard to data replicas.
The closed and rigid nature of traditional infrastructure brings problems such as
expensive and low utilization of IT infrastructure. Transforming traditional
infrastructure into modular, developmental, scalable, and programmable
infrastructure through consolidation, virtualization, automation, and sharing can
effectively improve utilization as well as optimize infrastructure and bring more
flexibility. With the rapid spread of big data analytics platforms and applications,
data collection and processing and analysis are done quickly and cheaply,
requiring an inexpensive infrastructure with super computing power as well as
unlimited storage. The solution is to use cloud computing technology to build
clusters that form dynamic, elastic and scalable resource pools to effectively meet
the increasing and intermittent demand for computing power.

2
 4.3.1 Architecture Design of Cloud Storage System.
The cloud based storage architecture model is designed in Fig. 4.3.1 a

Fig. 4.3.1 Architectural model of cloud storage system

 Storage layer: The bottom layer of the cloud storage architecture. It


contains different types of storage devices. The storage devices included
in this layer include Fibre Channel storage devices, IP storage devices,
DAS storage devices, and so on. These storage devices may be
geographically located in different regions and are connected together
through the Internet or LAN. This layer has a unified storage management
system that can manage all the heterogeneous types of devices located in
the same storage pool and deliver them as a service on demand, which can
be called Storage as a Service or Infrastructure as a Service. The key
technology used in this layer is storage virtualization.
 Infrastructure management layer: The underlying storage devices are
managed in a unified manner and the necessary infrastructure is provided.
This infrastructure is important because it provides various key functions
such as security, space management, backup, storage consolidation, etc.

2
using technologies such as clusters and grids. Other services provided by
this layer are backup, disaster recovery, encryption, compression, etc.
 Application interface layer: The application interface layer provides for
various interfaces or APIs to support or use cloud storage use cases. Some
common cloud storage use cases are data archiving applications, backup
applications, and so on. Different cloud storage service providers develop
their own custom application interfaces based on the services they
provide.
 Application interface layer: The application interface layer provides for
various interfaces or APIs to support or use cloud storage use cases. Some
common cloud storage use cases are data archiving applications, backup
applications, and so on. Different cloud storage service providers develop
their own custom application interfaces based on the services they
provide.
 Storage Virtualization: Storage virtualization is a mechanism to ensure
that different heterogeneous types of storage devices are stored and
managed as a single unit. This will support unified memory management,
easier deployment, and integrated monitoring of the entire storage
infrastructure. Storage virtualization primarily involves partitioning
available storage into virtual volumes. Virtual volumes can be created by
combining different types of storage devices. These virtual volumes
abstract away the details of the storage devices in the volume and then
present them as storage devices to the operating system. Virtual volumes
can be expanded, created, and deleted based on storage requirements
without any downtime. Key benefits provided by storage virtualization
include providing unified storage management capabilities; facilitating
aggregation of heterogeneous storage devices; allowing storage resources
to be allocated and released as storage requirements change; and
providing scalability to the storage infrastructure.

2
Fig. 4.3.1 High level cloud storage architecture.

 Cloud storage is based on highly virtualized infrastructure and is like


broader cloud computing in terms of interfaces, near-instant elasticity and
scalability, multi-tenancy, and metered resources. Cloud storage services
can be used from an off-premises service (Amazon S3) or deployed on
premises (ViON Capacity Services).
 There are three types of cloud storage: a hosted object storage service, file
storage, and block storage. Each of these cloud storage types offer their
own unique advantages.
 Examples of object storage services that can be hosted and deployed with
cloud storage characteristics include Amazon S3, Oracle Cloud Storage
and Microsoft Azure Storage, object storage software like Openstack
Swift, object storage systems like EMC Atmos, EMC ECS and Hitachi
Content Platform, and distributed storage research projects like
OceanStore and VISION Cloud.
 Examples of file storage services include Amazon Elastic File System
(EFS) and Qumulo Core, used for applications that need access to shared

2
files and require a file system. This storage is Architecture A high level
architecture of cloud storage. often supported with a Network Attached
Storage (NAS) server, used for large content repositories, development
environments, media stores, or user home directories.
 A block storage service like Amazon Elastic Block Store (EBS) is used
for other enterprise applications like databases and often require
dedicated, low latency storage for each host. This is comparable in certain
respects to direct attached storage (DAS) or a storage area network
(SAN).
 Cloud storage typically refers to a hosted object storage service, but the
term has broadened to include other types of data storage that are now
available as a service, like block storage.
 Personal Cloud Storage: Also known as mobile cloud storage, personal
cloud storage is a subset of public cloud storage that applies to storing an
individual's data in the cloud and providing the individual with access to
the data from anywhere. It also provides data syncing and sharing
capabilities across multiple devices. Apple's iCloud is an example of
personal cloud storage.
 Public Cloud Storage: Public cloud storage is where the enterprise and
storage service provider are separate and there aren't any cloud resources
stored in the enterprise's data center. The cloud storage provider fully
manages the enterprise's public cloud storage.
 Private Cloud Storage: A form of cloud storage where the enterprise and
cloud storage provider are integrated in the enterprise's data center. In
private cloud storage, the storage provider has infrastructure in the
enterprise's data center that is typically managed by the storage provider.
Private cloud storage helps resolve the potential for security and
performance concerns while still offering the advantages of cloud storage.

2
CONCLUSION
Building a digital campus is a fundamental means to promote education informatization.
Technologies such as cloud computing, big data and 5G bring new opportunities and
challenges to the construction of digital campus. The scale, changeability and diversity of
big data bring great challenges to IT. Big data requires high-quality IT infrastructure,
platforms, databases, file storage systems, etc. Cloud storage provides high-performance
and high-efficiency data storage infrastructure for campus construction, and strongly
promotes campus informatization.

Cloud storage allows individuals and businesses to store and retrieve computer files via
an internet-connected device. Cloud storage has grown increasingly popular among
individuals who need larger storage space and for businesses seeking an efficient off-site
data back-up solution. Because of cloud storage's increasing popularity and use, cloud
security has become a major concern to protect data integrity, prevent hacking attempts,
and avoid file or identity theft.

Cloud storage helps businesses with major data storage needs to save a significant
amount of space and money by eliminating the need for data storage infrastructure on the
business premises. The cloud storage provider owns and maintains all the necessary
hardware and software so the cloud users don’t have to. Purchasing ongoing cloud
storage may cost more in the long run, but it can be significantly less expensive upfront.
Further, businesses can almost instantly scale up or down how much cloud storage they
have access to as their storage needs change.

The cloud also enables employees to collaborate with colleagues—and work remotely
and outside of regular business hours—while facilitating smooth document collaboration
by allowing authorized employees easy access to the most updated version of a file. At
the personal level, cloud storage allows mobile data and enables digital life in the holistic
way we live it today. Without the cloud, smart phones would not be able to be the
interface of so much data ( photos, documents, information on the go). Using the cloud to
store files can also have a positive effect on the environment since it cuts down energy
consumption.

2
Bibliography

1) https://web.archive.org/web/20120329061438/http://oceanstore.cs.berkeley.edu/p

ublications/papers/pdf/ieeeic.pdf

2) https://web.archive.org/web/20160322022215/http://www.vion.com/capacity-

services/vion-capacity-services.html

3) http://oceanstore.cs.berkeley.edu/publications/papers/pdf/ieeeic.pdf

4) http://www.computerweekly.com/Articles/2009/06/10/235429/A-history-of-

cloud-computing.htm

You might also like