A cloud database is a database that typically runs on a cloud computing platform, with access to it provided as a service. There are two common deployment models: users can run databases on the cloud independently, using a virtual machine image, or they can purchase access to a database service maintained by a cloud database provider. Of the databases available on the cloud, some are SQL-based and some use a NoSQL data model.
Database services handle the scalability and high availability of the database, and they make the underlying software stack transparent to the user.
There are two primary methods to run a database on a cloud platform:
- Virtual machine image
- Cloud platforms allow users to purchase virtual-machine instances for a limited time, and one can run a database on such virtual machines. Users can either upload their own machine image with a database installed on it, or use ready-made machine images that already include an optimized installation of a database.
- Database-as-a-service (DBaaS)
- With a database-as-a-service model, users pay fees to a cloud provider for services and computing resources, reducing the amount of money and effort needed to develop and manage databases. Users are given tools to create and manage database instances and to control users. Some cloud providers also offer tools to manage database structures and data. Many cloud providers offer both relational (Amazon RDS, SQL Server) and NoSQL (MongoDB, Amazon DynamoDB) databases. DBaaS is a type of software as a service (SaaS).
Architecture and common characteristics
- Most database services offer web-based consoles, which the end user can use to provision and configure database instances.
- Database services consist of a database-manager component, which controls the underlying database instances using a service API. The service API is exposed to the end user, and permits users to perform maintenance and scaling operations on their database instances.
- The underlying software stack typically includes the operating system, the database and third-party software used to manage the database. The service provider is responsible for installing, patching and updating the underlying software stack and ensuring the overall health and performance of the database.
- Scalability features differ between vendors – some offer auto-scaling, others enable the user to scale up using an API, but do not scale automatically.
- There is typically a commitment for a certain level of high availability (e.g. 99.9% or 99.99%). This is achieved by replicating data and failing instances over to other database instances.
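The availability figures above translate into a yearly downtime budget, and replication raises availability because service continues while any replica is up. The following is a minimal sketch of that arithmetic, not a description of any provider's SLA: the function names are hypothetical, a year is taken as 8,760 hours, and replica failures are assumed to be independent with instantaneous failover. Under those assumptions, 99.9% allows about 8.76 hours of downtime per year and 99.99% allows about 52.6 minutes.

```python
# Illustrative arithmetic only; real SLAs define their own measurement
# windows and exclusions.
HOURS_PER_YEAR = 365 * 24  # 8760, ignoring leap years


def allowed_downtime_hours(availability: float) -> float:
    """Maximum yearly downtime, in hours, permitted by an availability target."""
    return (1.0 - availability) * HOURS_PER_YEAR


def replicated_availability(single: float, replicas: int) -> float:
    """Availability of a group that is up while at least one replica is up.

    Assumes replica failures are independent and failover is instantaneous,
    which real deployments only approximate.
    """
    return 1.0 - (1.0 - single) ** replicas


if __name__ == "__main__":
    for target in (0.999, 0.9999):
        hours = allowed_downtime_hours(target)
        print(f"{target:.2%} allows {hours:.2f} hours of downtime per year")
    combined = replicated_availability(0.99, 2)
    print(f"two independent 99% replicas give {combined:.2%}")
```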
The design and development of typical systems utilize data management and relational databases as their key building blocks. Advanced queries expressed in SQL work well with the strict relationships that are imposed on information by relational databases. However, relational database technology was not initially designed or developed for use over distributed systems. This issue has been addressed with the addition of clustering enhancements to relational databases, although some basic tasks, such as data synchronization, still require complex and expensive protocols.
Modern relational databases have shown poor performance on data-intensive systems; therefore, the idea of NoSQL has been adopted within database management systems for cloud-based systems. NoSQL storage requires no fixed table schemas and avoids join operations. "The NoSQL databases have proven to provide efficient horizontal scalability, good performance, and ease of assembly into cloud applications." Data models relying on simplified relay algorithms have also been employed in data-intensive cloud mapping applications unique to virtual frameworks.
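The difference between fixed schemas with joins and schemaless storage can be illustrated with a small sketch. It uses Python's built-in `sqlite3` module for the relational side and a plain dictionary as a stand-in for a document store. The table names, keys and sample data are hypothetical, and the dictionary imitates no particular NoSQL product.

```python
import sqlite3

# Relational form: fixed table schemas, with related data in separate tables.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT NOT NULL);
    CREATE TABLE orders (
        id INTEGER PRIMARY KEY,
        customer_id INTEGER NOT NULL REFERENCES customers(id),
        item TEXT NOT NULL
    );
    INSERT INTO customers VALUES (1, 'Ada');
    INSERT INTO orders VALUES (10, 1, 'disk'), (11, 1, 'cable');
    """
)
# Reading a customer together with their orders requires a join.
joined = conn.execute(
    "SELECT c.name, o.item FROM customers c "
    "JOIN orders o ON o.customer_id = c.id ORDER BY o.id"
).fetchall()

# Document form: each record is self-contained, so no join is needed,
# and records under different keys may carry different fields.
documents = {
    "customer:1": {"name": "Ada", "orders": [{"item": "disk"}, {"item": "cable"}]},
    "customer:2": {"name": "Lin", "tags": ["trial"]},  # no 'orders' field at all
}
doc = documents["customer:1"]
embedded = [(doc["name"], order["item"]) for order in doc["orders"]]

print(joined)
print(embedded)
```

Both reads return the same name and item pairs. The document form trades the join for duplicated or nested data that the application must keep consistent itself.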
It is also important to differentiate between cloud databases which are relational as opposed to non-relational or NoSQL:
- SQL databases
- SQL databases are one type of database which can run in the cloud, either in a virtual machine or as a service, depending on the vendor. While SQL databases are easily vertically scalable, horizontal scalability poses a challenge that cloud database services based on SQL have started to address.
- NoSQL databases
- NoSQL databases are another type of database which can run in the cloud. NoSQL databases are built to service heavy read/write loads and can scale up and down easily, and therefore they are more natively suited to running in the cloud. However, most contemporary applications are built around an SQL data model, so working with NoSQL databases often requires a complete rewrite of application code.
- Some SQL databases have developed NoSQL capabilities including JSON, binary JSON (e.g. BSON or similar variants), and key-value store data types.
- A multi-model database with relational and non-relational capabilities provides a standard SQL interface to users and applications and thus facilitates the usage of such databases for contemporary applications built around an SQL data model. Native multi-model databases support multiple data models with one core and a unified query language to access all data models.
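Horizontal scalability, mentioned above for both SQL and NoSQL databases, is commonly achieved by partitioning data across several nodes (sharding). The sketch below shows one simple way to do this: a stable hash of the key selects the node. It is an illustration under stated assumptions, not the method of any particular vendor. `shard_for` and `ShardedStore` are hypothetical names, and the "nodes" are in-memory dictionaries.

```python
import hashlib


def shard_for(key: str, shard_count: int) -> int:
    """Map a key to a shard index using a stable hash.

    SHA-256 is used instead of Python's built-in hash(), which is salted
    per process and would route the same key differently between runs.
    """
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % shard_count


class ShardedStore:
    """Toy key-value store spread over several in-memory 'nodes'."""

    def __init__(self, shard_count: int) -> None:
        self.shards = [{} for _ in range(shard_count)]

    def put(self, key: str, value) -> None:
        self.shards[shard_for(key, len(self.shards))][key] = value

    def get(self, key: str):
        # Returns None when the key is absent from its shard.
        return self.shards[shard_for(key, len(self.shards))].get(key)


store = ShardedStore(shard_count=4)
for i in range(1000):
    store.put(f"user:{i}", i)

# Each node holds only part of the data set.
print([len(shard) for shard in store.shards])
```

With plain modulo placement, changing the number of shards moves most keys to a different node. Schemes such as consistent hashing are often used to limit that movement.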
The following table lists notable database vendors with a cloud database offering, classified by their deployment model – machine image vs. database as a service – and data model, SQL vs. NoSQL.
| |Virtual Machine Deployment|Database as a Service|
|SQL Data Model| | |
|NoSQL Data Model| | |