Companies are acquiring massive amounts of data and providing Internet-based service to millions of people. This is extremely challenging due to the large scale of data involved and the huge number of concurrent requests by users. This subject will cover the current state-of-the-art technologies for analysing huge amounts of data and responding to millions of user requests within one second. Currently the most cost-efficient way of achieving the above aim is to use large-scale cloud-based services offered by vendors such as Amazon, Google, IBM, Microsoft, etc. You will have the opportunity to learn how to use the cloud services provided by Amazon Web Services to meet the big data needs of businesses. The subject will also cover how to program the world’s most popular Big Data analytics framework called Hadoop with a focus on Map Reduce, Hive and Apache Spark. This subject will also cover the following topics: cloud architectures, parallel database systems, key value stores, transaction support in the cloud, virtualization, and multi-tenant database systems.