Information technology alters our lifestyle, the collection of digital information in terms of structured and non-structured data known as big data is rapidly developing. Big data (i.e., volume, velocity, variety, veracity, value, variability, complexity) is definitely a phenomenon with direct impact on quality of life. Now-a-days social media usage is very high used across all the organizations and in business. Data is exploding at higher rate in the volume day by day.Due to the new economic policies like liberalization, privatization and globalization, sustained economic growth and development, mass education etc., the social media usage is gaining popularity and being used as central focus of communication across the globe. The data is being published in blink of an eye through social media. Social media like Face book, Twitter, LinkedIn, whatsapp etc., are the core usage of data over the network. When unstructured data is updated, the problems such as inefficient to handle huge volume of incremental data, poor scalability and information loss arises, at the same time privacy requirements also violated when new data joins over time. In addition to heavy network traffic and applications running on unencrypted data set. Due to this there are numerous potential data is still hesitating to take advantage of Hadoop.The amount of data being created and used by organizations is going to
grow in future. According to an expert Dominic Pollard, editor at the Big Data Insight Group, said that the information generated and stored is “increasing exponentially”. Mr. Pollard said that every minute of 3 every day, uploading 48 hours of video footage to YouTube, making 47″,000 downloads from Apple’s App Store and sending over 200 million emails and 100″,000 tweets.
Cisco estimated that mobile data traffic will grow 13 fold from 2013 to 2019 and compound annual growth rate is 67%. Mobile data traffic will reach 1.1 Exabyte per month by 2019 up from 87.9% beta bytes per month in 2014. Data storage grows 60% annually and global international traffic in 2019 will be equivalent to 66 times the volume ofthe entire global internet in 2005, broadband speed will be more thandouble by 2019.Mobile data traffic will be increased three times from 2014 to2019. Large number of potential cloud customers till hesitate to take
Advantages of cloud computing. The problem arises due to the datasets in Big data applications are insecure.
To rectify these problems, the novel system is introduced where datasets are distributed in cloud efficiently for handling huge volume of incremental data by Map Reduce. Here a parallel processing framework is adopted. No need to access all data as new data joins over time. This new system helps to reduce the information loss as well as updating time for huge volume of data. The Big data paradigm has been receiving significant excitement and attention in the media and blogosphere. Personal records of people are progressively being collected by numerous government and company establishments and published in Big data for the purpose of data analysis. It is facilitated by various organizations to publish sufficiently private ideas over this information that are collected. Big data can help reduce costs, increase business agility and enable to focus on projects with a high return on investment. Nowadays cyber crime is the greatest threat to the society. Irrespective of time, space, caste, creed, nation, urban or rural, rich or poor and literate or illiterate. Security remains the number one obstacle to adoption of Big data for businesses and federal agencies. Security and privacy concerns are a significant obstacle that is preventing the extensive adoption of the public cloud across the industry. Public Big data solutions are seen as the most vulnerable options from a security perspective, leaving many federal customers to seek private alternatives to overcome security challenges. Regardless of the deployment model selected – private, public, community, or hybrid—conquering security concerns is required for cloud computing to achieve its full potential as the next generation of IT architecture. Many organizations are typically reluctant to place sensitive and valuable data in infrastructure that they do not have control. Privacy could be a double edged brand there have to be enough privacy to make sure that sensitive data concerning the people is not disclosed by the views and at a similar time there have to be enough information to perform the analysis. Moreover, an adversary who needs to collect sensitive data from the revealed views sometimes has some information concerning the people within the information. HDFS is widely used in industries and became the de facto standard platform for Big Data storage. HDFS has been evolved to provide the storage service, where data is reliably kept in a distributed fashion into different servers. In this connection, client stores Big Data into the geographically dispersed remote third party servers through an insecure channel. This excavates several security concerns as the storage service access to be made over an insecure communication channel.