Skip to main content

Week 2

Week 2

Open Data

Open data, by definition, is data that is free to use by anyone for any purpose at any time. This includes republishing data and not being affected by Copyrights or Patents by doing so. One proposed example of this is satellite data. By allowing this data to be "open data" we can allow it to be analysed to learn more about how our atmosphere is being affected by greenhouse gasses as well as monitor current changes and prevent new problems from arising. 

https://www.futurelearn.com/courses/big-data-and-the-environment/3/steps/420374


Data Scientist

A data scientist is someone who's job it is to analyse and interpret data, like usage statistics for a website in order to aid/assist a company or business with making important decisions. IEA data scientist Dr Ben Lloyd-Hughes states that there are 7 phases of a data science project which a data scientist will complete.


1. Problem Statement

2. Data Acquisition

3. Data Preparation

4. Data Exploration

5. Model Building

6. Documentation

7. Publicity Material

https://www.futurelearn.com/courses/big-data-and-the-environment/3/steps/420379

Metadata

Metadata can be defined as "data describing data". It can include a wide range of information including who wrote the data, why it was recorded, when it was recorded, the units of measurement the data is in, and if any copyrights are contained on the data. Metadata can be vital when wanting to compare on data set with another, especially when the data has different authors.

https://www.futurelearn.com/courses/big-data-and-the-environment/3/steps/443652







Comments

Popular posts from this blog

3. Growth of Big Data

3. Growth of Big Data  Kryder's law - The idea that data storage will double almost every year (13 months), as the storage capacity increases, storage will decrease in price. https://searchstorage.techtarget.com/definition/Kryders-Law growth of data  - Data is increasing at an exponential rate, more data has been made in the past two years than ever before combined. It is estimated that by the year 2020, 1.7MB of new data will be made every second for every human on earth. Within 5 years there will be 50 billion smart connected devices on earth. also by 2020, at least 1/3 of all the world's data will pass through the cloud.  The image below contains more examples of data growth as well as a graph for visual representation https://dvmobile.io/dvmobile-blog/feeling-overwhelmed-by-a-deluge-of-iot-data-iot-data-analytics-dashboards-can-help

Big Data and Bitcoins

Big Data and Bitcoins Recently, Bitcoin has seen a recent spike in value, which has prompted many people to begin "mining" for more. This in turn has had a knock on effect with computer parts, sending the price of dedicated graphics cards far more expensive than before. Big data can be used for both of these problems through methods such as google trends. For bitcoin, google trends can be tracked for how often people are giving attention and searching google for bitcoin or similar terms. Big data can also be used for tracking the current value of bitcoin and analyzing any patterns in the price changes to help predict whether the value will rise more or fall.

Week 3

Week 3 Big and small data Small data is a way of keeping the size of data down. A file such as an image could take a lot of space depending on the size of the image, but by compressing it and losing some of the quality you can massivly reduce ths size. An example of small data could be Meteorological Aviation Report (METAR). The image below shows this off, each section of the code represents a different piece of information and saves entire data entries from being typed out and takes less time to process.  https://www.futurelearn.com/courses/big-data-and-the-environment/3/steps/420387 Citizen Science Is the collection of data about the natural world that has been gathered by the general public, usually as part of a project with other data scientists. This can be used to gain vast amounts of information quickly since many people at once can gather the information as opposed to just one scientist or one sensor. One example is Thames 21, a charity which worked with Cit...