Skip to main content

Week 2

Week 2

Open Data

Open data, by definition, is data that is free to use by anyone for any purpose at any time. This includes republishing data and not being affected by Copyrights or Patents by doing so. One proposed example of this is satellite data. By allowing this data to be "open data" we can allow it to be analysed to learn more about how our atmosphere is being affected by greenhouse gasses as well as monitor current changes and prevent new problems from arising. 

https://www.futurelearn.com/courses/big-data-and-the-environment/3/steps/420374


Data Scientist

A data scientist is someone who's job it is to analyse and interpret data, like usage statistics for a website in order to aid/assist a company or business with making important decisions. IEA data scientist Dr Ben Lloyd-Hughes states that there are 7 phases of a data science project which a data scientist will complete.


1. Problem Statement

2. Data Acquisition

3. Data Preparation

4. Data Exploration

5. Model Building

6. Documentation

7. Publicity Material

https://www.futurelearn.com/courses/big-data-and-the-environment/3/steps/420379

Metadata

Metadata can be defined as "data describing data". It can include a wide range of information including who wrote the data, why it was recorded, when it was recorded, the units of measurement the data is in, and if any copyrights are contained on the data. Metadata can be vital when wanting to compare on data set with another, especially when the data has different authors.

https://www.futurelearn.com/courses/big-data-and-the-environment/3/steps/443652







Comments

Popular posts from this blog

3. Growth of Big Data

3. Growth of Big Data  Kryder's law - The idea that data storage will double almost every year (13 months), as the storage capacity increases, storage will decrease in price. https://searchstorage.techtarget.com/definition/Kryders-Law growth of data  - Data is increasing at an exponential rate, more data has been made in the past two years than ever before combined. It is estimated that by the year 2020, 1.7MB of new data will be made every second for every human on earth. Within 5 years there will be 50 billion smart connected devices on earth. also by 2020, at least 1/3 of all the world's data will pass through the cloud.  The image below contains more examples of data growth as well as a graph for visual representation https://dvmobile.io/dvmobile-blog/feeling-overwhelmed-by-a-deluge-of-iot-data-iot-data-analytics-dashboards-can-help