Skip to main content

Week 3

Week 3

Big and small data

Small data is a way of keeping the size of data down. A file such as an image could take a lot of space depending on the size of the image, but by compressing it and losing some of the quality you can massivly reduce ths size. An example of small data could be Meteorological Aviation Report (METAR). The image below shows this off, each section of the code represents a different piece of information and saves entire data entries from being typed out and takes less time to process. 
EGBB - Airfield identifier 161150Z - Time 27011KT- windspeed and direction 4000 - horizontal visibility RA - weather OVC015 - cloudbase and cover 13/11 - air temperature and dewpoint Q1003 - air pressure
https://www.futurelearn.com/courses/big-data-and-the-environment/3/steps/420387

Citizen Science

Is the collection of data about the natural world that has been gathered by the general public, usually as part of a project with other data scientists. This can be used to gain vast amounts of information quickly since many people at once can gather the information as opposed to just one scientist or one sensor. One example is Thames 21, a charity which worked with Citizen Science to create a catalog of all the types of litter which affect the flow of the river.


Visualization 

When putting data into a visual form, it is important that people are able to understand the data. Data should be visualized as useful, easy to read, and reliable. Visualization comes with many advantages and disadvantages such as:
Advantages - 
-Saves time as you don't have to read through large amounts of data
-reduces the amount of info people need to retain
-arguably more engaging as it is more expressive and illustrated

Disadvantages - 
-often only scratches the surface of how much data there actually is
-powerful visualization techniques can require extensive learning to do

Comments

Popular posts from this blog

3. Growth of Big Data

3. Growth of Big Data  Kryder's law - The idea that data storage will double almost every year (13 months), as the storage capacity increases, storage will decrease in price. https://searchstorage.techtarget.com/definition/Kryders-Law growth of data  - Data is increasing at an exponential rate, more data has been made in the past two years than ever before combined. It is estimated that by the year 2020, 1.7MB of new data will be made every second for every human on earth. Within 5 years there will be 50 billion smart connected devices on earth. also by 2020, at least 1/3 of all the world's data will pass through the cloud.  The image below contains more examples of data growth as well as a graph for visual representation https://dvmobile.io/dvmobile-blog/feeling-overwhelmed-by-a-deluge-of-iot-data-iot-data-analytics-dashboards-can-help