Important
Unstructured data is data that does not conform to a data model and has no easily identifiable structure.
Unstructured data cannot be easily used by programs, and is difficult to analyze. Examples of unstructured data could be the contents of an email, contents of a word document, data from social media, photos, videos, survey results, etc.
An example of unstructured data
An simple example of unstructured data is a string that contains interesting information inside of it, but that has not been formatted into a well defined schema. An example is given below:
UnstructuredString | |
---|---|
Record 1 | ”Bob is 29” |
Record 2 | ”Mary just turned 30” |
Unstructured vs structured data
In contrast with unstructured data, structured data refers to data that has been formatted into a well-defined schema. An example would be data that is stored with precisely defined columns in a relational database or excel spreadsheet. Examples of structured fields could be age, name, phone number, credit card numbers or address. Storing data in a structured format allows it to be easily understood and queried by machines and with tools such as SQL.