Data Types
Structured Data
Structured data is data that is organized in a tabular format with rows and columns. Each column represents a specific attribute, and each row represents a single record.
Structured data is highly organized and easily searchable. It is typically stored in relational databases and can be queried using SQL.
Examples:
- CSV
- Excel
- SQL databases (MySQL, PostgreSQL, Oracle)
- Google Sheets
- Parquet
- ...
Semi-Structured Data
Semi-structured data is data that does not fit into a tabular format but has some organizational properties that make it easier to process.
Semi-structured data is flexible and adaptable. It can be stored in NoSQL databases or document-oriented databases.
Examples:
- JSON
- XML
- YAML
- Log files
- ...
Unstructured Data
Unstructured data is data that does not have a predefined format or structure. It can be text-heavy and difficult to analyze using traditional methods.
Unstructured data is complex and requires advanced processing techniques such as natural language processing (NLP) or machine learning.
Examples:
- Text files
- Images
- Audio files
- Video files
- Social media posts
- ...