6 Different Data Formats Commonly Used in Data Analytics
dev.to·4h·
Discuss: DEV

In the world of data analytics, the choice of data format plays a crucial role in efficiency, storage, and processing. Different formats cater to various needs, from simple text-based exchanges to optimized binary storage for big data systems. In this article, we’ll dive into six common data formats: CSV, SQL (relational tables), JSON, Parquet, XML, and Avro.

For each format, I’ll explain it in simple terms and represent a small dataset using it. The dataset is a simple collection of student records:

  • Name: Alice, Register Number: 101, Subject: Math, Marks: 90
  • Name: Bob, Register Number: 102, Subject: Science, Marks: 85
  • Name: Charlie, Register Number: 103, Subject: English, Marks: 95 Let’s explore each format one by one.

1. CSV (Comma Separated Values)

CSV is a strai…

Similar Posts

Loading similar posts...