Object Storage
A summary of Object Storage.
[01] Definition
A storage system that assigns a unique ID (identifiable by the user or server) to data regardless of its physical location, stores it in a container or bucket, and retrieves it on demand using that ID.
- Modern data is split into structured data (classifiable by some criteria: student ID, name, age, etc.) and unstructured data (typically large images, videos, etc.) that can’t be neatly classified.
- Existing storage methods — file systems (file-based, hierarchical) and block storage (block-based with sectors/tracks) — suit structured data. Object storage suits unstructured data.
- Analogy: a valet parking service. If the data object is your car, the unique ID is your receipt. You hand over the receipt and you get your car back — you don’t have to know where it was parked.
[02] Advantages
2-1. Scalability
- No need to partition
- Holds data regardless of capacity
2-2. Efficiency
- No hierarchical directory system → no inter-layer bottleneck
2-3. Availability
- Auto-replication and rolling updates supported
- No downtime
[03] How It Works (IDA)
3-1. IDA (Information Dispersal Algorithm)
- Splits data into pieces
- Distributes pieces across local or globally distributed storage nodes over the network
- The distributed nodes form a single namespace storage
Reference: IBM Tech Forum — Object Storage