S3 Notes

S3 stands for Simple Store Service. It provides object based storage

Buckets

Folders are called buckets

S3 Buckets follows universal Namespace. It has to be unique across across all amazon accounts

Objects

Files are Objects
Objects in S3 can be from 0 bytes to 5TB
Objects are stored in Bucket
Objects in S3 consists of key-value pair, Version ID, Metadata and sub resources
HTTP code is returned when actions are take against objects in S3 buckets
An individual object can be versioned. So you could access different versions of same objects
Read after Write consistency for PUTS of new objects
Eventual Consistency for Overwriting PUTS and deletes

Storage classes

There are six different storage classes that provides guarantees different availability and durability. Billing is performed based on chosen storage classes. Lifecyle management can be done on both bucket level and object level to move object between different storage classes to save on costs;

Security

Encryption is Transit is provided by SSL/TLS otherwise HTTPS
Access to buckets are provided by Bucket policies
Access to objects are provided by ACL
MultiFactor authentication can be enabled to at bucket level
IAM Policies & groups are used to control access to S3 buckets
An bucket with Public access can have objects with Private access

Governance and Compliance

Write once read many(WORM) Model
Object Lock can be enabled both at object level or bucket level
Object Lock has governance mode and compliance mode.
Governance mode, the object can be place to lock for certain amount of period and the root user or other user with permission still can delete the object.
Complaince mode object can’t be deleted even by the root user
S3 Valut lock is for S3 and S3 Glacier valult lock is for S3 Glacier.

Performance

S3 Perfix is the layer between filename and bucketname
The more prefix we have the better the performance. Why?
3500 requests per second for PUT/COPY/POST/DELETE and 5,500 for GET/HEAD request per prefixx SSE-KMS the KMS quota limit kicks in
Multipart uploads recommended for file 100MB but required for files over 5GB. This increase efficiancy
Byte-Range Fetches. Parallelize downloads If there is a failure , then need to download only necessary part

S3 Select & Glacier Select

SQL Expression to download file from S3 to achieve 400% increase inefficiency.

Use bucket policies & IAM to apply across entire bucket.
Using Bucket ACl and IAM.
Both Allow only Programmatic access only not via console
Cross-account IAM Roles, Programmatic and Consoles access

Cross Region Replication

Version should be enabled for cross region replication
Storage class can be different for cross region replication
Old objects doesn’t get replicated. Only new objects get replicated.
Only new version get replicated
Object permission on source doesn’t reflected to destination bucket.
Delete markers are not replicated

Transfer Acceleration

Cloudfront Edge Network.
User upload to edge location and it get transfered to S3 bucket