Read file content from S3 bucket with boto3

Question

I read the filenames in my S3 bucket by doing

objs = boto3.client.list_objects(Bucket='my_bucket') while 'Contents' in objs.keys(): objs_contents = objs['Contents'] for i in range(len(objs_contents)): filename = objs_contents[i]['Key']

Now, I need to get the actual content of the file, similarly to a open(filename).readlines(). What is the best way?

score +1 · Answer 1 · Oct 23, 2018

boto3 offers a resource model that makes tasks like iterating through objects easier. Unfortunately, StreamingBody doesn't provide readline or readlines.

s3 = boto3.resource('s3')
bucket = s3.Bucket('test-bucket')
# Iterates through all the objects, doing the pagination for you. Each obj
# is an ObjectSummary, so it doesn't contain the body. You'll need to call
# get to get the whole body.
for obj in bucket.objects.all():
    key = obj.key
    body = obj.get()['Body'].read()

For a detailed explanation on S3, check this out!

https://www.youtube.com/watch?v=XjPUyGKRjZs

Hope this helps!

answered Oct 23, 2018 by anonymous

where we have to pass the access key and endpoint URL

commented Dec 13, 2019 by krishnaetl

You could declare it when you create the client

client = boto3.client( 's3', aws_access_key_id="***", aws_secret_access_key="****" )

commented Dec 13, 2019 by Aron

Kalgi · Answer 2 · Jul 4, 2019

s3_client=boto3.resource('s3')
bucket = s3_client.Bucket('test')
for obj in bucket.objects.all():
contents=obj.get()['Body'].read().decode(encoding="utf-8",errors="ignore")
for line in contents.splitlines():
print(line)

answered Jul 4, 2019 by reddy

edited Jul 4, 2019 by Kalgi

I tried iterating through a bucket using above code,but I am getting below error:

AttributeError: 'str' object has no attribute 'objects'

Kindly check and advise.

commented Jan 20, 2020 by vishal.koul27@gmail.com

Hi Vishal,

Check if you are passing your bucket in the following line.

bucket = s3_client.Bucket('test')

You might be getting an error for this line as far I know. s3_client.Bucket should have a bucket argument passed.

commented Jan 20, 2020 by Kalgi
• 52,340 points

Read file content from S3 bucket with boto3

Your comment on this question:

2 answers to this question.

Your answer

Your comment on this answer:

Your comment on this answer:

Related Questions In AWS

How to delete a file from S3 bucket using boto3?

I want to get file name from key in S3 bucket wanted to read single file from list of file present in bucket

How to directly read excel file from s3 with pandas in airflow dag?

Want my AWS s3 Bucket to read Name from CloudWatch Event

I want download all the versions of a file with 100,000+ versions from Amazon S3

public-ish write access to S3 bucket with file size limiting

How to upload a file in S3 bucket using boto3 in python

Error while uploading file to S3 bucket using Python boto3 library

Python AWS Boto3: How do i read files from S3 Bucket?

How to copy .csv file from Amazon S3 bucket?

Subscribe to our Newsletter, and get personalized recommendations.

TRENDING CERTIFICATION COURSES

TRENDING MASTERS COURSES

COMPANY

WORK WITH US

DOWNLOAD APP

CATEGORIES

CATEGORIES

TRENDING BLOG ARTICLES

TRENDING BLOG ARTICLES