While I was able to get to the answer without using Glue, I liked the approach in the lesson. However, I had trouble accessing the data from the crawler and received the following access denied error. Any ideas?
[bd8255b9-d148-40f2-b9e7-d79a4b685eb4] ERROR : Error Access Denied (Service: Amazon S3; Status Code: 403; Error Code: AccessDenied; Request ID: F50F538224BE8072; S3 Extended Request ID: Fmw9aJo8bA2Njl43gwOs29a1IHEZxFsGOVABNlmrurjGTMet+Ne+iahx2WoKPtNCorNAdvS1mZU=) retrieving file at s3://openaq-fetches/realtime/2018-10-09/1539047052.ndjson. Tables created did not infer schemas from this file.
Not sure what the issue might be. I wasn’t able to reproduce it and can see that the file is still public. Maybe a temporary glitch?
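For anyone who wants to confirm the "still public" part themselves, here is a minimal sketch that sends an unsigned request for the file named in the error. It assumes the bucket answers on the global virtual-hosted-style endpoint (`<bucket>.s3.amazonaws.com`); the helper names `s3_uri_to_https` and `anonymous_status` are made up for illustration and are not part of any AWS SDK.

```python
import urllib.error
import urllib.request
from urllib.parse import quote, urlparse


def s3_uri_to_https(s3_uri: str) -> str:
    """Convert s3://bucket/key into a virtual-hosted-style HTTPS URL.

    Assumption: the bucket is reachable via the global S3 endpoint.
    """
    parsed = urlparse(s3_uri)
    if parsed.scheme != "s3" or not parsed.netloc:
        raise ValueError(f"not an S3 URI: {s3_uri!r}")
    key = quote(parsed.path.lstrip("/"))  # keeps "/" separators intact
    return f"https://{parsed.netloc}.s3.amazonaws.com/{key}"


def anonymous_status(s3_uri: str, timeout: float = 10.0) -> int:
    """Send an unsigned HEAD request and return the HTTP status code."""
    request = urllib.request.Request(s3_uri_to_https(s3_uri), method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # A 403 here means the anonymous request itself was denied.
        return err.code


# Example call (needs network access):
# anonymous_status("s3://openaq-fetches/realtime/2018-10-09/1539047052.ndjson")
```

If the unsigned request succeeds while the crawler still gets a 403, that would suggest the difference lies in the credentials the crawler uses rather than in the file itself.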
Hi Jmcquirt, you may want to check that your IAM role has access to the S3 bucket. Check for typos as well.
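To make the IAM check concrete, here is a rough sketch of the read permissions a crawler role would typically need for the bucket in the error message. The function name `read_policy_for` is hypothetical, and whether a missing statement like this is the actual cause of the 403 is an assumption, not something confirmed in this thread. The action names and ARN formats are the standard S3 ones.

```python
import json
from urllib.parse import urlparse


def read_policy_for(s3_uri: str) -> dict:
    """Build an identity policy allowing read access to the URI's bucket."""
    bucket = urlparse(s3_uri).netloc
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                # Listing applies to the bucket itself.
                "Effect": "Allow",
                "Action": ["s3:ListBucket"],
                "Resource": [f"arn:aws:s3:::{bucket}"],
            },
            {
                # Reading applies to the objects inside the bucket.
                "Effect": "Allow",
                "Action": ["s3:GetObject"],
                "Resource": [f"arn:aws:s3:::{bucket}/*"],
            },
        ],
    }


policy = read_policy_for(
    "s3://openaq-fetches/realtime/2018-10-09/1539047052.ndjson"
)
policy_json = json.dumps(policy, indent=2)  # text to compare against the role
```

Comparing `policy_json` with the policies attached to the role the crawler runs as is one way to spot a missing permission or a typo in the bucket name.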
I have the same issue using Glue Crawler. Terri Johnson’s answer doesn’t really help since the S3 bucket is public and we do not have the ability to control the bucket permissions.