1 Answer
Hello, an Amazon S3 bucket can only store flat files (objects), but the Amazon Athena service can read files in your bucket, for example JSON, and run SQL queries against them. So, for example, you can create a JSON file that describes the files in your bucket and associates keywords with each file, and then use Athena to query that JSON and get all files that contain a cat.
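Here is a rough sketch of what that could look like, not a definitive setup. The table name `file_index`, the column names `s3_key` and `keywords`, and the bucket `my-example-bucket` are all made up for illustration. It assumes the description file is written as one JSON object per line, which is the layout Athena's JSON SerDes expect, and that it is uploaded under the `LOCATION` prefix of the table.

```python
import json


def build_manifest(files):
    """Return newline-delimited JSON: one object per line, one line per S3 object.

    `files` maps an S3 key to the keywords that describe that object.
    """
    lines = []
    for key, keywords in files.items():
        record = {"s3_key": key, "keywords": sorted(keywords)}
        lines.append(json.dumps(record))
    return "\n".join(lines) + "\n"


# Hypothetical Athena table over the manifest files (names are assumptions).
CREATE_TABLE_SQL = """
CREATE EXTERNAL TABLE IF NOT EXISTS file_index (
  s3_key string,
  keywords array<string>
)
ROW FORMAT SERDE 'org.openx.data.jsonserde.JsonSerDe'
LOCATION 's3://my-example-bucket/index/'
"""


def keyword_query(keyword):
    """Build the SQL that lists every file tagged with `keyword`."""
    # Only plain alphanumeric keywords are accepted, so the value can be
    # placed in the SQL string without quoting problems.
    if not keyword.isalnum():
        raise ValueError("keyword must be alphanumeric")
    return f"SELECT s3_key FROM file_index WHERE contains(keywords, '{keyword}')"


if __name__ == "__main__":
    manifest = build_manifest({
        "photos/a.jpg": ["cat", "sofa"],
        "photos/b.jpg": ["dog"],
    })
    print(manifest, end="")
    print(keyword_query("cat"))
```

You would upload the manifest text to the index prefix, run the `CREATE EXTERNAL TABLE` statement once in Athena, and then run the generated `SELECT` whenever you want the list of matching keys.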
You can use other types of files to describe your S3 bucket, including CSV or columnar data formats such as Apache Parquet and Apache ORC.
To generate these description files, you can, for example, use Amazon Rekognition to extract labels (keywords) from images and videos, or append an entry to the description file each time you add a file to the bucket.
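As a minimal sketch of the Rekognition route, assuming images only (video goes through Rekognition's separate asynchronous API) and assuming the same made-up record layout of `s3_key` plus `keywords`: the function below takes a Rekognition client as a parameter, calls `detect_labels` on an object that is already in the bucket, and returns one JSON line for the description file. The function name, the confidence threshold, and the label limit are illustrative choices.

```python
import json


def describe_image(rekognition, bucket, key, min_confidence=80.0):
    """Return one JSON line describing an image that is already stored in S3.

    `rekognition` is expected to be a boto3 Rekognition client,
    e.g. boto3.client("rekognition").
    """
    response = rekognition.detect_labels(
        Image={"S3Object": {"Bucket": bucket, "Name": key}},
        MaxLabels=10,                  # illustrative cap on labels per image
        MinConfidence=min_confidence,  # drop low-confidence labels
    )
    # Each label looks like {"Name": "Cat", "Confidence": 98.1, ...}.
    keywords = [label["Name"].lower() for label in response["Labels"]]
    return json.dumps({"s3_key": key, "keywords": keywords})


# Usage (needs boto3 and AWS credentials; bucket and key are hypothetical):
#   import boto3
#   line = describe_image(boto3.client("rekognition"),
#                         "my-example-bucket", "photos/a.jpg")
#   # then append `line` to the description file that Athena reads
```

Passing the client in, rather than creating it inside the function, keeps the function easy to try out with a stub before pointing it at a real bucket.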
Thank you! Your point on "columnar data formats" possibly hints at the one implementation I saw – where select structured relational db tables were moved to S3 and queried using Athena.