I want to move data from our dynamo DB to redshift, but I want to do this is real time as we are currently under a migration and still using the dynamo DB, any suggestions would be greatly appreciated.
I did a bit of research and found the following possible solutions
- AWS Data pipelines
- AWS Glue – heard it is not the greatest at times
- Amazon Kinesis
- AWS Lambda
- Amazon DynamoDB Streams
- AWS DMS
I don’t have to much experience in some of those services, so any help of suggestions would be appreciated greatly
2
Answers
I would look into using DynamoDB Streams. There are two options:
When changes occur, the changes are added to the stream which either Kinesis or a triggered lambda function can access. In the case of lambda, the events are stored in the triggering
event
object. You can then use the lamdba function to move your data toRedShift
.I have a tool that easily lets you integrate data from anywhere and provision usable data to any data system using any of the integration styles: ETL, ELT, API Integration, Data as a Service, or API Proxy.
Lmk if you want more info!