Questions tagged [amazon-data-pipeline]

Simple service to transfer data between Amazon data storage services, kick off Elastic MapReduce jobs, and connect with outside data services.

From the AWS Data Pipeline homepage:

AWS Data Pipeline is a web service that helps you reliably process and move data between different AWS compute and storage services as well as on-premise data sources at specified intervals. With AWS Data Pipeline, you can regularly access your data where it’s stored, transform and process it at scale, and efficiently transfer the results to AWS services such as Amazon S3, Amazon RDS, Amazon DynamoDB, and Amazon Elastic MapReduce (EMR).

AWS Data Pipeline helps you easily create complex data processing workloads that are fault tolerant, repeatable, and highly available. You don’t have to worry about ensuring resource availability, managing inter-task dependencies, retrying transient failures or timeouts in individual tasks, or creating a failure notification system. AWS Data Pipeline also allows you to move and process data that was previously locked up in on-premise data silos.

470 questions
-1
votes
1 answer

File conversion in AWS

I am trying to find the most efficient way to process files in AWS. Read a json, xml, csv from S3 bucket Map it to another type of json, xml, csv Save it to S3 bucket Right now we are using Java with AWS lambdas but we write lots of code. AWS Data…
user3417479
  • 1,830
  • 3
  • 18
  • 23
-1
votes
1 answer

How to make SELECT from RDS cluster or How to configure data pipeline with RDS cluster?

I have 1 RDS this is Cluser (Serverless) How to connect to the RDS Cluster and make "Select * from.." using boto3? Why I asked: I tried to configure data pipeline to save data from DB Cluster to S3 bucket, but I have to use only DB Instances. And if…
-1
votes
1 answer

Sync data from RDS MySQL to Amazon Redshift

I'm trying to Sync data from RDS MySQL to Amazon Redshift. For that, created a Data Pipeline scheduled for run once. Synced one table then tried with another table named 'roles' but it failed with the following error message "output table named…
-2
votes
1 answer

cron like events - AWS Lambda or AWS Data pipeline

I have an EC2 instance that schedule many tasks (using crontab). some of them are executed every 1 min, 5 min, and so on.. I want to move all cron tasks into AWS service. I am trying to figure which AWS service can give me the best solution. I found…
Eyal Ch
  • 9,552
  • 5
  • 44
  • 54
-2
votes
1 answer

Precondition on data pipeline

I did some set of activities on json script. Need to add precondition on it. how to use precondition on json script. In precondition, i need to check the mysql db field. once flag as 'Y', start execution else should be stop the execution.
pragowin
  • 1
  • 2
1 2 3
31
32