Amazon DynamoDB

Populate an Amazon DynamoDB table with the results to be the source for low latency queries from your applications or APIs.

Amazon DynamoDB is a key-value and document database that delivers single-digit millisecond performance at any scale. It’s a fully managed, multiregion, multimaster, durable database with built-in security, backup and restore, and in-memory caching for internet-scale applications. DynamoDB can handle more than 10 trillion requests per day and can support peaks of more than 20 million requests per second.

bp 0

We are going to use a Hive script to perform the query on source table and save it DynamoDB.

Step 1: Go to EMR console.

bp 1

Step 2: click on c360cluster.

bp 1

Step 3: click on Steps tab.

bp 1

Step 4: Click on Add step.

  • Step type: Hive program
  • Name: loadtodynamodb
  • Script S3 location: s3://**your_stage_bucket**/library/c360dynamodbload.q

Use the bucket browser to select the application location.

bp 1

  • Input S3 location: leave blank
  • Output s3 location: leave blank
  • Arguments: leave blank
  • Action on failure: continue

bp 1

Step 5: check the job status, going from pending to running.

bp 1

Step 6: Check all applications status on the Application user interfaces tab.

bp 1