How to Migrate From AWS S3 to Linode Object Storage

Linode Object Storage is an S3-compatible service used for storing large amounts of unstructured data. This guide includes steps on how to migrate up to 100TB of static content from AWS S3 to Linode Object Storage using rclone, along with how to monitor your migration using rclone’s WebUI.

Migration Considerations

  • Migration time: Migration time depends on several factors, including the size and type of data being copied, the number of overall objects, network conditions, and the hardware limits of the infrastructure involved (e.g. bandwidth and throughput limits, network interfaces, CPU cores, and RAM).

  • Egress: Egress is the measurement of outbound data being transferred and often results in a cost to the user. Egress costs may vary depending on provider rates and the amount of data being transferred. See your provider for specific egress rates.

  • Bucket architecture: The example in this guide shows steps for migrating content from a single object storage bucket.

    Note that AWS scales by prefixes within a single bucket, whereas Linode Object Storage scales per bucket with each bucket having its own rate limit. As a result, AWS content may need to be moved by prefix to individual Object Storage buckets to meet your specific performance requirements. See: Object Storage: Optimize Applications to Avoid Rate Limiting

    Migrating content and changing over workloads in stages, rather than all at once, is recommended to ensure maximum uptime and reliability.

  • Multiple machines running in parallel: This guide provides steps for initiating and monitoring a single rclone copy job for a single object storage bucket. One option for speeding up a distributed workload migration is to run multiple rclone instances to migrate multiple buckets at the same time.

    See Linode Object Storage’s technical specifications for rate and bandwidth limits if you are running multiple virtual machines in parallel.

  • Large amounts of data: This guide assumes you are migrating less than 100 million objects and 100TB of static data. If you require a larger amount of data transferred, contact our sales and professional services teams.
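
As a rough way to reason about the migration time consideration above, the sketch below estimates how long a transfer takes at a given sustained throughput. The function name estimate_hours and the sample figures are illustrative assumptions. The result ignores per-object overhead, rate limits, and retries, so treat it as a lower bound rather than a prediction.

```shell
#!/bin/sh
# Rough transfer-time estimate (a lower bound, not a prediction).
# Usage: estimate_hours SIZE_TB THROUGHPUT_MBPS
#   SIZE_TB          data size in decimal terabytes (1 TB = 10^12 bytes)
#   THROUGHPUT_MBPS  sustained throughput in megabits per second
estimate_hours() {
    awk -v tb="$1" -v mbps="$2" 'BEGIN {
        bits = tb * 8e12                 # terabytes -> bits
        seconds = bits / (mbps * 1e6)    # bits / (bits per second)
        printf "%.1f\n", seconds / 3600  # seconds -> hours
    }'
}

# Example: 100 TB at a sustained 1,000 Mbps (hypothetical figures).
estimate_hours 100 1000
```

With these sample figures the estimate is about 222 hours, or a little over nine days, which is one reason to consider running several rclone instances in parallel for larger migrations.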

Migration Architecture Diagram

There are two architecture options for completing a data migration from AWS S3 to Linode Object Storage. One of these architectures is required to be in place prior to initiating the data migration:

Architecture 1: Utilizes an EC2 instance running rclone in the same region as the source S3 bucket. Data is transferred internally from the S3 bucket to the EC2 instance and then over the public internet from the EC2 instance to the target Linode Object Storage bucket.

  • Recommended for: speed of transfer, users with AWS platform familiarity

Architecture 2: Utilizes a Linode instance running rclone in the same region as the target Object Storage bucket. Data is transferred over the public internet from the AWS S3 bucket to the Linode instance and then internally via IPv6 to the Linode Object Storage bucket.

  • Recommended for: ease of implementation, users with Akamai platform familiarity
Rclone performance
Rclone generally performs better when placed closer to the source data being copied. During testing for both architectures, Architecture 1 achieved about 20% higher transfer speed than Architecture 2.

Architecture 1

  1. A source AWS S3 bucket with the content to be transferred.

  2. An AWS EC2 instance running rclone in the same region as the source S3 bucket. The S3 bucket communicates with the EC2 instance via VPC Endpoint within the AWS region. Your IAM policy should allow S3 access only via your VPC Endpoint.

  3. Data is copied across the public internet from the AWS EC2 instance to a target Linode Object Storage bucket. This results in egress (outbound traffic) being calculated by AWS.

  4. The target Linode Object Storage bucket receives data from the EC2 instance. The migration status can be monitored using rclone’s WebUI.

Architecture 2

  1. A source AWS S3 bucket with the content to be transferred.

  2. A Compute Instance running rclone in the same Akamai core compute region as the target Linode Object Storage bucket.

  3. Data is copied across the public internet from the AWS S3 bucket to the target Linode instance. This results in egress being calculated by AWS.

  4. The target Linode Object Storage bucket receives the data via IPv6 from the Compute Instance on the region’s private network. Inbound, private IPv6 data to Linode Object Storage is free of charge. The migration status can be monitored using rclone’s WebUI.

Prerequisites and Required Information

  • A virtual machine with rclone installed. This guide recommends a 16GB dedicated virtual machine with 8 CPU cores. The plan you require may vary depending on your workload.

  • The public IPv4 address of your virtual machine.

  • As a security best practice, use a firewall to restrict inbound traffic to port 5572, the default port used by rclone's remote control server. Restricting access matters because the WebUI is served over unencrypted HTTP. For an additional layer of security, consider setting up an HTTPS gateway.

  • An up-to-date web browser. This is used to access the rclone WebUI while monitoring the migration.

  • SSH access to the virtual machine with sudo user privileges.

  • An existing AWS S3 bucket with:

    • IAM policy allowing S3 access only to VPC Endpoint
    • Bucket name
    • Access key
    • Secret key
    • Region ID
  • If using Architecture 1, there must be a VPC gateway endpoint created for S3 in the same VPC where your EC2 instance is deployed. This should be the same region as your S3 bucket.

  • An existing Linode Object Storage bucket with:

    • Bucket name
    • Access key
    • Secret key
    • Region ID and endpoint URL
    Object Storage Access Keys
    When creating Object Storage access keys, it is a best practice to limit individual bucket access by region along with read/write permissions. See: Manage access keys
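
Before moving on, it can help to confirm that the tools this guide relies on are present on the virtual machine. The following is a minimal sketch of such a pre-flight check. The helper name check_commands is hypothetical, and including curl in the list is an assumption (it is only needed if you later query rclone's remote control server from the command line).

```shell
#!/bin/sh
# Report whether each named command is available on PATH.
# Prints "ok: NAME" or "missing: NAME" for each argument and
# returns non-zero if any command is missing.
check_commands() {
    status=0
    for cmd in "$@"; do
        if command -v "$cmd" >/dev/null 2>&1; then
            echo "ok: $cmd"
        else
            echo "missing: $cmd"
            status=1
        fi
    done
    return "$status"
}

# rclone performs the copy; curl is optional (see the note above).
check_commands rclone curl || echo "Install the missing tools before continuing."
```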

Migration Steps

Initiating the Data Migration

  1. On the instance running rclone, configure rclone to communicate with your source AWS S3 bucket and your target Linode Object Storage bucket.

    To view the location of the rclone config file, run:

    rclone config file

    If the file does not exist yet, you should see output similar to the following:

    Configuration file doesn't exist, but rclone will use this path:
    /home/user/.config/rclone/rclone.conf
  2. Using the text editor of your choice, add the following configuration to your config file. Replace the following fields with your own corresponding provider and bucket values. Save your changes when complete:

    AWS S3

    • AWS-ACCESS-KEY: Your AWS access key
    • AWS-SECRET-KEY: Your AWS secret key
    • us-east-1: The AWS region ID for your S3 bucket

    Linode Object Storage

    • LINODE-ACCESS-KEY: Your Linode Object Storage access key
    • LINODE-SECRET-KEY: Your Linode Object Storage secret key
    • us-lax-1: The region ID for your Linode Object Storage bucket
    
    [aws]
    type = s3
    provider = AWS
    access_key_id = AWS-ACCESS-KEY
    secret_access_key = AWS-SECRET-KEY
    region = us-east-1
    
    [linode]
    type = s3
    provider = Ceph
    access_key_id = LINODE-ACCESS-KEY
    secret_access_key = LINODE-SECRET-KEY
    endpoint = us-lax-1.linodeobjects.com
    acl = private
    Rclone Providers
    The lines [aws] and [linode] define the remote providers for your source and target endpoints, respectively. See Supported Providers for a complete list of supported rclone providers.
  3. Confirm connectivity to AWS S3 using your defined remote provider, aws:

    rclone lsd aws:

    If successful, you should see a list of available buckets:

              -1 2024-08-30 09:10:47        -1 aws-bucket-name
  4. Confirm connectivity to Linode Object Storage using the other defined remote provider, linode:

    rclone lsd linode:

    Similar to above, you should see a list of available buckets:

              -1 2024-08-28 14:46:47        -1 linode-bucket-name
  5. Run the rclone copy command to initiate the migration.

    Replace aws-bucket-name and linode-bucket-name with the names of your AWS S3 and Linode Object Storage buckets, respectively. Replace USERNAME and PASSWORD with the username and password you want to use to access the rclone WebUI.

    If using Architecture 2, also include the --bind ::0 flag to write data from your Compute Instance to your Object Storage bucket using IPv6:

    rclone copy aws:aws-bucket-name/ linode:linode-bucket-name/ --transfers 50 --rc --rc-addr=0.0.0.0:5572 --log-file=rclone.log --log-level=ERROR --rc-web-gui --rc-user USERNAME --rc-pass PASSWORD
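
The configuration from step 2 can also be generated by a script instead of being typed into an editor. The following is a minimal sketch under stated assumptions: the function name write_rclone_conf and the environment variable names (AWS_KEY, AWS_SECRET, AWS_REGION, LINODE_KEY, LINODE_SECRET, LINODE_ENDPOINT) are hypothetical and are not defined by rclone. The function overwrites any existing file at the given path and restricts it to the current user, since it contains secret keys.

```shell
#!/bin/sh
# Write the two remotes from this guide to the config file at path $1.
# The function body runs in a subshell so the umask change does not
# leak into the calling shell. Any existing file at $1 is overwritten.
write_rclone_conf() (
    conf="$1"
    umask 077                          # new files are owner-only
    mkdir -p "$(dirname "$conf")"
    cat > "$conf" <<EOF
[aws]
type = s3
provider = AWS
access_key_id = ${AWS_KEY}
secret_access_key = ${AWS_SECRET}
region = ${AWS_REGION:-us-east-1}

[linode]
type = s3
provider = Ceph
access_key_id = ${LINODE_KEY}
secret_access_key = ${LINODE_SECRET}
endpoint = ${LINODE_ENDPOINT:-us-lax-1.linodeobjects.com}
acl = private
EOF
    chmod 600 "$conf"                  # also tighten a pre-existing file
)

# Usage (use the path reported by `rclone config file`):
#   AWS_KEY=... AWS_SECRET=... LINODE_KEY=... LINODE_SECRET=... \
#       write_rclone_conf "$HOME/.config/rclone/rclone.conf"
#   rclone lsd aws: && rclone lsd linode:
```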

Rclone Copy Command Breakdown

  • aws:aws-bucket-name/: The AWS remote provider and source S3 bucket. Including the slash at the end informs the copy command to include everything within the bucket.

  • linode:linode-bucket-name/: The Linode remote provider and target Object Storage bucket.

  • --transfers 50: The transfers flag tells rclone how many items to transfer in parallel. Defaults to a value of 4. 50 here speeds up the transfer process by moving up to 50 items in parallel at a given time.

    Your transfers value may be different depending on how many objects you are transferring, and you may need to experiment to find the value that works best for your use case. High enough values may result in bandwidth limits being reached. Increasing this value also increases the CPU usage used by rclone.

  • --rc: Stands for “remote control”. The rc option starts the HTTP server that listens for remote control requests.

  • --rc-addr=0.0.0.0:5572: Specifies the address and port used to access the WebUI. 0.0.0.0 instructs the remote control server to listen on all IPv4 addresses, and 5572 is the default port number used by rclone to access the WebUI.

  • --log-file=rclone.log: The file where rclone writes logs. This file is created in the working directory from where the copy command is run.

  • --log-level=ERROR: The type of logs to be written to your log file. ERROR here specifies only errors are written to the rclone.log file.

  • --rc-web-gui: Serves the WebUI on the default rclone port (5572).

  • --rc-user USERNAME and --rc-pass PASSWORD: The username and password used to access the WebUI.

  • --bind ::0 (for use with Architecture 2): Tells rclone to write data via IPv6. Note that writing data over IPv6 from a Linode instance to an Object Storage bucket in the same region is free of charge.

Using the htpasswd flag
An alternative to the --rc-user and --rc-pass combination is the --rc-htpasswd flag. This flag takes the path to an htpasswd file containing the username and password combinations allowed to log into the rclone WebUI. See Remote controlling rclone with its API.
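
To make the difference between the two architectures concrete, the sketch below assembles the copy command from the flags described above and appends --bind ::0 only for Architecture 2. The helper name build_copy_cmd is hypothetical. It prints the command rather than running it, so you can review the result and substitute your own USERNAME and PASSWORD before executing it.

```shell
#!/bin/sh
# Print the rclone copy command for the chosen architecture (1 or 2).
# Usage: build_copy_cmd ARCH SRC_BUCKET DST_BUCKET
build_copy_cmd() {
    arch="$1"; src="$2"; dst="$3"
    cmd="rclone copy aws:${src}/ linode:${dst}/ --transfers 50"
    cmd="$cmd --rc --rc-addr=0.0.0.0:5572 --rc-web-gui"
    cmd="$cmd --rc-user USERNAME --rc-pass PASSWORD"
    cmd="$cmd --log-file=rclone.log --log-level=ERROR"
    # Architecture 2 writes to Object Storage over IPv6.
    if [ "$arch" = "2" ]; then
        cmd="$cmd --bind ::0"
    fi
    printf '%s\n' "$cmd"
}

# Example: print the Architecture 2 variant of the command.
build_copy_cmd 2 aws-bucket-name linode-bucket-name
```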

Optional Flags

  • --tpslimit XXX: Limits the number of HTTP transactions per second. For larger transfers, it is considered a best practice to set the tpslimit below the infrastructure's requests per second (rps) limit. If the rps limit is reached, a 503 SlowDown error may result.

    Example: If the infrastructure’s requests per second limit is 750 rps, set the tpslimit to 725:

    --tpslimit 725
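
The example above leaves a headroom of 25 requests per second below the limit. If you script your migrations, that arithmetic can be captured in a small helper. This is only a sketch: the name tpslimit_for is hypothetical, and the default headroom of 25 is an assumption taken from the example rather than a recommended value for every workload.

```shell
#!/bin/sh
# Suggest a --tpslimit value a fixed headroom below a known rps limit.
# Usage: tpslimit_for RPS_LIMIT [HEADROOM]   (HEADROOM defaults to 25)
tpslimit_for() {
    rps="$1"
    headroom="${2:-25}"
    echo $(( rps - headroom ))
}

# Example: reproduce the flag from the example above.
echo "--tpslimit $(tpslimit_for 750)"
```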

Monitoring the Migration

To monitor the status of the rclone copy command above, you can access the rclone WebUI from a web browser.

  1. In a web browser window, navigate to your instance’s address over port 5572. Replace IP-ADDRESS with the IPv4 address of your instance:

    http://IP-ADDRESS:5572
  2. When prompted, enter the username and password you specified with the --rc-user and --rc-pass flags.

  3. Once logged in, you should see active running jobs along with multiple monitoring statistics, including: job status, throughput and speed, bandwidth max speed, total objects transferred, amount of data transferred, and more.

  4. The WebUI will disconnect automatically when the copy job is complete.
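
The same remote control server that serves the WebUI also answers API requests, so progress can be checked from a terminal as well. The following is a minimal sketch, assuming curl is installed and that the core/stats endpoint of your rclone version reports bytes transferred and total bytes. The helper names fetch_stats and percent_done are hypothetical.

```shell
#!/bin/sh
# Fetch raw transfer statistics (JSON) from the rclone remote control server.
# Usage: fetch_stats IP-ADDRESS USERNAME PASSWORD
fetch_stats() {
    curl -sS -u "$2:$3" -X POST "http://$1:5572/core/stats"
}

# Convert bytes transferred and total bytes into a percentage.
# Usage: percent_done BYTES TOTAL_BYTES
percent_done() {
    awk -v b="$1" -v t="$2" 'BEGIN {
        if (t > 0) printf "%.1f\n", 100 * b / t
        else print "0.0"
    }'
}

# Example with hypothetical byte counts read from the JSON output:
percent_done 250 1000
```

As with the WebUI, requests are sent over HTTP, so the credentials are not encrypted in transit unless you place an HTTPS gateway in front of the instance.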

Verify the Migration

You can compare the number of objects in both your source and target buckets along with the total size of the buckets to verify full completion of the copy job.

From the Command Line

AWS S3:

rclone size aws:aws-bucket-name/
Total objects: 87.275k (87275)
Total size: 647.612 GiB (695368455398 Byte)

Linode Object Storage:

rclone size linode:linode-bucket-name/
Total objects: 87.275k (87275)
Total size: 647.612 GiB (695368455398 Byte)
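
For larger buckets, comparing the two outputs by eye is error-prone. The sketch below compares them in a script, assuming the JSON output of rclone size (the --json flag) contains count and bytes fields. The helper names json_number and sizes_match are hypothetical, and the sed-based parsing is a simplification that only handles flat JSON like the sample shown in the comment.

```shell
#!/bin/sh
# Extract a numeric field (for example, count or bytes) from flat JSON
# such as {"count":87275,"bytes":695368455398}.
# Usage: json_number FIELD JSON
json_number() {
    printf '%s\n' "$2" | sed -n "s/.*\"$1\": *\([0-9][0-9]*\).*/\1/p"
}

# Succeed only when both summaries parse and report the same
# object count and the same total bytes.
# Usage: sizes_match SOURCE_JSON TARGET_JSON
sizes_match() {
    c1="$(json_number count "$1")"; b1="$(json_number bytes "$1")"
    c2="$(json_number count "$2")"; b2="$(json_number bytes "$2")"
    [ -n "$c1" ] && [ -n "$b1" ] && [ "$c1" = "$c2" ] && [ "$b1" = "$b2" ]
}

# Usage (requires working remotes, so shown as comments):
#   src="$(rclone size --json aws:aws-bucket-name/)"
#   dst="$(rclone size --json linode:linode-bucket-name/)"
#   sizes_match "$src" "$dst" && echo "match" || echo "MISMATCH"
```

Matching counts and sizes indicate that the copy completed, but they do not compare object contents. For a stricter comparison, rclone also provides a check command; consult the rclone documentation for how it handles hashes on each provider.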

From a Browser

Alternatively, you can compare the number of objects and total bucket sizes from the AWS Management Console and Cloud Manager on Akamai:

AWS Management Console:

  • Navigate to Amazon S3
  • Select the source bucket name
  • Click the Metrics tab
  • Under Bucket metrics, see Total bucket size and Total number of objects

Cloud Manager:

  • Navigate to Object Storage
  • Find your target bucket name
  • See the Size column for the total bucket size and the Objects column for the total number of objects

Next Steps

There are several next steps to consider after a successful object storage migration:

  • Change over your object storage endpoints to your new target bucket. For example, if you have backups or logs being sent to your old source bucket, edit your jobs to point to the new bucket endpoints.

  • Edit your configurations to match your new object storage credentials. In addition to changing the target endpoints for your workloads, your access keys and secret keys need to be updated to your new Linode Object Storage keys.

  • Confirm the changeover is functioning as expected. Allow some time to make sure your updated workloads and jobs are interacting successfully with Linode Object Storage. Once you confirm everything is working as expected, you can safely delete the original source bucket and its contents.
