Aliyun Object Storage Service

Slack Docker Pulls

This guide describes how to configure Aliyun OSS as Alluxio’s under storage system.

Aliyun Object Storage Service (OSS) is a massive, secure and highly reliable cloud storage service provided by Alibaba Cloud. OSS provides multiple storage classes to help you manage and reduce storage costs.

For more information about Aliyun OSS, please read its documentation

Prerequisites

If you haven’t already, please see Prerequisites before you get started.

In preparation for using OSS with Alluxio:

<OSS_BUCKET> Create a new bucket in the OSS console or use an existing bucket
<OSS_DIRECTORY> The directory you want to use in the bucket, either by creating a new directory or using an existing one
<OSS_ACCESS_KEY_ID> ID used to identify a user. See How to Obtain AccessKey Pair
<OSS_ACCESS_KEY_SECRET> Secret used to verify the identity of the user. See How to Obtain AccessKey Pair
<OSS_ENDPOINT> Endpoints are the domain names that other services can use to access OSS. See Regions and OSS Endpoints in the Public Cloud

Basic Setup

Use the mount table operations to add a new mount point, specifying the Alluxio path to create the mount on and the OSS path as the UFS URI. Credentials and configuration options can also be specified as part of the mount command by specifying the --option flag as described by configuring mount points.

An example command to mount oss://<OSS_BUCKET>/<OSS_DIRECTORY> to /oss:

bin/alluxio mount add --path /oss/ --ufs-uri oss://<OSS_BUCKET>/<OSS_DIRECTORY> \
  --option fs.oss.accessKeyId=<OSS_ACCESS_KEY> --option fs.oss.accessKeySecret=<OSS_ACCESS_KEY_SECRET> \
  --option fs.oss.endpoint=<OSS_ENDPOINT>

Note that if you want to mount the root of the OSS bucket, add a trailing slash after the bucket name (e.g. oss://OSS_BUCKET/).

Advanced Setup

Note that configuration options can be specified as mount options or as configuration properties in conf/alluxio-site.properties. The following sections will describe how to set configurations as properties, but they can also be set as mount options via --option <key>=<value>.

[Experimental] OSS multipart upload

The default upload method uploads one file completely from start to end in one go. We use multipart-upload method to upload one file by multiple parts, every part will be uploaded in one thread. It won’t generate any temporary files while uploading.

To enable OSS multipart upload, you need to modify conf/alluxio-site.properties to include:

alluxio.underfs.oss.multipart.upload.enabled=true

There are other parameters you can specify in conf/alluxio-site.properties to potentially speed up the upload.

# Timeout for uploading part when using multipart upload.
alluxio.underfs.object.store.multipart.upload.timeout
# Thread pool size for OSS multipart upload.
alluxio.underfs.oss.multipart.upload.threads
# Multipart upload partition size for OSS. The default partition size is 64MB. 
alluxio.underfs.oss.multipart.upload.partition.size