Unified Namespace

Slack Docker Pulls GitHub edit source

This page summarizes how to manage different under storage systems in the Alluxio file system namespace.

Introduction

Alluxio enables effective data management across different storage systems through its use of transparent naming and mounting API.

Unified Namespace

One of the key benefits that Alluxio provides is a unified namespace to the applications. This is an abstraction that allows applications to access multiple independent storage systems through the same namespace and interface. Rather than communicating to each individual storage system, applications can delegate this responsibility by connecting through Alluxio, which will handle the different underlying storage systems.

unified

The directory specified by the master configuration property alluxio.master.mount.table.root.ufs is mounted to the root of the Alluxio namespace. This directory identifies the “primary storage” for Alluxio. In addition, users can use the mounting API to add and remove data sources:

  1. void mount(AlluxioURI alluxioPath, AlluxioURI ufsPath);
  2. void mount(AlluxioURI alluxioPath, AlluxioURI ufsPath, MountOptions options);
  3. void unmount(AlluxioURI path);
  4. void unmount(AlluxioURI path, UnmountOptions options);

For example, mount a S3 bucket to the Data directory through

  1. mount(new AlluxioURI("alluxio://host:port/Data"), new AlluxioURI("s3://bucket/directory"));

UFS Namespace

In addition to the unified namespace Alluxio provides, each underlying file system that is mounted in Alluxio namespace has its own namespace; this is referred to as the UFS namespace. If a file in the UFS namespace is changed without going through Alluxio, the UFS namespace and the Alluxio namespace can potentially get out of sync. When this happens, a UFS Metadata Sync operation is required to synchronize the two namespaces.

Transparent Naming

Transparent naming maintains an identity between the Alluxio namespace and the underlying storage system namespace.

transparent

When a user creates objects in the Alluxio namespace, they can choose whether these objects should be persisted in the underlying storage system. For objects that are persisted, Alluxio preserves the object paths, relative to the underlying storage system directory in which Alluxio objects are stored. For instance, if a user creates a top-level directory Users with subdirectories Alice and Bob, the directory structure and naming is preserved in the underlying storage system. Similarly, when a user renames or deletes a persisted object in the Alluxio namespace, it is renamed or deleted in the underlying storage system.

Alluxio transparently discovers content present in the underlying storage system which was not created through Alluxio. For instance, if the underlying storage system contains a directory Data with files Reports and Sales, all of which were not created through Alluxio, their metadata will be loaded into Alluxio the first time they are accessed, such as when a user requests to open a file. The contents of the file is not loaded to Alluxio during this process. To load the file contents into Alluxio, one can read the data using FileInStream or use the load command of the Alluxio shell.

Mounting Under Storage Systems

Mounting an Under storage system to the Alluxio file system namespace is the mechanism for defining the association between the Alluxio namespace and the UFS namespace. Mounting in Alluxio works similarly to mounting a volume in a Linux file system. The mount command attaches a UFS to the file system tree in the Alluxio namespace.

Root Mount Point

The root mount point of the Alluxio namespace is configured in conf/alluxio-site.properties on the masters. The following line is an example configuration where an HDFS path is mounted to the root of the Alluxio namespace.

  1. alluxio.master.mount.table.root.ufs=hdfs://HDFS_HOSTNAME:8020

Mount options for the root mount point are configured using the configuration prefix:

alluxio.master.mount.table.root.option.<some alluxio property>

For example, the following configuration adds AWS credentials for the root mount point.

  1. alluxio.master.mount.table.root.option.aws.accessKeyId=<AWS_ACCESS_KEY_ID>
  2. alluxio.master.mount.table.root.option.aws.secretKey=<AWS_SECRET_ACCESS_KEY>

The following configuration shows how to set other parameters for the root mount point.

  1. alluxio.master.mount.table.root.option.alluxio.security.underfs.hdfs.kerberos.client.principal=client
  2. alluxio.master.mount.table.root.option.alluxio.security.underfs.hdfs.kerberos.client.keytab.file=keytab
  3. alluxio.master.mount.table.root.option.alluxio.security.underfs.hdfs.impersonation.enabled=true
  4. alluxio.master.mount.table.root.option.alluxio.underfs.version=2.7

Nested Mount Points

In addition to the root mount point, other under file systems can be mounted into Alluxio namespace. These additional mount points are added to Alluxio at runtime, via the mount command. The --option flag allows the user to pass additional parameters to the mount operation, such as credentials.

  1. # the following command mounts an hdfs path to the Alluxio path `/mnt/hdfs`
  2. $ ./bin/alluxio fs mount /mnt/hdfs hdfs://host1:9000/data/
  3. # the following command mounts an s3 path to the Alluxio path `/mnt/s3` with additional options specifying the credentials
  4. $ ./bin/alluxio fs mount \
  5. --option aws.accessKeyId=<accessKeyId> --option aws.secretKey=<secretKey> \
  6. /mnt/s3 s3://data-bucket/

Note that mount points can be nested as well. For example, if a UFS is mounted at alluxio:///path1, another UFS could be mounted at alluxio:///path1/path2.

Mount UFS with Specific Versions

Alluxio supports mounting HDFS with specified versions. As a result, users can mount HDFS with different versions into a single Alluxio namespace. Please refer to HDFS Under Store for more details.

Relationship Between Alluxio and UFS Namespace

Alluxio provides a unified namespace, acting as a cache for data in one or more under file storage systems. This section discusses how Alluxio interacts with under file systems to discover their files and make them available through Alluxio.

Accessing UFS files through Alluxio should behave the same as accessing them directly through the UFS. If the root UFS is s3://bucket/data, listing alluxio:/// should show the same result as listing s3://bucket/data. Running cat on alluxio:///file should show the same result as running cat on s3://bucket/data/file.

Alluxio loads metadata from the UFS as needed. In the above example, Alluxio does not have information about s3://bucket/data/file at startup. The file will be discovered when the user lists alluxio:/// or tries to cat alluxio:///file. This prevents unnecessary work when mounting a new UFS.

By default, Alluxio expects that all modifications to under file systems occur through Alluxio. This allows Alluxio to only scan each UFS directory once, significantly improving performance when UFS metadata operations are slow. In the scenario where changes are made to the UFS outside of Alluxio, the metadata sync feature is used to synchronize the two namespaces.

UFS Metadata Sync

The UFS metadata sync feature has been available since version 1.7.0.

When Alluxio scans a UFS directory and loads metadata for its sub-paths, it creates a copy of the metadata so that future operations do not need to load from the UFS. The cached copy of the metadata is refreshed according to the alluxio.user.file.metadata.sync.interval client configuration property. This property applies to client side operations. For example, if a client executes an operation with the interval set to one minute, the relevant metadata will be refreshed from the UFS if the last refresh was over a minute ago. A value of 0 indicates that the metadata will always be synced for every operation, whereas the default value of -1 indicates the metadata is never synced again after the initial load.

Low interval values allow Alluxio clients to quickly discover external modifications to the UFS, at the cost of decreasing performance since the number of calls to the UFS increases.

Metadata sync keeps a fingerprint of each UFS file so that Alluxio can update the file if it changes. The fingerprint includes information such as file size and last modified time. If a file is modified in the UFS, Alluxio will detect this from the fingerprint, free the existing data for that file, and reload the metadata for the updated file. If a file is added or deleted in the UFS, Alluxio will update the metadata in its namespace as well.

Techniques for managing UFS sync

Periodic Metadata Sync

If the UFS updates at a scheduled interval, you can manually trigger the sync command after the update. Set the sync interval to 0 by running the command:

  1. $ ./bin/alluxio fs ls -R -Dalluxio.user.file.metadata.sync.interval=0 /path/to/sync

Centralized Configuration

For clusters where jobs run with data from a frequently updated UFS, it is inconvenient for every client to specify a sync interval. Requests will be handled with a default sync interval if one is set in the master configuration.

In alluxio-site.properties on master nodes:

alluxio.user.file.metadata.sync.interval=1m

Note master nodes need to be restarted to pick up configuration changes.

Other Methods for Loading New UFS Files

The UFS sync discussed previously is the recommended method for synchronizing changes from the UFS. Here are some other methods for loading files:

  • alluxio.user.file.metadata.load.type: This client property can be set to either ALWAYS, ONCE, or NEVER. It acts similar to alluxio.user.file.metadata.sync.interval, but with two caveats:

    1. It only discovers new files and does not reload modified or deleted files.
    2. It only applies to the exists, list, and getStatus RPCs.

    ALWAYS will always check the UFS for new files, ONCE will use the default behavior of only scanning each directory once ever, and NEVER will prevent Alluxio from scanning for new files at all.

  • alluxio fs ls -f /path: The -f option to ls acts is equivalent to setting alluxio.user.file.metadata.load.type to ALWAYS. It discovers new files but does not detect modified or deleted UFS files. The only way to detect modified or deleted UFS files is to pass the -Dalluxio.user.file.metadata.sync.interval=0 option to ls.

Examples

The following examples assume that Alluxio source code exists in the ${ALLUXIO_HOME} directory and an instance of Alluxio is running locally.

Transparent Naming

Create a temporary directory in the local file system that will be used as the under storage to mount:

  1. $ cd /tmp
  2. $ mkdir alluxio-demo
  3. $ touch alluxio-demo/hello

Mount the local directory created into Alluxio namespace and verify it appears in Alluxio:

  1. $ cd ${ALLUXIO_HOME}
  2. $ ./bin/alluxio fs mount /demo file:///tmp/alluxio-demo
  3. Mounted file:///tmp/alluxio-demo at /demo
  4. $ ./bin/alluxio fs ls -R /
  5. ... # note that the output should show /demo but not /demo/hello

Verify that the metadata for content not created through Alluxio is loaded into Alluxio the first time it is accessed:

  1. $ ./bin/alluxio fs ls /demo/hello
  2. ... # should contain /demo/hello

Create a file under the mounted directory and verify the file is created in the underlying file system with the same name:

  1. $ ./bin/alluxio fs touch /demo/hello2
  2. /demo/hello2 has been created
  3. $ ls /tmp/alluxio-demo
  4. hello hello2

Rename a file in Alluxio and verify the corresponding file is also renamed in the underlying file system:

  1. $ ./bin/alluxio fs mv /demo/hello2 /demo/world
  2. Renamed /demo/hello2 to /demo/world
  3. $ ls /tmp/alluxio-demo
  4. hello world

Delete a file in Alluxio and verify the file is also deleted in the underlying file system:

  1. $ ./bin/alluxio fs rm /demo/world
  2. /demo/world has been removed
  3. $ ls /tmp/alluxio-demo
  4. hello

Unmount the mounted directory and verify that the directory is removed from the Alluxio namespace, but is still preserved in the underlying file system:

  1. $ ./bin/alluxio fs unmount /demo
  2. Unmounted /demo
  3. $ ./bin/alluxio fs ls -R /
  4. ... # should not contain /demo
  5. $ ls /tmp/alluxio-demo
  6. hello

Metadata Active Sync for HDFS

In version 2.0, we introduced a new feature for maintaining synchronization between Alluxio space and the UFS when the UFS is HDFS. The feature, called active sync, listens for HDFS events and periodically synchronizes the metadata between the UFS and Alluxio namespace as a background task on the master. Because active sync feature depends on HDFS events, this feature is only available when the UFS HDFS versions is later than 2.6.1. You may need to change the value for alluxio.underfs.version in your configuration file. Please refer to HDFS Under Store for a list of supported Hdfs versions.

To enable active sync on a directory, issue the following Alluxio command.

  1. $ ./bin/alluxio fs startSync /syncdir

You can control the active sync interval by changing the alluxio.master.ufs.active.sync.interval option, the default is 30 seconds.

To disable active sync on a directory, issue the following Alluxio command.

  1. $ ./bin/alluxio fs stopSync /syncdir

Note: When startSync is issued, a full scan of the sync point is scheduled. If run as the Alluxio superuser, stopSync will interrupt any full scans which have not yet ended. If run as any other user, stopSync will wait for the full scan to finish before completing.

You can also examine which directories are currently under active sync, with the following command.

  1. $ ./bin/alluxio fs getSyncPathList

Quiet period for Active Sync

Active sync also tries to avoid syncing when the target directory is heavily used. It tries to look for a quiet period in UFS activity to start syncing between the UFS and the Alluxio space, to avoid overloading the UFS when it is busy. There are two configuration options that control this behavior.

alluxio.master.ufs.active.sync.max.activities is the maximum number of activities in the UFS directory. Activity is a heuristic based on the exponential moving average of number of events in a directory. For example, if a directory had 100, 10, 1 event in the past three intervals. Its activity would be 100/10*10 + 10/10 + 1 = 3 alluxio.master.ufs.active.sync.max.age is the maximum number of intervals we will wait before synchronizing the UFS and the Alluxio space.

The system guarantees that we will start syncing a directory if it is “quiet”, or it has not been synced for a long period (period longer than the max age).

For example, the following setting

  1. alluxio.master.ufs.active.sync.interval=30secs
  2. alluxio.master.ufs.active.sync.max.activities=100
  3. alluxio.master.ufs.active.sync.max.age=5

means that every 30 seconds, the system will count the number of events in the directory and calculate its activity. If the activity is less than 100, it will be considered a quiet period, and syncing will start for that directory. If the activity is greater than 100, and it has not synced for the last 5 intervals, or 5 * 30 = 150 seconds, it will start syncing the directory. It will not perform active sync if the activity is greater than 100 and it has synced at least once in the last 5 intervals.

Unified Namespace

This example will mount multiple under storages of different types to showcase the unified file system namespace abstraction. This example will use two S3 buckets owned by different AWS accounts and a HDFS service.

Mount the first S3 bucket into Alluxio using its corresponding credentials <accessKeyId1> and <secretKey1> :

  1. $ ./bin/alluxio fs mkdir /mnt
  2. $ ./bin/alluxio fs mount \
  3. --option aws.accessKeyId=<accessKeyId1> \
  4. --option aws.secretKey=<secretKey1> \
  5. /mnt/s3bucket1 s3://data-bucket1/

Mount the second S3 bucket into Alluxio using its corresponding credentials <accessKeyId2> and <secretKey2>:

  1. $ ./bin/alluxio fs mount \
  2. --option aws.accessKeyId=<accessKeyId2> \
  3. --option aws.secretKey=<secretKey2> \
  4. /mnt/s3bucket2 s3://data-bucket2/

Mount the HDFS storage into Alluxio:

  1. $ ./bin/alluxio fs mount /mnt/hdfs hdfs://<NAMENODE>:<PORT>/

All three directories are all contained in one space in Alluxio:

  1. $ ./bin/alluxio fs ls -R /
  2. ... # should contain /mnt/s3bucket1, /mnt/s3bucket2, /mnt/hdfs

Resources