FAQ

My question is not answered in the documentation

If you can't find an answer in the documentation, please try using the "Ask AI" feature (in the bottom right corner). If the AI assistant's answer helps you or provides a wrong answer, feel free to leave feedback on the response. Alternatively, use the document search feature (in the top right corner) and try searching with different keywords.

If these methods still do not resolve your question, you can join the JuiceFS Community for further assistance.

General Questions

What's the difference between JuiceFS and XXX?

See "Comparison with Others" for more information.

How to upgrade JuiceFS client?

First unmount JuiceFS volume, then re-mount the volume with newer version client.

Where is the JuiceFS log?

Different types of JuiceFS clients have different ways to obtain logs. For details, please refer to "Client log" document.

Can JuiceFS directly read files that already exist in object storage?

JuiceFS cannot directly read files that already exist in object storage. Although JuiceFS typically uses object storage as the data storage layer, it is not a tool for accessing object storage in the traditional sense. You can refer to the technical architecture documentation for more details.

If you want to migrate existing data in an object storage bucket to JuiceFS, you can use JuiceFS Sync.

How can I combine multiple servers into a single JuiceFS file system for use?

No, while JuiceFS supports using local disks or SFTP as the underlying storage, it does not interfere with the logical structure management of the underlying storage. If you wish to consolidate storage space from multiple servers, you may consider using MinIO or Ceph to create an object storage cluster, and then create a JuiceFS file system on top of it.

Does support Redis in Sentinel or Cluster-mode as the metadata engine for JuiceFS?

Yes, There is also a best practice document for Redis as the JuiceFS metadata engine for reference.

Why doesn't JuiceFS support XXX object storage?

JuiceFS already supported many object storage, please check the list first. If this object storage is compatible with S3, you could treat it as S3. Otherwise, try reporting issue.

Why do I delete files at the mount point, but there is no change or very little change in object storage footprint?

The first reason is that you may have enabled the trash feature. In order to ensure data security, the trash is enabled by default. The deleted files are actually placed in the trash and are not actually deleted, so the size of the object storage will not change. trash retention time can be specified with juicefs format or modified with juicefs config. Please refer to the "Trash" documentation for more information.

The second reason is that JuiceFS deletes the data in the object storage asynchronously, so the space change of the object storage will be slower. If you need to immediately clean up the data in the object store that needs to be deleted, you can try running the juicefs gc command.

How Does JuiceFS Asynchronous Deletion Work?

When trash is disabled:
- The system checks whether the file is being opened by other processes:
  - If the file is in use, it is marked as **"deferred deletion (sustained)"** and will be processed after the program closes the file
  - If the file is not in use, it is marked as pending deletion (delfile) and attempts to place it into the deletion queue (maxDeleting)
When trash is enabled:
- The system creates subdirectories in the trash based on current time (accurate to the hour) (e.g., 2024-01-15-14)
- Files pending deletion are moved to the corresponding time-stamped directory:
  - All chunks and slices of data remain intact
  - Only the parent directory pointer in metadata changes
  - Filenames are re-encoded to prevent conflicts
- A background task cleans expired files based on retention period:
  - Starts cleaning from the oldest directory
  - Method: Marked as pending deletion (delfile), placed into the deletion queue (maxDeleting)
Deletion queue processing (asynchronous cleanup):
1. Find all chunks corresponding to the file and delete them
2. Deleting chunks will decrement the reference count of their slices
3. When a slice's reference count drops to zero, it becomes **Pending Deleted Slices**
4. The background task cleans these data slices from object storage

JuiceFS-delete-file

The deletion queue has a capacity limit. If too many files are deleted simultaneously, deletion requests will return immediately when the queue is full. Then a background cleanup task that runs hourly continues the cleanup. It finds all files marked as pending deletion (delfile) and cleans them using the same method as files in the deletion queue.
If NoBGJob is configured, the hourly scheduled background cleanup task and trash cleanup task are disabled. After deleting files, manual cleanup is required in the trash.
In a special scenario, when you manually delete files directly from the trash, it ensures synchronous insertion into the deletion queue, enabling relatively fast reclamation of object storage space. However, subsequent chunk cleanup remains asynchronous.
Regarding slice reference count: Deleting chunks and compaction (compact) will decrease the reference count of related slices, while clone and copyFileRange will increase the reference count of related slices.

Why is file system data size different from object storage usage?

"Random write in JuiceFS" produces data fragments, causing higher storage usage for object storage, especially after a large number of overwrites in a short period of time, many fragments will be generated. These fragments continue to occupy space in object storage until they are compacted and released. You shouldn't worry about this because JuiceFS checks for file compaction with every read/write, and cleans up in the client background job. Alternatively, you can manually trigger merges and garbage collection with juicefs gc --compact --delete.
If Trash is enabled, deleted files will be kept for a specified period of time, and then be garbage collected (all carried out in client background job).
After data fragments are compacted, stale slices will be kept inside Trash as well (not visible to user), following the same expiration settings. To delete this type of data, read Trash and stale slices.
If compression is enabled (the --compress parameter in the format command, disabled by default), object storage usage may be smaller than the actual file size (depending on the compression ratio of different types of files).
Different storage class of the object storage may calculate storage usage differently. The cloud service provider may set the minimum billable size for some storage classes. For example, the minimum billable size for Alibaba Cloud OSS IA storage is 64KB. If a file is smaller than 64KB, it will be calculated as 64KB.
For self-hosted object storage services, for example MinIO, actual data usage is affected by storage class settings.

Does JuiceFS support using a directory in object storage as the value of the `--bucket` option?

As of the release of JuiceFS 1.0, this feature is not supported.

Does JuiceFS support accessing data that already exists in object storage?

As of the release of JuiceFS 1.0, this feature is not supported.

Is it possible to bind multiple different object storages to a single file system (e.g. one file system with Amazon S3, GCS and OSS at the same time)?

No. However, you can set up multiple buckets associated with the same object storage service when creating a file system, thus solving the problem of limiting the number of individual bucket objects, for example, multiple S3 Buckets can be associated with a single file system. Please refer to --shards option for details.

How is the performance of JuiceFS?

JuiceFS is a distributed file system, the latency of metadata is determined by 1 (reading) or 2 (writing) round trip(s) between client and metadata service (usually 1-3 ms). The latency of first byte is determined by the performance of underlying object storage (usually 20-100 ms). Throughput of sequential read/write could be 50MB/s - 2800MiB/s (see fio benchmark for more information), depends on network bandwidth and how the data could be compressed.

JuiceFS is built with multiple layers of caching (invalidated automatically), once the caching is warmed up, the latency and throughput of JuiceFS could be close to local file system (having the overhead of FUSE).

Does JuiceFS support random read/write? How?

Yes, including those issued using mmap. Currently JuiceFS is optimized for sequential reading/writing, and optimized for random reading/writing is work in progress. If you want better random read performance, it's best to turn off compression (--compress none).

JuiceFS does not store the original file in the object storage, but splits it into data blocks using a fixed size (4MiB by default), then uploads it to the object storage, and stores the ID of the data block in the metadata engine. When random write happens, the original metadata is marked stale, and then JuiceFS Client uploads the new data block to the object storage, then update the metadata accordingly.

When reading the data of the overwritten part, according to the latest metadata, it can be read from the new data block uploaded during random writing, and the old data block may be deleted by the background garbage collection tasks automatically clean up. This shifts complexity from random writes to reads.

Read JuiceFS Internals and Data Processing Flow to learn more.

How to copy a large number of small files into JuiceFS quickly?

You could mount JuiceFS with --writeback option, which will write the small files into local disks first, then upload them to object storage in background, this could speedup coping many small files into JuiceFS.

See "Write Cache in Client" for more information.

Does JuiceFS support distributed cache?

Distributed cache is supported in our enterprise edition.

Can I mount JuiceFS without `root`?

Yes, JuiceFS could be mounted using juicefs without root. The default directory for caching is $HOME/.juicefs/cache (macOS) or /var/jfsCache (Linux), you should change that to a directory which you have write permission.

See "Read Cache in Client" for more information.

What other ways JuiceFS supports access to data besides mount?

In addition to mounting, the following methods are also supported:

Kubernetes CSI Driver: Use JuiceFS as the storage layer of Kubernetes cluster through the Kubernetes CSI Driver. For details, please refer to "Use JuiceFS on Kubernetes".
Hadoop Java SDK: It is convenient to use a Java client compatible with the HDFS interface to access JuiceFS in the Hadoop ecosystem. For details, please refer to "Use JuiceFS on Hadoop Ecosystem".
S3 Gateway: Access JuiceFS through the S3 protocol. For details, please refer to "Deploy JuiceFS S3 Gateway".
Docker Volume Plugin: A convenient way to use JuiceFS in Docker, please refer to "Use JuiceFS on Docker".
WebDAV Gateway: Access JuiceFS via WebDAV protocol

Why the same user on host X has permission to access a file in JuiceFS while has no permission to it on host Y?

The same user has different UID or GID on host X and host Y. Use id command to show the UID and GID:

$ id alice
uid=1201(alice) gid=500(staff) groups=500(staff)

Read "Sync Accounts between Multiple Hosts" to resolve this problem.

Does JuiceFS Gateway support advanced features such as multi-user management?

The built-in gateway subcommand of JuiceFS does not support functions such as multi-user management, and only provides basic S3 gateway functions. If you need to use these advanced features, please refer to the documentation.

Is there currently an SDK available for JuiceFS?

As of the release of JuiceFS 1.0, the community has two SDKs, one is the Java SDK that is highly compatible with the HDFS interface officially maintained by Juicedata, and the other is the Python SDK maintained by community users.

My question is not answered in the documentation​

General Questions​

What's the difference between JuiceFS and XXX?​

How to upgrade JuiceFS client?​

Where is the JuiceFS log?​

Can JuiceFS directly read files that already exist in object storage?​

How can I combine multiple servers into a single JuiceFS file system for use?​

Metadata Related Questions​

Does support Redis in Sentinel or Cluster-mode as the metadata engine for JuiceFS?​

Object Storage Related Questions​

Why doesn't JuiceFS support XXX object storage?​

Why do I delete files at the mount point, but there is no change or very little change in object storage footprint?​

How Does JuiceFS Asynchronous Deletion Work?​

Why is file system data size different from object storage usage?​

Does JuiceFS support using a directory in object storage as the value of the --bucket option?​

Does JuiceFS support accessing data that already exists in object storage?​

Is it possible to bind multiple different object storages to a single file system (e.g. one file system with Amazon S3, GCS and OSS at the same time)?​

Performance Related Questions​

How is the performance of JuiceFS?​

Does JuiceFS support random read/write? How?​

How to copy a large number of small files into JuiceFS quickly?​

Does JuiceFS support distributed cache?​

Mount Related Questions​

Can I mount JuiceFS without root?​

Access Related Questions​

What other ways JuiceFS supports access to data besides mount?​

Why the same user on host X has permission to access a file in JuiceFS while has no permission to it on host Y?​

Does JuiceFS Gateway support advanced features such as multi-user management?​

Is there currently an SDK available for JuiceFS?​