ZFS dataset configuration for a movies and tv shows library? Very heterogeneous data

The Hobbyist@lemmy.zip · 1 year ago (edited)

InvertedParallax@lemm.ee · 1 year ago

I have a video dataset with recordsize=1M, primarycache=metadata, and secondarycache=metadata, and a general dataset as its parent with recordsize=128K, primarycache=all, secondarycache=all (the defaults), and compression=lz4 (or zstd).
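As a rough sketch of that layout, this Python snippet just builds and prints the `zfs create` commands rather than running them. The dataset names `tank/media` and `tank/media/video` are made up for illustration, and lz4 and `all` are my assumptions for the compression and "normal" cache settings.

```python
import shlex

# Hypothetical dataset names; substitute your own pool and layout.
PARENT = "tank/media"
VIDEO = "tank/media/video"


def zfs_create(dataset, **props):
    """Return the argv for `zfs create`, passing each property via -o."""
    argv = ["zfs", "create"]
    for name, value in props.items():
        argv += ["-o", f"{name}={value}"]
    argv.append(dataset)
    return argv


commands = [
    # General parent dataset: 128K records, cache both data and metadata.
    zfs_create(PARENT, recordsize="128K", compression="lz4",
               primarycache="all", secondarycache="all"),
    # Video child dataset: 1M records, cache only metadata in ARC and L2ARC.
    zfs_create(VIDEO, recordsize="1M",
               primarycache="metadata", secondarycache="metadata"),
]

# Print the commands for review instead of executing them.
for argv in commands:
    print(shlex.join(argv))
```

The child dataset does not set compression because it inherits that property from its parent.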

Works like a monster, I don’t worry about things like srts and such, though your symlinks idea looks interesting.

I’m reworking my entire system to get off the filesystem structure anyway and use Python and possibly some other DB, reading from Sonarr for metadata seeding, but I haven’t got to it yet.

Actually, you make a good point, what would be nice is if sonarr put nfos in a different structure, but since I’m going to read sonarr metadata I can just delete them anyway.

The Hobbyist@lemmy.zip · 1 year ago

Do you have it set at 1M for a situation similar to mine or do you not have any small files for your video files? Setting it at 1M is indeed possible, though it would uselessly consume a large amount of extra disk space as all files of just a few KB would automatically require a whole 1MB disk space from my understanding.

InvertedParallax@lemm.ee · 1 year ago

Similar to yours, I originally didn’t have many small files, but I turned on sonarr metadata and now there are tons of 1k files everywhere.

I think ZFS stores them in blocks sized to the file, though, rather than in full-size records.

So far, this seems pretty simple: set volblocksize=64K, you get 64KiB blocks in your zvol, and that’s that. But recordsize is a bit trickier: the blocks in a dataset are dynamically sized, and recordsize sets the maximum size for blocks in that dataset—not a fixed size.

https://klarasystems.com/articles/tuning-recordsize-in-openzfs/

So I wasn’t worried about the small files in the beginning. The main reason to use a smaller recordsize is to make small accesses within a file, not to access small files.
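To put rough numbers on that, here is a simplified model of how much data-block space one file takes. It is only a sketch under my own assumptions: a file no larger than recordsize gets a single block rounded up to the sector size (4 KiB here, i.e. ashift=12), a larger file uses whole records, and compression, metadata and RAID-Z padding are ignored. The function name `allocated_bytes` is just for illustration.

```python
KIB = 1024
MIB = 1024 * KIB


def allocated_bytes(file_size, recordsize, sector=4 * KIB):
    """Rough data-block footprint of one file under the simplified model.

    Assumes a file no larger than recordsize is stored in a single block
    rounded up to the sector size, and a larger file uses full records.
    Ignores compression, metadata and RAID-Z padding.
    """
    if file_size == 0:
        return 0
    if file_size <= recordsize:
        return -(-file_size // sector) * sector  # round up to whole sectors
    return -(-file_size // recordsize) * recordsize  # round up to whole records


# A 1 KiB metadata file costs the same under both recordsize settings.
for rs in (128 * KIB, 1 * MIB):
    used = allocated_bytes(1 * KIB, rs)
    print(f"recordsize={rs // KIB}K: {used} bytes for a 1 KiB file")
```

Under this model the small file takes one sector either way, so raising recordsize to 1M does not make each tiny file cost 1 MiB.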

vext01@lemmy.sdf.org · 1 year ago

I just left the defaults and I’ve never had problems.

The Hobbyist@lemmy.zip · 1 year ago

Yes, I don’t think a non-optimal value would cause an actual problem; it has more to do with leaving IOPS on the table. I might be more concerned about it than it deserves.

Spectator47@lemmy.world · 1 year ago

Recordsize sets the maximum block size used for a file.

Either leave it at the ZFS default of 128K or, since your use case involves primarily reads of large files, set it to 1M.

The Hobbyist@lemmy.zip · 1 year ago

Setting it at 1MB is also possible, though it would uselessly consume a large amount of extra disk space as all files of just a few KB would automatically require a whole 1MB disk space from my understanding. And as there are usually multiple tiny files for each video, it could end up growing into something quite unnecessarily large…

InverseParallax@lemmy.world · 1 year ago

Let me clarify:

Recordsize is basically the checksum block size. Whenever you change something, ZFS writes in blocks of up to the recordsize (smaller if the file is smaller) and then calculates the checksum over that block.

A smaller recordsize only helps for random-ish accesses inside a file.
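A quick way to see why: since the checksum covers a whole record, an uncached small read inside a big file has to pull in every record it touches. This is a back-of-the-envelope sketch, assuming uncompressed, full-size records; `bytes_read` is a made-up helper, not anything from ZFS itself.

```python
KIB = 1024
MIB = 1024 * KIB


def bytes_read(offset, length, recordsize):
    """Bytes fetched for one uncached read, assuming full-size records.

    Simplified: every record overlapping [offset, offset + length) is read
    whole, because the checksum is verified over the entire record.
    """
    first = offset // recordsize
    last = (offset + length - 1) // recordsize
    return (last - first + 1) * recordsize


# A 4 KiB read somewhere inside a large file.
for rs in (128 * KIB, 1 * MIB):
    total = bytes_read(offset=10 * MIB + 4 * KIB, length=4 * KIB, recordsize=rs)
    print(f"recordsize={rs // KIB}K: {total // KIB} KiB read, "
          f"{total // (4 * KIB)}x amplification")
```

For sequential playback of a video the whole record gets used anyway, so the larger recordsize costs nothing there.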