Log-structured File System (BSD)

This article is about the NetBSD file system. For the general concept, see log-structured file system.

The Log-Structured File System (or LFS) is an implementation of a log-structured file system (a concept originally proposed and implemented by John Ousterhout), originally developed for BSD. It was removed from FreeBSD and OpenBSD; the NetBSD implementation was nonfunctional until work leading up the 4.0 release made it viable again as a production file system.[1]

Design

Most of the on-disk format of LFS is borrowed from UFS. The indirect block, inode and directory formats are almost identical. This allows well-tested UFS file system code to be re-used; current implementations of LFS share the higher-level UFS code with the lower-level code for FFS, since both of these file systems share much in common with UFS.

LFS divides the disk into segments, only one of which is active at any one time. Each segment has a header called a summary block. Each summary block contains a pointer to the next summary block, linking segments into one long chain that LFS treats as a linear log. The segments do not necessarily have to be adjacent to each other on disk; for this reason, larger segment sizes (between 384KB and 1MB) are recommended because they amortize the cost of seeking between segments.[2]

Whenever a file or directory is changed, LFS writes to the head of this log:

  1. Any changed or new data blocks.
  2. Indirect blocks updated to point to (1).
  3. Inodes updated to point to (2).
  4. Inode map blocks updated to point at (3).[3]

Unlike UFS, inodes in LFS do not have fixed locations. An inode mapa flat list of inode block locationsis used to track them. As with everything else, inode map blocks are also written to the log when they are changed.

When a segment is filled, LFS goes on to fill the next free or clean segment. Segments are said to be dirty if they contain live blocks, or blocks for which no newer copies exist further ahead in the log. The LFS garbage collector turns dirty segments into clean ones by copying live blocks from the dirty segment into the current segment and skipping the rest. The summary block in each segment contains a map to track live blocks.

Generally, garbage collection is delayed until there are no clean segments left; it can also be deferred for when the system is idle. Even then, only the least-dirty segments are picked for collection. This is intended to avoid the penalty of cleaning full segments when I/O bandwidth is most needed.[2]

At a checkpoint (usually scheduled about once every 30 seconds), LFS writes the last known block locations of the inode map and the number of the current segment to a checkpoint region at a fixed place on disk. There are two such regions; LFS alternates between them each checkpoint. Once written, a checkpoint represents the last consistent snapshot of the file system. Recovery after a crash and normal mounting work the same waythe file system simply reconstructs its state from the last checkpoint and resumes logging from there.

Disadvantages

References

  1. Bouyer, Manuel (2007-12-15). "NetBSD 4.0 Release CHANGELOG". Retrieved 2008-01-26..
  2. 1 2 Rosenblum, Mendel; Ousterhout, John K (February 1992), "The Design and Implementation of a Log-Structured Filesystem", ACM Transactions on Computer Systems 10 (1): 26–52, doi:10.1145/146941.146943.
  3. Rosenblum, Mendel; Ousterhout, John K (June 1990), "The LFS Storage Manager", Proceedings of the 1990 Summer Usenix: 315–324.
This article is issued from Wikipedia - version of the Saturday, February 07, 2015. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.