ReiserFs

Using ReiserFS

Basic Info

Usage

To create a reiserfs disk partition:

mkreiserfs -l dlCache1 /dev/hdb12

To create a reiserfs disk label:

reiserfstune -l dlCache1 /dev/hdb12

Help

Quick Help

mkreiserfs

$ mkreiserfs -h mkreiserfs: option requires an argument — h Usage: mkreiserfs [options] device [block-count]

Options:

-b | --block-size N              size of file-system block, in bytes
-j | --journal-device FILE       path to separate device to hold journal
-s | --journal-size N            size of the journal in blocks
-o | --journal-offset N          offset of the journal from the start of
                                 the separate device, in blocks
-t | --transaction-max-size N    maximal size of transaction, in blocks
-B | --badblocks file            store all bad blocks given in file on the fs
-h | --hash rupasov|tea|r5       hash function to use by default
-u | --uuid UUID                 store UUID in the superblock
-l | --label LABEL               store LABEL in the superblock
--format 3.5|3.6                 old 3.5 format or newer 3.6
-f | --force                     specified once, make mkreiserfs the whole
                                 disk, not block device or mounted partition;
                                 specified twice, do not ask for confirmation
-q | --quiet                     quiet work without messages, progress and
                                 questions. Useful if run in a script. For use
                                 by end users only.
-d | --debug                     print debugging information during mkreiser
-V                               print version and exit

Version 3.6.17

Working History

$ df | grep dlCache1
/dev/hdb12            74706268     32828  70878488   1% /lfs/dlCache1

% mkreiserfs -l dlCache1 /dev/hdb12

Guessing about desired format.. Kernel 2.4.25-1-386 is running.
Format 3.6 with standard journal
Count of blocks on the device: 18974752
Number of blocks consumed by mkreiserfs formatting process: 8791
Blocksize: 4096
Hash function used to sort names: "r5"
Journal Size 8193 blocks (first block 18)
Journal Max transaction length 1024
inode generation number: 0
UUID: 1ce0e8d9-ad89-436b-b35f-40f0a9265fa9
LABEL: dlCache1
ATTENTION: YOU SHOULD REBOOT AFTER FDISK!
        ALL DATA WILL BE LOST ON '/dev/hdb12'!
Continue (y/n):y
Initializing journal - 0%....20%....40%....60%....80%....100%
Syncing..ok

ReiserFS is successfully created on /dev/hdb12.

mount /dev/hdb12 /lfs/dlCache1

$ mount | grep dlCache1
/dev/hdb12 on /lfs/dlCache1 type reiserfs (rw)

$ df | grep dlCache1
/dev/hdb12            75896684     32840  75863844   1% /lfs/dlCache1

Creating a Reiser file system as a loop-mount file.

http://www.topology.org/linux/reiser.html

To create a 20-MiByte Reiser file system file (on a SuSE 9.0 system), do something like this.

root@shark# dd if=/dev/zero of=reiser1.img count=40960
40960+0 records in
40960+0 records out

root@shark# ls -l reiser1.img
-rw-r--r--    1 root     root     20971520 Nov 17 19:16 reiser1.img

root@shark# losetup /dev/loop3 reiser1.img

root@shark# losetup /dev/loop3
/dev/loop3: [0700]:109 (reiser1.img) offset 0, no encryption

root@shark# mkfs.reiserfs /dev/loop3
mkfs.reiserfs 3.6.9 (2003 www.namesys.com)

Now continuing…

root@shark# mkdir reiser1

root@shark# ls -ld reiser*
drwxr-xr-x    2 root     root           48 Nov 17 19:55 reiser1
-rw-r--r--    1 root     root     104857600 Nov 17 19:52 reiser1.img

root@shark# mount /dev/loop3 reiser1

root@shark# df reiser1
Filesystem           1K-blocks      Used Available Use% Mounted on
/dev/loop3              102392     32840     69552  33% /home2/rsoft/reiser1

Well, what could be simpler!!

Ironically, the file reiser1.img is itself in a file system which is an encrypted loop-mounted Reiser file system.

The next logical step is to work out how to undo the deed.

root@shark# umount reiser1

root@shark# df reiser1
Filesystem           1K-blocks      Used Available Use% Mounted on
/dev/hdd1            156275512  24241608 132033904  16% /home2

root@shark# losetup /dev/loop3
/dev/loop3: [0700]:109 (reiser1.img) offset 0, no encryption

root@shark# losetup -d /dev/loop3

root@shark# losetup /dev/loop3
loop: can't get info on device /dev/loop3: No such device or address

So let's try this just once more to make sure it's all okay.

root@shark# losetup /dev/loop3 reiser1.img
root@shark# mount /dev/loop3 reiser1
root@shark# df reiser1
Filesystem           1K-blocks      Used Available Use% Mounted on
/dev/loop3              102392     32840     69552  33% /home2/rsoft/reiser1

Racing to ReiserFS on Red Hat

http://www.freeos.com/articles/3933/

By Mayank Sarup <Mayank@freeos.com> Posted: ( 2001-04-16 06:49:53 EST by gatha )

A Journal Filesystem keeps track of all the transactions undertaken. And the lack of a proper journal filesystems has always been sited as one of the reasons holding back widespread adoption of Linux in the enterprise space. But this argument no longer holds true, for currently, there are four journal filesystems to choose from — XFS, JFS, ext3 and ReiserFS. Here we shall look into ReiserFs.

Of all the journal filesystems available today, ReiserFS, is the most developed under Linux. While IBM's JFS and SGI's XFS have been around a lot longer, they are still in an prenatal stage as far as the Linux OS is concerned. Ext3 has also quite a way to go. So, for now, our choice of a journal filesystem is ReiserFS. Of course, this isn't to say that we're settling for ReiserFS for lack of a better option. ReiserFS is a good choice regardless of the number of options in the market or the quality of those options.

Why Journaling?

Journaling filesystems are sort of the elite in the business. They're what filesystems like FAT and ext2 want to grow up into. They're also essential for big enterprise server where data integrity is a priority.

The idea behind journal filesystems comes from big databases like Oracle. Database operations quite often include several related and dependent sub-operations. The failure of any one operation means that the entire operation is invalid and any changes made to the database are rolled back. Journal filesystems use a similar system.

A journal log file is maintained on the partition. Filesystem writes are first written to the log file. If a write operation is interrupted due to the machine unexpectedly going down — power failure, crash — then at the next boot, the journal log file is read and operations are rolled back. This process takes only a few seconds to few minutes rather than the hours that a `fsck' can possibly take, on larger servers.

A far more effective description can be seen in this LinuxGazette article.

Using ReiserFS

Start by patching the kernel for ReiserFS, then it's time to compile the ReiserFS utilities. The utilities are place in the directory /usr/src/linux/fs/reiserfs/utils. Do a "make" followed by a "make install" to install the programs. The utilities are not included in the 2.4 kernels and are a separate download. Get the package from this location.

Un-tar the file.

# tar zxvf reiserfsprogs-3.x.0j.tar.gz
# cd reiserfsprogs-3.x.0j
# ./configure
# make
# make install

Unfortunately, there is no "ext2toreiserfs" utility to make our lives easier. So the basic procedure for converting root (/) from ext2 to reiserfs becomes the following three-step procedure.

Create a new partition and format it with ReiserFS.
Copy data from old partition.
Mount new partition as root (/).

Create a new partition. ReiserFS does not require a special partition type. Use 83 (Linux).

# fdisk -l /dev/hda
/dev/hda9 2872 3126 2048256 83 Linux
/dev/hda10 3127 3381 2048256 83 Linux

Now create a ReiserFS filesystem on the new partition.

# mkreiserfs /dev/hda10

Mount the new partition

# mount -t reiserfs /dev/hda10 /mnt/hda10

Copy the data onto the new partition.

# cd /mnt/hda10
# tar cvlf - / | tar xf -

Edit fstab to point to the new root

/dev/hda10 / reiserfs defaults 1 1

Create a symlink to reiserfsck because the Red Hat boot looks for fsck.reiserfs.

# ln -s /sbin/reiserfsck /sbin/fsck.reiserfs

Get at least lilo 21.6. This was the first version to support ReiserFS. The alternative is to keep a small /boot partition that is ext2. The first option — upgrading lilo — is preferred. A better boot manager that we recommend for use is GRUB.

Install lilo after creating new lilo.conf entries for your new kernel and boot with it. The "root =" entry should point to the new root.

Reboot! You'll now have root on ReiserFS.

The above procedure was for putting root on ReiserFS, but it is pretty much what you will want to use when converting any of your partitions to ReiserFS. The procedure given in this article should work across distributions and partitions. But do let us know, in case you encounter any problems regarding the new kernel.

ReiserFS home
ReiserFS FAQ
LinuxGazette article on journal filesystems
XFS by SGI
IBM's JFS
Ext3 beta testing program on redhat.com

Using the New Linux ReiserFS Filesystem

http://networking.earthweb.com/netos/print.php/625421

By Stew Benedict December 21, 2000

For most of Linux's history, the extended2 (ext2) filesystem has been the standard, although the Linux kernel can be configured to read/write many other types of filesystems. Recently, work has been done on some new filesystem types in an attempt to improve on the performance of ext2. Ext3 and ReiserFS are two of these projects. Ext3 adds journaling to ext2, whereas ReiserFS is a whole new filesystem, with journaling. This article will focus on the strengths and weaknesses of ReiserFS, as well as how to go about setting it up, should you decide to give it a try.

Journaling

If you're not familiar with database systems, journaling means that each transaction is written to a journal, or log. It is then possible to replay this journal in the event of a catastrophe and recover the lost transactions. Periodically, the journal is flushed, after it is certain the transactions have taken place.

In addition, for performance reasons, the author of ReiserFS has chosen to store both the file names and the files in a database, rather than just the file names and locations. This arrangement allows better storage of small files. Rather than allocating a whole block for a file that is smaller than the block size, small files are combined for optimal usage of disk space.

Strengths and Weaknesses

ReiserFS has a number of strengths: Generally higher performance for all file sizes.
Wastes less space. There is no static inode space allocation; small files are packed together.
Much higher performance for large directories, even compared to other balanced tree filesystems.
Uses B* balanced trees, whereas other balanced tree filesystems use obsolete B+ trees.
Partitions can be resized while in use.
Extremely fast recovery in the event of unplanned machine shutdown (loss of power). Rather than taking many seconds or even minutes to check the filesystem as e2fsck does, ReiserFSck takes only seconds.

Of course, it also has its weaknesses:

The software is still relatively new. You may want to hold off implementing in production systems, although results to date have been good.
No quota support (yet).
Dump does not work (yet).
Kernel-based NFS is not very stable yet.
Certain programs (qmail, for one) have issues with ReiserFS. Patches are available in many cases.
It is not possible to change live filesystems back and forth from ext2 to ReiserFS. To make the change, you need to back up your data, create the file system, and restore.
Occasionally, you may experience a stall condition (read starves). When large writes are scheduled all at once, reads can starve. A fix for this is in the works, and the later your ReiserFS patch, the better this situation is handled.

Precautions

When you use ReiserFS, you'll need to take a number of precautions:

If you're using Lilo, you will need to set the notail option on / (the root partition) if you use ReiserFS on that partition. The new version of Lilo claims notail isn't needed anymore, but it also requires a newer version of ReiserFS.
If using md (software raid) to spread ReiserFS over multiple disks, turn off REISERFS_READ_LOCK. To do so, comment out
```
#define REISERFS_READ_LOCK
```
```
in linux/include/linux/reiserfs_fs.h.
```
If you want to run Windows 98 under Linux using VMware, add the following line to the configuration file (~/vmware/win98/win98.cfg): host.FSSupportLocking1 = 0x52654973

Utilities

Several utilities are included with ReiserFS kernel patches:

mkreiserfs Creates a Linux ReiserFS file system on a device. This utility is equivalent to mke2fs.
reiserfsck Performs a consistency check for the Linux ReiserFS file system. It is equivalent to e2fsck on ext2.
resize_reiserfs Used for offline file system resizing. As far as I know, ext2 has no equivalent to this command.

I am currently running ReiserFS on /home and /usr on a SuSE7.0 Linux install, leaving the / partition as ext2. SuSE 7.0 comes with ReiserFS as an install option, if you pick the expert install. For testing purposes, I created equivalent partitions /usr2, and /home2 using ext2. The following is the output from mount:

larry:~ # mount
/dev/hda2 on / type ext2 (rw)
proc on /proc type proc (rw)
/dev/hda1 on /boot type ext2 (rw)
/dev/hda5 on /usr type reiserfs (rw)
/dev/hda7 on /home type reiserfs (rw)

fs level undeletable for reiserfs

Newsgroups:  comp.os.linux.misc
Date:        Tue, 27 Sep 2005 13:18:43 -0400

I remember that when I used the ext3 fs format, I was able to set something in the file system level that a certain file is un-deletable. Is it possible to do the same thing in reiserfs?

fs level undeletable for reiserfs

ReiserFS 3.6, at least, will let you *set* the ext3 attributes, and it'll *remember* those attributes, but it won't *honor* those attributes. On a ReiserFS 3.6 partition, you can chattr +i a file and lsattr will show you that the file's immutable. You can then rm that file without a problem. At least you can with ReiserFS 3.6, kernel 2.6.10-vanilla, coreutils 5.2.1. YMMV. ext3 works as advertised with ext attributes, natch.

Matt G

fs level undeletable for reiserfs

> At least it needs to be enable in your kernel.
>  zgrep XATTR /proc/config.gz

Hm. Yeah, "ReiserFS extended attributes" isn't set to Y, but ext[23] extended attributes are set to Y. This still doesn't really explain the observed behavior I saw with chattr, though. I would think that if the kernel didn't support extended attributes on ReiserFS, "chattr +i /reiserfs/file" would *fail*, not set the immutable attribute! I may be expecting too much though.

Matt G

An In-Depth Look at Reiserfs

http://www.linuxplanet.com/linuxplanet/print/2926/

 By: Scott Courtney
Monday, January 22, 2001 08:42:21 AM EST
URL: http://www.linuxplanet.com/linuxplanet/tutorials/2926/1/[]

Included in the Linux kernel

Reiserfs will soon become the first journaled file system to be bundled as part of the standard Linux kernel tree. What is a journaled file system, how does Reiserfs fit into that category, and why should you care that it's about to become part of the Linux core?

Let's start with a discussion of filesystems in general. If you are coming to Linux from a Windows or DOS environment, then you have been using filesystems already — you just haven't called them that. A filesystem can mean either a specific disk drive or partition, or it can mean, in a more general way, the internal format of how the data is organized on a mass storage device. For example, you have a root filesystem on your Linux machine and perhaps another filesystem for /home, and another for /opt, and so on. Each of these corresponds to a partition on a disk drive. Other directories that are undeneath these may not necessarily have their own disk partition, so they aren't filesystems.

On the other hand, we use the term "filesystem" to represent the particular way that the data is stored and how the operating system keeps track of it. Information such as the date of a file's creation and last modification, which user and group own it, what permissions are granted for reading and modifying the file, how large it is, and where it is located on the drive or partition, are all part of the responsibility of the filesystem. If the file itself is "data" then all these other items are "data about the data" and they are collectively called "metadata." So any filesystem must manage all the files and all of their metadata.

In Windows, the most common filesystems are File Allocation Table (FAT) and its newer flavors such as FAT32 and VFAT. FAT is a holdover from the dark ages of DOS and is very primitive internally. To be fair, it was created in the days of 8- and 16-bit computers and single-tasking operating systems, and it was as complex as the systems of the day could really support. Windows NT introduced a much more sophisticated filesystem called NTFS which is more reliable, faster, and capable of supporting extremely large drives and partitions. NTFS, by the way, is quite similar to its ancestor, the High Performance File System (HPFS) from IBM's OS/2 operating system.

FAT and its variants support essentially no user-level security, and they are extremely vulnerable to data corruption after a system crash. Given the poor reliability of Windows 95 and 98, the fact that their filesystems don't do a good job of recovering from crashes is a recipe for disaster! The problem isn't so much that the files may get corrupted, but rather that the metadata about those files can be corrupted.

How Filesystems Become Corrupted

Here's an example of how this happens: Suppose you have a word processing document, which you open in the application and to which you add some content. If the machine crashes before you save the file, you have lost all your changes but your original file will still be okay. If the machine crashes after you save the file, then you really haven't lost anything except the time it takes to reboot and reload the program. But what happens if the machine crashes during the exact moment when the disk is being written?

The answer is, "Things get very ugly." Since the new version of the file is physically overwriting all or part of the old version, the data can have some of each at the moment the drive stops writing. You end up with a file that you can't open because the internal format of its data is inconsistent with what the application expects.

This gets even worse if the drive was writing the metadata areas, such as the directory itself. Now instead of one corrupted file, you have one corrupted filesystem — in other words, you can lose an entire directory or all the data on an entire disk partition. On a large system this can mean hundreds of thousands of files. Even if you have a backup, restoring such a large amount of data can take a long time.

Most PC operating systems have no good way to prevent the loss of a single file that was being written during a system failure. Modern systems, such as Linux, OS/2, and NT, however, do make an attempt to prevent and recover from the horrible metadata corruption case. To accomplish this, the system performs an extensive filesystem analysis during bootup. Well-designed filesystems often incorporate redundant copies of critical metadata, so that it is extremely unlikely for that data to be completely lost. The system figures out where the corrupt metadata is, and then either repairs the damage by copying from the redundant version or simply deletes the file or files whose metadata is affected. Losing files this way is bad, but it is much better than losing the whole partition.

Unfortunately, such an extensive diagnostic analysis requires a great deal of time. Even on a very fast PC, a large and heavily-used partition can require several minutes to check. Most of the time, however, the check is not really needed because the system was shut down normally, without a sudden crash. To prevent unnecessary delays, the operating system's normal shutdown process puts a status flag on the filesystem as it is unmounted, marking it as a "clean" filesystem. If a crash happened, the system never has the chance to mark the filesystem as "clean" and the bootup process knows that it needs to run the extensive filesystem tests just to be safe. A filesystem that has not been shut down cleanly is called, appropriately enough, a "dirty" filesystem.

Enter the Journaled Filesystem

Modern filesystem designs, such as OS/2's HPFS, NT's NTFS, and Linux's popular ext2, do a very good job of implementing the things discussed in the previous section. If you have a system crash, it may take a great deal of time to check the metadata during bootup but the odds are good that you will still have all your files when it's done. As Linux begins to take on more complex applications, on larger servers, and with less tolerance for downtime, there is a need for more sophisticated filesystems that do an even better job of protecting data and metadata. The journaled filesystems now available for Linux are the answer to this need.

It's important to note here that we are talking about journaled filesystems in general. There are a number of such systems available, including "xfs" from Silicon Graphics, "Reiserfs" from The Naming System Venture, "ext3" currently hosted at Red Hat, and "Journaled File System" from IBM. In this article, I use "journaled filesystem" in lower case to mean the generic type of system, as opposed to the capitalized version that refers specifically to IBM's software. You can find links to all of these projects in the references attached to this article.

No matter which journaled filesystem is used, there are certain principles that always apply. The term "journaled" means that the filesystem maintains a log or record of what it is doing to the main data areas of the disk, so that if a crash occurs it can re-create anything that was lost. That can be a little confusing, so let's take a closer look at this process.

In public speaking classes, there is an old saying that goes, "Tell them what you're going to tell them, then tell them, then tell them what you told them." This is similar to what the journal does in a filesystem. When the system is about to alter the metadata, it first makes an entry in the journal saying, "Here is what I'm going to change." Then it makes the change. Finally, it goes back to the journal and either marks that change as "completed" or simply deletes the journal entry entirely. There are variations on this sequence, and other ways to accomplish the same thing, but this simplified view will suffice for our purposes.

The idea is that the system can crash at any point in this process but that such a crash won't have lasting effect. If the crash happens before the first journal entry, then the original data is still on the disk. You lost your new changes, but you didn't lose the file in its previous state. If the crash happens during the actual disk update, you still have the journal entry showing what was supposed to have happened. So when the system reboots, it can simply replay the journal entries and complete the update that was interrupted, or it can back out a partially completed update to restore the file's previous state. In either case, you have valid data and not a trashed partition.

These concepts are familiar to anyone who works with SQL databases with their transaction logic. Replaying and completing an operation that was interrupted is called "roll forward" and backing out such an operation to its previous, consistent state is called "roll back." Ideas that were developed to prevent lost data in SQL databases are also valuable on regular mass storage devices. That is the real benefit of journaled filesystems.

Reiserfs

Now that you understand the need for journaled filesystems, we can take a look at one particular type, called Reiserfs. Originally designed by Hans Reiser, Reiserfs carries the analogy between databases and filesystems to its logical conclusion. In essence, Reiserfs treats the entire disk partition as if it were a single database table. Directories, files, and file metadata are organized in an efficient data structure called a "balanced tree." This differs somewhat from the way in which traditional filesystems operate, but it offers large speed improvements for many applications, especially those which use lots of small files.

Reading and writing of large files, such as CDROM images, is often limited by the speed of the disk hardware or the I/O channel, but access to small files such as shell scripts is often limited by the efficiency of the filesystem design. The reason for this is that opening a file requires the system first to locate the file, and that means reading directories off the disk. Furthermore, the system needs to examine the security metadata to see if the user has permission to access the file, and that means additional disk reads. The system can literally spend more time deciding whether to allow the access, and then locating the data on the drive, than it does actually reading such a small amount of information from the file itself.

Reiserfs uses its balanced trees to streamline the process of finding the files and retrieving their security (and other) metadata. For extremely small files, the entire file's data can actually be stored physically near the file's metadata, so that both can be retrieved together with little or no movement of the disk seek mechanism. If an application needs to open many small files rapidly, this approach significantly improves performance.

Another feature of Reiserfs is that the balanced tree stores not just metadata, but also the file data itself. In a traditional filesystem such as ext2, space on the disk is allocated in blocks ranging in size from 512 bytes to 4096 bytes, or even larger. If a file's size happens to be anything other than an exact multiple of the block size, space will be wasted. For example, suppose the block size is 1024 bytes but you need to store a file that is 8195 bytes long. Eight blocks is 8192, so almost all of the file will fit into eight blocks. The remaining three bytes have their own block, which is mostly empty! The wasted space is almost one whole block out of nine, or about 11 percent. Now imagine a file 1025 bytes long. It almost, but not quite, fits into one block, but requires two. The wasted space is nearly 50 percent. The worst case is a very tiny file, such as a trivial (but useful) one-line shell script. Such a file may be only 50 bytes or so (for example) and would fit into just one block. But if the block is 1024 bytes, then the file has wasted about 95 percent of its allocated space. As you can see, the wasted space (as a percentage) is smaller if the files are larger.

Reiserfs doesn't use a traditional block approach to allocating space, instead relying on the tree structure to keep track of exact byte counts. On small files, this can save a lot of storage space. Furthermore, since more files are placed closer together, the system is able to open and read many small files with just one physical access to the drive. This further improves performance by eliminating time-consuming head seek operations.

Some applications benefit more than others from this type of optimization. Imagine a directory with hundreds of tiny PNG or GIF files used as web page icons, on a busy site. This situation is tailor-made for something like Reiserfs. Likewise, a web site with thousands of HTML files, each just a few kilobytes in size, is an excellent candidate. On the other hand, a disk partition that stores ISO9660 CDROM images, each hundreds of megabytes in size, will see little performance gain from Reiserfs. As with so many other things in the world of computing, the best performance is gained by matching the right tool with the job at hand. (Note that I'm not saying Reiserfs is slower than ext2 on large files — only that there won't be much difference in some cases.)

On top of everything else, Reiserfs is a true journaled filesystem like xfs, ext3, and IBM's JFS. Each of these systems implements the journaling feature in a different way, but the effect is the same: extremely good reliability, and extremely fast recovery after an abrupt shutdown or crash. On my system, I have found that filesystems that took several minutes to check using ext2 take only a second or two under Reiserfs. This difference is typical of any journaled filesystem versus a traditional filesystem.

Installation — Aye, There's the Rub!

For all its benefits, Reiserfs requires a bit of effort to install and configure. It is supplied as a set of kernel patches, which are applied to the kernel source code. After extracting the standard, generic kernel into a directory such as /usr/src/linux, you make that the current directory and run the patch command. If the Reiserfs patch is stored in /usr/local/src/reiserfs-patch.gz (note that the ".gz" means it's gzipped), then you would run a command like this:

zcat /usr/local/src/reiserfs-patch.gz | patch -p1

After that, you can configure the kernel as usual. There will now be a new option in the "filesystems" section that allows you to enable Reiserfs as either a built-in kernel feature or as a loadable module. You have to choose the built-in option if you want to boot the system from a Reiserfs partition.

Once the new kernel is built and tested, you can use the mkreiserfs utility to format partitions as Reiserfs filesystems. As with the regular mke2fs command, this will erase all the data on the partition! (By the way, after you build the kernel you have to go into a subdirectory of the kernel source and separately build the Reiserfs utilities, which include the mkreiserfs command. Instructions for doing this are found on the Reiserfs web site.) Once the desired Reiserfs partitions are formatted, you can use the mount command (with -t reiserfs options) to mount them just as you would with regular ext2 partitions. You can even put Reiserfs filesystems in the /etc/fstab file to have them mount automatically during system initialization.

Each time a new kernel version is installed, a new version of Reiserfs is required. The data on disk is upward-compatible so you don't have to reformat each time, but the filesystem code changes slightly. It takes a certain amount of care to ensure that you don't try to put the wrong Reiserfs version with the wrong kernel, and there is a slight delay between the release of a new kernel and the release of the Reiserfs patch for that kernel.

None of this is especially difficult if you have previous experience building kernels. The process is quite similar to applying the Alan Cox kernel patches, or the international crypto patches. Nonetheless, it can be quite daunting for someone new to Linux. It's especially tricky to create a Reiserfs root partition, since this requires temporarily booting off a backup partition as well as copying and recopying the files from the root partition. One false step, and you can be left with an unbootable machine and be forced to reinstall Linux from scratch or get help from a guru. It's certainly not something a new user should try!

Practical Considerations for Reiserfs

Reiserfs isn't perfect, and has problems and limitations like any other software. Because it changes the conceptual way in which the disk is allocated and managed, Reiserfs doesn't work well with network file system (NFS) servers. There are some patches available to remedy part of the problem, but they don't completely solve it. Likewise, using software RAID to create fault-tolerant drive arrays doesn't work under Reiserfs (but hardware RAID is fine). As with any other piece of software, you have to look at Reiserfs in relation to your needs and the system's intended purpose, and then make a reasoned decision as to whether it's the right tool to use.

Performance gains under Reiserfs can be substantial, or can be miniscule, depending on what you are doing. I have found that Reiserfs is extremely responsive for most of my work, and I wouldn't want to live without it. Compiling source code, something that typically opens hundreds or thousands of files in rapid succession, really zooms. The biggest difference I have noticed is when using the find command to scan large directory trees. Scans that used to take thirty seconds or more now take just five or ten seonds. Copying large files takes just about the same amount of time as with ext2, though deleting unwanted files is significantly faster.

Traditional filesystems such as ext2 can be well-designed and reliable, and I certainly have found ext2 to be quite acceptable in the past. I have never, ever, lost a filesystem after a crash under ext2. Yet the long bootup delay while ext2 does its checking is annoying, especially on a test machine where crashes are more frequent because, well, it's a test machine. All things considered, I am thoroughly sold on journaled filesystems in general and Reiserfs is certainly a fine implementation.

Into the Kernel It Goes!

A lot of people are very happy that Reiserfs is being added to the standard Linux kernel. Instead of being a separate, complex process that has to be done on a complete and working system, Reiserfs becomes a part of the normal installation process, just another option that can be selected in your favorite distribution's install tool.

The fast crash-recovery of journaled filesystems, such as Reiserfs, makes Linux more friendly toward novice users. I have seen new users, when faced with a system that sat for a minute or more at the "checking local filesystems…" message, decide that the machine is completely hung when in fact it's just very, very busy. They instinctively reach for the power switch or reset button, a habit that was probably acquired under Windows. OUCH! There is not much worse than killing power during a filesystem check, and if you didn't have disk corruption before, you probably do now! So a journaled filesystem makes Linux behave in a more intuitive way and makes it more forgiving of mistakes like accidentally hitting the power switch or reset button. And, let's face it, even advanced users don't enjoy waiting ten minutes for their systems to reboot.

Having world-class journaled filesystems in Linux also makes it more enterprise-ready for corporate deployments. We all know how seldom Linux crashes if properly installed, but in a major data center application even a few minutes of downtime once a year may be too much, and even a small risk of corrupt filesystems cannot be tolerated. Journaled filesystems bring Linux to parity with commercial UNIX-like systems such as Irix and AIX, and this can only help Linux in the corporate marketplace.

It will take time for the commercial distrubutions to catch up with the kernel, so that Reiserfs is an integral part of the installation. Yet it will happen, and when it does Linux will take another leap forward in usability.

There is, of course, no reason why this benefit is gained only with Reiserfs. The other journaled filesystems (xfs, JFS, and ext3) each have their own advantages, and each offers something the others do not. Reiserfs is the most widely-used right now, and it has the longest track record in the Linux world. Both xfs and JFS are Linux versions of proven commercial filesystems (on Irix and AIX, respectively) but they are still considered beta quality in their Linux incarnations, and their development teams still recommend against using them for production systems. ext3 simply adds the journaling capability to regular ext2, and as such it is less disruptive and potentially less risky — but it is still called a beta. Hopefully, all four of the journaled filesystems will eventually be part of the standard kernel, letting distribution vendors and users choose the right one for their individual needs.

ReiserFS under Windows

http://sssup1.sssup.it/~pit/reiserfs.html

pit Last Updated February 08, 2002

This project is the porting of ReiserFS & Ext2 filesystem drivers of Linux (2.4.17) into a Windows system as a user mode application. The access is readonly for security and implementations reasons. I've ported first Ext2 and then extended to ReiserFS because of the absence of such tools for a Windows System.

The main idea is to mantain unchanged most of the code of the original driver and let it talk with fake linux calls. The only changes made to the ReiserFS and EXT2 code are relative to the differences between the two C/C++ compilers (GNU Extensions to C standard and bad const declarations). The main work of this porting is the se of Linux include files, full of assembly inlines, compiler specific code, and low level code.

My TODO list contains…

EXT3
Multiple filesystem support in same tool (EXT2,EXT3,REISERFS)
Automatic partition type recognition
GUI tool for file access

Use

Actually this work is in prealpha stage, so I don't have a GUI or Shell extension, just this simple utility: extdump. The extdump accepts a file or device specification, an offset of the start of the partition and finally a path to lookat. If the path is a directory it prints the listing, otherwise it dumps part of the file content

This is the example relative to my ReiserFS partition in the second Hard Disk at offset 68b3cb000. The right offset of the partition can be retrivied with a Partition Information tool or with WinHex.

To specify the location of the filesystem you have two choices:

A file, created with makefs. Offset is 0
A physical drive like \\.\PHYSICALDRIVE#, with the correct offset taken from the partition table >
```
extdump \\.\PHYSICALDRIVE1 -o 68b3cb000 /etc/nsswitch.conf
```

Here the output

Kernel:Inode-cache hash table entries: 512 (order: 0, 4096 bytes)
Kernel:REISERFS Module Initialized
Kernel:DEV Initialized
Kernel:reiserfs: checking transaction log (device aaa) ...
Kernel:(4)Using r5 hash to sort names
Kernel:ReiserFS version 3.6.25
Kernel:Super Block Loaded
Searching: /etc/nsswitch.conf
...etc....
...nsswitch.conf....
Found 70275 004DAC28
Listing some bytes of /etc/nsswitch.conf
#
# /etc/nsswitch.conf
#
# An example Name Service Switch config file. This file should be
# sorted with the most-used services at the beginning.
#
# The entry '[NOTFOUND=return]' means that the search for an
# entry should stop if the search in the previous entry turned
# up nothing. Note that if
Press any key to continue

Downloads

Executable extdump.zip

Source Code extdumpsrc.zip

Links

Linux Source Code Crossreference
ReiserFS Page by NameSys

Changes February 8, 2002

Initial release.

The Secret World of ReiserFS

http://www.newsfactor.com/story.xhtml?story_title=The_Secret_World_of_ReiserFS&story_id=23157&category=databases

By Vincent Ryan Enterprise Linux IT February 10, 2004 12:07PM

Hans Reiser, author of the ReiserFS program, created benchmarking tests designed to be fairly representative of the file-size distribution of most users. "Reiser4 does quite well on all benchmarks," he said. "With [Reiser4], we took five different technical gambles, and all of them worked. The result is very high performance."

Although there are a number of filesystems that can be used in Linux environments, ReiserFS is becoming a favorite among systems administrators. In addition to being one of the fastest filesystems available, it eliminates file corruption and uses hard-drive space more efficiently than other options.

Though it has been a long-time coming, a new version of ReiserFS — Reiser4 — is scheduled to hit the open-source world in the next month or so. This new version is a complete rewrite from version 3 and brings "journaled" filesystems to another level. It is also the first salvo in an inevitable battle with Microsoft's Longhorn filesystem.

Performance Perks

Reiser4 has a performance boost two to five times that of version 3, Hans Reiser, the program's author, told NewsFactor. The "atomic" filesystem ensures data integrity — crashes cannot corrupt file operations — and does it without a performance hit, because a new algorithm means data does not have to be copied twice. Reiser4 uses the faster "dancing trees" algorithm instead of the balanced tree algorithm previously used, so it is more space efficient than block-alignment filesystems, Reiser said.

Reiser4 also can scale to a large number of CPUs, because it uses "per node" locking. "We lock nodes in the tree that we need to modify while performing a [change to a file]," Reiser said. In version 3, the entire filesystem would lock, so multiple CPUs could not modify different parts of the tree simultaneously. Reiser4 also locks the tree from bottom up rather than top down. Most databases lock as much as they possibly need to, but Reiser4 only locks what it has to, Reiser said.

Reiser4 also features built-in encryption and compression plug-ins. Compression and encryption occur when data is "flushed" to the disk rather than with each write, Reiser said. Therefore, a frequently used file is not continually compressed and encrypted, hogging system resources.

The plug-in infrastructure of ReiserFS makes the system easily customizable, Reiser said. There are already researchers working on plug-ins that will give Reiser4 "security attributes that we couldn't have imagined," Reiser said. "We hope it will lead to all kinds of innovation," he added.

Namesys puts the ReiserFS code through a "strenuous" QA process before submitting it, Reiser noted. He said he has been working on Reiser4 for three years, and the program is about a month away from release. "Once it is submitted, I expect that it will be accepted [by the Linux community]."

Ext2, ReiserFS and XFS Benchmarks (77750 lectures)

http://bulma.net/body.phtml?nIdNoticia=642

Per Ricardo Galli Granada Creat el 22/05/2001 04:46 modificat el 22/05/2001 04:46

Conclusions

I noted disparate results among the different base file size Mongo benchmarks, so I wanted to see what would happen in a real scenario (at least for us) where the Linux VFS and cache techniques can improve enormously the global performance of the system.

In the case of the kernel compilation, Ext2 has a very low performance for copying files, for the other tests ReiserFS with notail option is the winner but the times are very close to Ext2 and XFS, the difference is less than 2% for make bzImage.

In the last case, where I mixed lseek, read, write and fsync on files of different sizes, the winner is XFS but for a very small difference, less that 8% compared to ReiserFS.

Analysing all benchmarks, it seems that for the common cases, ReiserFS and XFS have a better performance that Ext2 and with the added value of a journaled file system.

What can I say? If you are a home user or own a small server and a relatively fast CPU, use ReiserFS or XFS, both were very stable in our tests and the differences are almost inexistent.

Journaling-Filesystem Fragmentation Project

http://www.informatik.uni-frankfurt.de/~loizides/reiserfs/

Introduction

I am undertaking a small research project investigating the effect of fragmentation on the performance of the ReiserFS file system. The project is part of my master thesis at the university of Frankfurt and is financed by the Innovative Software AG.

The objective is to find and provide a method that artificially ages a filesystem so that one gets an idea of how the performance of a given filesystem behaves under different workloads.