knowledge sharing session: Understanding Filesystems

Linux root filesystem

• The organization of data is done using the root file system

• The program binary files and libraries separated and located into specific directories

• Basic programs are installed in /bin and /sbin and basic libraries in /lib

• All other programs are installed in /usr/bin and /usr/sbin and all other libraries in /usr/lib

• In the past, on Unix systems, /usr was very often mounted over the network, through NFS

• In order to allow the system to boot when the network was down, some binaries and libraries are stored in /bin, /sbin and /lib

• /bin and /sbin contain programs like ls, ifconfig, cp, bash, etc.

• /lib contains the C library and sometimes a few other basic libraries

• All other programs and libraries are in /usr

Important directories

• /bin Basic programs

• /boot Kernel image (only when the kernel is loaded from a filesystem, not common on non-x86 architectures)

• /dev Device files

• /etc System-wide configuration

• /home Directory for the users home directories

• /lib Basic libraries

• /media Mount points for removable media

• /mnt Mount points for static media

• /proc Mount point for the proc virtual filesystem

• /root Home directory of the root user

• /sbin Basic system programs

• /sys Mount point of the sysfs virtual filesystem

• /tmp Temporary files

• /usr /usr/bin Non-basic programs

• /usr/lib Non-basic libraries

• /usr/sbin Non-basic system programs

• /var Variable data files. This includes spool directories and files, administrative and logging data, and transient and temporary

Library components

Proc file system

• The proc virtual filesystem exists since the beginning of Linux

• It allows

– The kernel to expose statistics about running processes in the system

– The user to adjust at runtime various system parameters about process management, memory management, etc.

• The proc filesystem is used by many standard userspace applications, and they expect it to be mounted in /proc

• Applications such as ps or top would not work without the proc filesystem

• Command to mount /proc:

– mount -t proc nodev /proc

• Documentation/filesystems/proc.txt in the kernel sources

• man proc

/proc contents

• One directory for each running process in the system

– /proc/<pid>

– cat /proc/3840/cmdline

– It contains details about the les opened by the process, the CPU and memory usage, etc.

• /proc/interrupts, /proc/devices, /proc/iomem, /proc/ioports contain general device-related information

• /proc/cmdline contains the kernel command line

• /proc/sys contains many files that can be written to to adjust kernel parameters

– They are called sysctl. See Documentation/sysctl/ in kernel sources.

– Example

• echo 3 > /proc/sys/vm/drop_caches

Sysfs filesystem

• The sysfs filesystem is a feature integrated in the 2.6 Linux kernel

• It allows to represent in userspace the vision that the kernel has of the buses, devices and drivers in the system

• It is useful for various userspace applications that need to list and query the available hardware, for example udev

• All applications using sysfs expect it to be mounted in the /sys directory

• Command to mount /sys:

– mount -t sysfs nodev /sys

– $ ls /sys/

block bus class dev devices firmware fs kernel

Basic applications

• In order to work, a Linux system needs at least a few applications

• An init application, which is the first userspace application started by the kernel after mounting the root filesystem

• The kernel tries to run /sbin/init, /bin/init, /etc/init and /bin/sh.

• If none of them are found, the kernel panics and the boot process is stopped.

• The init application is responsible for starting all other userspace applications and services

• Usually a shell, to allow a user to interact with the system

• Basic Unix applications, to copy les, move les, list files (commands like mv, cp, mkdir, cat, etc.)

• Those basic components have to be integrated into the root filesystem to make it usable

Overall booting process

Building root filesystem

• Basic root file system structure

– The top level directories in the root file system each have a specific use and purpose.

– Common top-level directories:

– /bin/, /boot, /dev, /etc, /home, /lib, /proc, /root, /tmp, /usr, /var etc

• To work on the root filesystem, let's move into the directory we created for this purpose:

$ cd ${PRJROOT}/rootfs

• We now create the core root filesystem directories required for our system:

• $ mkdir bin dev etc lib proc sbin tmp usr var

• For the various application packages

• We can proceed with the creation of the /usr hierarchy:

• $ mkdir usr/bin usr/lib usr/sbin

• The last entries to create are in the /var directory:

• $ mkdir var/lib var/lock var/log var/run var/tmp

• With the root filesystem skeleton now ready, let's place the various software components in their appropriate locations.

• Placa all the libraries including glibc and application libraries in /lib dir

• Copy the compiled kernel modules in lib dir

• cp “all_kernel_modules.ko” ${PRJROOT}/rootfs/lib/modules

• Create the device files in ${PRJROOT}/rootfs/dev dir using mknod command

• # mknod -m 600 mem c 1 1

• # mknod -m 666 null c 1 3

• # mknod -m 666 zero c 1 5

• # mknod -m 644 random c 1 8

Main system applications

• Beyond the kernel's functionality and the root filesystem's structure, Linux inherits Unix's very rich command set.

• We can decide what all commands we need to include into the target box.

• There are two approaches

– Select and cross compile each of the application, ie complete standard application

– Use busybox

Complete standard application

• If you would like to selectively include some of the standard applications found in main stream distributions, your best bet is to start with the Linux From Scratch project located at http://www.linuxfromscratch.org/.

• This project aims at providing explanations and links to packages to help you build your own custom distributions.

Busybox

• A Linux system needs a basic set of programs to work

– An init program

– A shell

• Various basic utilities for le manipulation and system configuration

• In normal Linux systems, those programs are provided by different projects

– coreutils, bash, grep, sed, tar, wget, modutils, etc. are all different projects

– A lot of different components to integrate

– Components not designed with embedded systems constraints in mind: they are not very configurable and have a wide range of features

• Busybox is an alternative solution, extremely common on embedded systems

General purpose toolbox: Busybox

• Rewrite of many useful Unix command line utilities

– Integrated into a single project, which makes it easy to work with

– Designed with embedded systems in mind: highly configurable, no unnecessary features

• All the utilities are compiled into a single executable, /bin/busybox

– Symbolic links to /bin/busybox are created for each application integrated into Busybox

• http://www.busybox.net/

• Although BusyBox does not support all the options provided by the commands it replaces, the subset it provides is sufficient for most typical uses

Configuring busybox

• Get the latest stable sources from http://busybox.net

• Congure BusyBox (creates a .config le):

– make defconfig

Good to begin with BusyBox. Congures BusyBox with all options for regular users.

• make allnoconfig

– Unselects all options. Good to congure only what you need.

• make xconfig (graphical, needs the libqt3-mt-dev package) or make menuconfig (text)

– Same conguration interfaces as the ones used by the Linux kernel (though older versions are used).

Compiling Busybox

• Using “make menuconfig” command you can choose the commands to compile and the commands options and features that you need.

• Set the cross-compiler prefix in the conguration interface:

– BusyBox Settings -> Build Options -> Cross Compiler prefix

– Example: arm-linux-

• Set the installation directory in the conguration interface:

– BusyBox Settings -> Installation Options -> BusyBox installation prefix

• Add the cross-compiler path to the PATH environment variable:

– export PATH=/usr/xtools/arm-unknown-linux-uclibcgnueabi/bin:$PATH

• Compile BusyBox:

– Make

– Install it (this creates a Unix directory structure symbolic links to the busybox executable):

• make install

Using busybox commands

• To understand how best to use BusyBox, let's first take a look at the components installed on the target's root filesystem by BusyBox's build process. As observed there is only one executable was installed, /bin/busybox.

• This is the single binary with support for all the commands configured This binary is never called directly, however. Instead, symbolic links bearing the original commands' names have been created to /bin/busybox. Such symbolic links have been created in all the directories in which the original commands would be found, including /bin, /sbin, /usr/bin, and /usr/sbin.

• When you type a command during the system's normal operation, the busybox command is invoked via the symbolic link. In turn, busybox determines the actual command you were invoking using the name being used to run it. /bin/ls, for instance, points to /bin/busybox.

• When you type ls, the busybox command is called and it determines that you were trying to use the ls command, because ls is the first argument on the command line

Busy box

• Busy box provides all the required functionality while still remaining a very small-size application

• BusyBox implements many commands. Here are a few:

• ar, cat, chgrp, chmod, chown, chroot, cp, cpio, date, dd, df, dmesg, dos2unix, du, echo, env, expr, find, grep, gunzip, gzip, halt, id, ifconfig, init, insmod, kill, killall, ln, ls, lsmod, md5sum, mkdir, mknod, modprobe, more, mount, mv, ping, ps, pwd, reboot, renice, rm, rmdir, rmmod, route, rpm2cpio, sed, stty, swapon, sync, syslogd, tail, tar, telnet, tftp, touch, traceroute, umount, uname, uuencode, vi, wc, which, and whoami.

• Although BusyBox does not support all the options provided by the commands it replaces, the subset it provides is sufficient for most typical uses.

knowledge sharing session

Saturday, 3 December 2016

Understanding Filesystems

No comments:

Post a Comment