Linux (kernel)

From Wikipedia, the free encyclopedia

Linux
Tux
Developer: Linus Torvalds (creator) and others
Latest release: 2.6.19.1 / 11 December 2006
Use: Kernel
License: GNU General Public License
Website: http://www.kernel.org/

The Linux kernel is a Unix-like operating system kernel. It is the namesake of the Linux, or GNU/Linux, family of operating systems. Released under the GNU General Public License (GPL) and developed by contributors worldwide, Linux is one of the most prominent examples of open source software.[1].

Linux was created by Linus Torvalds in 1991. At the time, the GNU Project had created many of the components required for a free software operating system, but its own kernel, GNU Hurd, was incomplete and unavailable. The BSD operating system had not yet freed itself from legal encumbrances. This left a space for the Linux kernel to fill, and despite the limited functionality of the early versions it rapidly accumulated developers and users. Early on, the Minix community contributed code and ideas to the Linux kernel, and today it has received contributions from thousands of programmers.

Contents

[edit] History

Main article: History of Linux

In April 1991 - Linus Torvalds, then 21 years old, started working on some simple ideas for an operating system. He starts with a task switcher in 386 assembly and a terminal driver. Then, on, 25 August 1991, Torvalds posts to comp.os.minix:

   
“
I'm doing a (free) operating system (just a hobby, won't be big and professional like gnu) for 386(486) AT clones. This has been brewing since April, and is starting to get ready. I'd like any feedback on things people like/dislike in minix, as my OS resembles it somewhat (same physical layout of the file-system (due to practical reasons) among other things).

I've currently ported bash(1.08) and gcc(1.40), and things seem to work. This implies that I'll get something practical within a few months [...] Yes - it's free of any minix code, and it has a multi-threaded fs. It is NOT portable (uses 386 task switching etc), and it probably never will support anything other than AT-harddisks, as that's all I have :-(.

[...] It's mostly in C, but most people wouldn't call what I write C. It uses every conceivable feature of the 386 I could find, as it was also a project to teach me about the 386. As already mentioned, it uses a MMU, for both paging (not to disk yet) and segmentation. It's the segmentation that makes it REALLY 386 dependent (every task has a 64Mb segment for code & data - max 64 tasks in 4Gb. Anybody who needs more than 64Mb/task - tough cookies). [...] Some of my "C"-files (specifically mm.c) are almost as much assembler as C. [...] Unlike minix, I also happen to LIKE interrupts, so interrupts are handled without trying to hide the reason behind them.[2]

   
”

After that, many people contributed code to the project. By September 1991, Linux version 0.01 is released. It had 10,239 lines of code. In October 1991, Linux version 0.02 is released.[3]

In December 1991, Linux 0.11 is released. This version is the first that is self-hosted - Linux 0.11 can be compiled under Linux 0.11. This was also around the time Linus adopted the GPL over his own do-not-sell-it license.

A newsgroup alt.os.linux was started, and on January 19, 1992, the first post to alt.os.linux was made. [4] March 31, 1992 had alt.os.linux become comp.os.linux [5]

The X Window System was soon ported to Linux. In March 1992, Linux version 0.95 is the first to be capable of running X. This was a big version number jump (from 0.1x to 0.9x). During the whole of 1993, and early 1994, 15 development versions of version 0.99 appeared. This is the result of the jump.

Finally, on March 14, 1994, Linux 1.0.0 is released. (176,250 lines of code). In March 1995, Linux 1.2.0 is released (310,950 lines of code).

Linus decides, on May 9, 1996, to adopt Tux the penguin as mascot for Linux.

Version 2 of Linux, released on June 9, 1996, was a landmark. Strong development continued:

[edit] Pronunciation

In 1992, Torvalds explained how he pronounces the word Linux (IPA: /'lɪnʉks/)

'li' is pronounced with a short [ee] sound: compare prInt, mInImal etc. 'nux' is also short, non-diphthong, like in pUt {IPA /ʊ/}. It's partly due to minix: linux was just my working name for the thing, and as I wrote it to replace minix on my system, the result is what it is... linus' minix became linux.

—Linus Torvalds, comp.os.linux newsgroup[21]

An audio file of Torvalds saying "Hello, this is Linus Torvalds, and I pronounce Linux as Linux" can be found online.[1] Note that in English, "Linux" and "Minix" are usually pronounced with a short i (IPA: /ɪ/) sound that is different from Torvalds' Finland Swedish pronunciation of these words.

[edit] Legal aspects

[edit] Licensing terms

Initially, Torvalds released Linux under a license which forbade any commercial exploitation. This was soon changed to the GNU General Public License (GPL). This license allows distribution and even sale of possibly modified versions of Linux but requires that all those copies be released under the same license and be accompanied by source code.

Torvalds has described licensing Linux under the GPL as the "best thing I ever did."[6]

[edit] Firmwares controversy

One point of licensing controversy is the Linux's use of firmware "binary blobs" to support some hardware devices. Richard Stallman claims that these blobs make Linux partially non-free, and that distributing Linux may even be violating the GPL (which requires "complete corresponding source code" to be available).[7]

[edit] GPL version 3

Currently, Linux is licensed under version 2 of the GPL, and there is some controversy over how easily it could be changed to use later GPL versions such as the upcoming version 3 (and whether this is desirable).[8] Torvalds himself indicated in version 2.4.0 that his own code is only under version 2.[9] However, the terms of the GPL state that if no version is specified, then any version may be used, and Alan Cox pointed out that very few other Linux contributors have specified a particular version of the GPL.[10]

[edit] Loadable Kernel Modules licensing

It is debated whether Loadable Kernel Modules (LKMs) should be considered derivative works under copyright law, and thereby fall under the terms of the GPL. Torvalds has stated his belief that LKMs using only a limited, "public" subset of the kernel interfaces can sometimes be non-derived works, thus allowing some binary-only drivers and other LKMs not obeying the GPL. Not all Linux contributors agree with this interpretation, however, and even Torvalds agrees that many LKMs are clearly derived works, and indeed he writes that "kernel modules ARE derivative 'by default'". On the other hand Torvalds has also said that "one gray area in particular is something like a driver that was originally written for another operating system (ie. clearly not a derived work of Linux in origin). [...] THAT is a gray area, and _that_ is the area where I personally believe that some modules may be considered to not be derived works simply because they weren't designed for Linux and don't depend on any special Linux behaviour."[11] Especially proprietary graphics drivers are heavily discussed. Ultimately, such questions can only be resolved by a court.

[edit] Trademark

Linux is a registered trademark of Linus Torvalds in the United States and some other countries. This is the result of an incident in which William Della Croce, Jr., who was not involved in the Linux project, trademarked the name and subsequently demanded royalties for its use. Several Linux backers retained legal counsel and filed suit against Della Croce, who agreed in 1998 to assign the trademark to Torvalds.

[edit] SCO litigation

For more details on this topic, see SCO-Linux controversies.

In March 2003, the SCO Group (SCO) filed a lawsuit against IBM claiming that IBM had contributed some portions of SCO's copyrighted Unix source code to Linux in violation of IBM's license to use that Unix source code. Additionally, SCO sent letters to a number of companies warning that their use of Linux without a license from SCO may be a violation of copyright law, and claimed in the press that they would be suing individual Linux users. This controversy has generated lawsuits by SCO against Novell, DaimlerChrysler (partially dismissed in July, 2004), and AutoZone, and retaliatory lawsuits by Red Hat and others against SCO.

[edit] Technical features

Linux supports true preemptive multitasking (both in user mode and kernel mode), virtual memory, shared libraries, demand loading, shared copy-on-write executables, memory management, the Internet protocol suite, and threading.

[edit] Architecture

Linux is a monolithic kernel. Device drivers and kernel extensions run in kernel space (ring 0), with full access to the hardware, although some exceptions run in user space. The GNU/Linux graphics subsystem (the X Window System) is not part of the kernel, is optional, and runs in user space, in contrast with Microsoft Windows.

Kernel mode preemption means device drivers can be preempted under certain conditions. This latter feature was added to handle hardware interrupts correctly, and to improve support for symmetric multiprocessing (SMP). Preemption also improves latency, increasing responsiveness and making Linux more suitable for real-time applications.

The fact that Linux is not a microkernel was the topic of the Tanenbaum-Torvalds debate[12] which was started in 1992 by Andrew S. Tanenbaum with Linus Torvalds regarding Linux and kernel architecture in general on the Usenet discussion group comp.os.minix.[13] Tanenbaum argues that microkernels are superior to monolithic kernels and that, for this reason, Linux is obsolete. This subject was revisited in 2006.[14] [15]

Unlike traditional monolithic kernels, device drivers are easily configured as modules, and loaded or unloaded while running the system.

[edit] Kernel panic

Main article: Kernel panic

In Linux, a "panic" is an unrecoverable system error detected by the kernel as opposed to similar errors detected by user space code. It is possible for kernel code to indicate such a condition by calling the panic function located in the header file sys/system.h. However, most panics are the result of unhandled processor exceptions in kernel code, such as references to invalid memory addresses. These are typically indicative of a bug somewhere in the call chain leading to the panic. They can also indicate a failure of hardware, such as a failed RAM cell or errors in arithmetic functions in the processor caused by a processor bug, overheating/damaged processor, or a soft error.

[edit] Programming languages

Linux is written in that version of the C programming language which is supported by GCC (which has introduced a number of extensions and changes to standard C), together with a number of relatively short sections of code written in the assembly language (in GCC's "AT&T-style" syntax) of the target architecture. Because of the extensions to C it supports, GCC was for a long time the only compiler capable of correctly building Linux. Recently, Intel claims to have modified its C compiler so that it is also capable of correctly compiling it.[16]

Many other languages are used in some way, primarily in connection with the kernel build process (the methods whereby the bootable image is created from the sources). These include Perl, Python, and various shell scripting languages. Some drivers may also be written in C++, Fortran, or other languages, but this is strongly discouraged. Linux's build system only officially supports GCC as a kernel and driver compiler.

[edit] Portability

While not originally designed to be portable, Linux is now one of the most widely ported operating system kernels, running on a diverse range of systems from the iPAQ (a handheld computer) to the IBM System z9 (a massive mainframe server that can run hundreds or even thousands of concurrent Linux instances). Linux runs as the main operating system on IBM's Blue Gene supercomputers. As of June 2006, Linux is the OS on 75% of systems on the Top 500 supercomputers list, including the top two on the list. Torvalds included, perhaps humorously, BogoMips into the kernel as a performance pseudo-comparison tool.

It is important to note that Torvalds' efforts were also directed successfully at a different sort of portability. Portability, according to Torvalds, was the ability to easily compile applications from a variety of sources on his system; thus Linux originally became popular in part because it required the least effort to get popular free software and other open source applications running.

Linux supports the following machine architectures:

  • Alpha architecture:
  • H8 architecture from Renesas Technology, formerly Hitachi.
    • H8/300
    • H8/500

[edit] Supported binary formats

Until version 1.2, the a.out binary format was used. Now the default format is ELF, or Executable and Linkable Format. Linux supports many other binary formats.

[edit] Versions

Further developing his own code and integrating changes made by other programmers, Linus Torvalds keeps releasing new versions of the Linux kernel. These are called "vanilla" kernels, meaning they have not been modified by anyone. Many Linux operating system vendors modify the kernels of their product, mainly in order to add support for drivers or features which have not officially been released as stable, while some distributions, such as Slackware, rely on vanilla kernels.

[edit] Version numbering

The version number of the Linux kernel currently consists of four numbers, following a recent change in the long-standing policy of a three-number versioning scheme. For illustration, let it be assumed that the version number is composed thus: A.B.C[.D] (e.g. 2.2.1, 2.4.13 or 2.6.12.3).

  • The A number denotes the kernel version. It is changed least frequently, and only when major changes in the code and the concept of the kernel occur. It has been changed twice in the history of the kernel: In 1994 (version 1.0) and in 1996 (version 2.0).
  • The B number denotes the major revision of the kernel.
    • Prior to the Linux 2.6.x series, even numbers indicate a stable release, i.e. one that is deemed fit for production use, such as 1.2, 2.4 or 2.6. Odd numbers have historically been development releases, such as 1.1 or 2.5. They were for testing new features and drivers until they became sufficiently stable to be included in a stable release.
    • Starting with the Linux 2.6.x series, there is no significance to even or odd numbers, with new feature development going on in the same kernel series. Linus Torvalds has stated that this will be the model for the foreseeable future.
  • The C number indicates the minor revision of the kernel. In the old three-number versioning scheme, this was changed when security patches, bugfixes, new features or drivers were implemented in the kernel. With the new policy, however, it is only changed when new drivers or features are introduced; minor fixes are handled by the D number.
  • A D number first occurred when a grave error, which required immediate fixing, was encountered in 2.6.8's NFS code. However, there were not enough other changes to legitimize the release of a new minor revision (which would have been 2.6.9). So, 2.6.8.1 was released, with the only change being the fix of that error. With 2.6.11, this was adopted as the new official versioning policy. Bug-fixes and security patches are now managed by the fourth number, whereas bigger changes are only implemented in minor revision changes (the C number).

Also, sometimes after the version there will be some more letters such as 'rc1' or 'mm2'. The 'rc' refers to release candidate and indicates a non-official release. Other letters are usually (but not always) the initials of a person. This indicates a development branch of the kernel by that person. e.g. ck stands for Con Kolivas, ac stands for Alan Cox, whereas mm stands for Andrew Morton.

The development model for Linux 2.6 was a significant change from the development model for Linux 2.5. Previously there was a stable branch (2.4) where only relatively minor and safe changes were merged, and an unstable branch (2.5), where bigger changes and cleanups were allowed. This meant that users would always have a well-tested 2.4 version with the latest security and bug fixes to use, though they would have to wait for the features which went into the 2.5 branch. The 2.5 branch was then eventually declared stable and renamed to 2.6. But instead of opening an unstable 2.7 branch, the kernel developers elected to continue putting major changes into the 2.6 "stable" branch. This had the desirable effect of not having to maintain an old stable branch, making new features quickly available, and getting more testing of the latest code.

However, the new 2.6 development model also meant that there was no stable branch for people just wanting security and bug fixes, and not needing the latest features. Fixes were only put into the latest version, so if a user wanted a version with all known bugs fixed they would also get all the latest features, which had not been well tested, and risked breaking things which had previously worked. A partial fix for this was the previously mentioned fourth version number digit (y in 2.6.x.y), which are series of point releases created by the stable team (Greg Kroah-Hartman, Chris Wright, maybe others). The stable team only released updates for the most recent kernel however, so this did not solve the problem of the missing stable kernel series. Linux distribution vendors, such as Red Hat and Debian, maintain the kernels which ship with their releases, so a solution for some people is to just follow a vendor kernel.

As a response to the lack of a stable kernel tree where people could coordinate the collection of bugfixes, in December of 2005 Adrian Bunk announced that he would keep releasing 2.6.16.y kernels when the stable team moved on to 2.6.17 [2]. He also plans to include driver updates, making the maintenance of the 2.6.16 series very similar to the old rules for maintenance of a stable series such as 2.4 [3].

[edit] Maintenance

While Linus Torvalds supervises code changes and releases to the latest kernel versions, he has delegated the maintenance of older versions to other programmers:

Kernel series Maintainer
2.4 Willy Tarreau (former maintainer Marcelo Tosatti)
2.6.16 Adrian Bunk
2.6 Andrew Morton / Linus Torvalds

Other Linux kernel programmers include Robert Love and Ingo Molnar. (See the Linux MAINTAINERS file).

[edit] Stable version history

  • Version 1.0 of March 1994 supported only single-processor i386 machines.
  • Version 1.2 of March 1995 added support for Alpha, Sparc and MIPS.
  • Version 2.0 of June 1996 added support for more processors and included SMP support.
  • Version 2.2 of January 1999 (The Wonderful World of Linux 2.2).
  • Version 2.6 - current (December 17, 2003 to the present)[20]
    • integrated uClinux (for microcontrollers) [citation needed]
    • CPU support: with support for Hitachi's H8/300 series, the NEC v850, and Motorola's embedded m68k processors, NUMA support, support for NCR's Voyager architecture, support for Intel's hyperthreading and Physical Address Extension (PAE)
    • integrated the ALSA sound driver
    • OS support:
      • Improved APIC support.
      • Increased the maximum number of users and groups each from 65,536 to over 4 billion.
      • Increased the maximum number of process ids from 32,768 to over 1 billion.
      • Increased the maximum number of device types (major device) from 255 to 4095 and the maximum number of devices of each type (minor device) from 255 to more than a million.
      • Improved 64-bit support and filesystems of up to 16 terabytes on common hardware.
      • Improvements to the "overall responsiveness" for interactive processes (the kernel became fully pre-emptible and the I/O scheduler was rewritten).
      • Support for futexes, a rewrite of threading infrastructure to allow the Native POSIX Thread Library (NPTL) to be used.
      • An improved module loader.
      • User-mode Linux integration.
      • 2.6.11 Infiniband support
    • Storage Support:
      • LVM version 2
      • support for SGI's XFS filesystem.
      • A new "system filesystem" called sysfs, destined to relieve procfs of its system related information.
      • 2.6.12 (17 June 2005) iSCSI support
      • 2.6.13 inotify support
      • 2.6.14 9P support
      • 2.6.14 FUSE support
      • 2.6.17 Online reshaping of software raid5/6

[edit] References

  1. ^ September 25, 2006 e-mail by Linus Torvalds: Linux is open source, not free software.
  2. ^ What would you like to see most in minix?.
  3. ^ Free minix-like kernel sources for 386-AT.
  4. ^ Troubles with Partitions.
  5. ^ It's here!.
  6. ^ http://hotwired.goo.ne.jp/matrix/9709/5_linus.html
  7. ^ http://www.gnu.org/philosophy/linux-gnu-freedom.html
  8. ^ http://lwn.net/Articles/169797/
  9. ^ http://www.uwsg.iu.edu/hypermail/linux/kernel/0009.1/0096.html
  10. ^ http://lwn.net/Articles/169831/
  11. ^ http://www.ussg.iu.edu/hypermail/linux/kernel/0312.0/0670.html
  12. ^ O'Reilly (1999). The Tanenbaum-Torvalds Debate. O'Reilly. Retrieved on November 22, 2006.
  13. ^ Tanenbaum, A. (1992). LINUX is obsolete. Google Groups. Retrieved on May 10, 2006.
  14. ^ "diegocgteleline.es" & "ScuttleMonkey" (2006). Torvalds on the Microkernel Debate. Slashdot. Retrieved on September 28, 2006.
  15. ^ Tanenbaum, A. (2006). Tanenbaum-Torvalds Debate: Part II. Tanenbaum, A.. Retrieved on November 22, 2006.
  16. ^ http://www.pyrillion.org/index.html?showframe=linuxkernelpatch.html
  17. ^ http://palmtelinux.sf.net
  18. ^ ELKS project
  19. ^ Wonderful World of Linux 2.4
  20. ^ The Wonderful World of Linux 2.6

[edit] See also

  • menuconfig — a tool for configuring Linux prior to building it from source
  • Revolution OS — a documentary on the history of Linux featuring several interviews with prominent hackers, including Torvalds

[edit] External links

Wikibooks
Wikibooks has more about this subject:
Wikibooks
Wikibooks has more about this subject:

The following are to be merged

[edit] CPU Ports

[edit] Linux 2.6