Usenet

A diagram of Usenet servers and clients. The blue, green, and red dots on the servers represent the groups they carry. Arrows between servers indicate newsgroup group exchanges (feeds). Arrows between clients and servers indicate that a user is subscribed to a certain group and reads or submits articles.

Usenet is a worldwide distributed discussion system available on computers. It was developed from the general-purpose UUCP dial-up network architecture. Tom Truscott and Jim Ellis conceived the idea in 1979, and it was established in 1980.[1] Users read and post messages (called articles or posts, and collectively termed news) to one or more categories, known as newsgroups. Usenet resembles a bulletin board system (BBS) in many respects and is the precursor to Internet forums that are widely used today. Discussions are threaded, as with web forums and BBSs, though posts are stored on the server sequentially. The name comes from the term "users network".[2][3]

One notable difference between a BBS or web forum and Usenet is the absence of a central server and dedicated administrator. Usenet is distributed among a large, constantly changing conglomeration of servers that store and forward messages to one another in so-called news feeds. Individual users may read messages from and post messages to a local server operated by a commercial usenet provider, their Internet service provider, university, employer, or their own server.

Usenet has significant cultural importance in the networked world, having given rise to, or popularized, many widely recognized concepts and terms such as "FAQ", "flame", and "spam".[4]

Introduction

Usenet was conceived in 1979 and publicly established in 1980, at the University of North Carolina at Chapel Hill and Duke University,[5][1] over a decade before the World Wide Web was developed and the general public received access to the Internet, making it one of the oldest computer network communications systems still in widespread use. It was originally built on the "poor man's ARPANET", employing UUCP as its transport protocol to offer mail and file transfers, as well as announcements through the newly developed news software such as A News. The name Usenet emphasized its creators' hope that the USENIX organization would take an active role in its operation.[6]

The articles that users post to Usenet are organized into topical categories known as newsgroups, which are themselves logically organized into hierarchies of subjects. For instance, sci.math and sci.physics are within the sci.* hierarchy, for science. Or, talk.origins and talk.atheism are in the talk.* hierarchy. When a user subscribes to a newsgroup, the news client software keeps track of which articles that user has read.[7]

In most newsgroups, the majority of the articles are responses to some other article. The set of articles that can be traced to one single non-reply article is called a thread. Most modern newsreaders display the articles arranged into threads and subthreads.

When a user posts an article, it is initially only available on that user's news server. Each news server talks to one or more other servers (its "newsfeeds") and exchanges articles with them. In this fashion, the article is copied from server to server and should eventually reach every server in the network. The later peer-to-peer networks operate on a similar principle, but for Usenet it is normally the sender, rather than the receiver, who initiates transfers. Usenet was designed under conditions when networks were much slower and not always available. Many sites on the original Usenet network would connect only once or twice a day to batch-transfer messages in and out.[8] This is largely because the POTS network was typically used for transfers, and phone charges were lower at night.

The format and transmission of Usenet articles is similar to that of Internet e-mail messages. The difference between the two is that Usenet articles can be read by any user whose news server carries the group to which the message was posted, as opposed to email messages, which have one or more specific recipients.[9]

Today, Usenet has diminished in importance with respect to Internet forums, blogs and mailing lists. Usenet differs from such media in several ways: Usenet requires no personal registration with the group concerned; information need not be stored on a remote server; archives are always available; and reading the messages requires not a mail or web client, but a news client. The groups in alt.binaries are still widely used for data transfer.

ISPs, news servers, and newsfeeds

Many Internet service providers, and many other Internet sites, operate news servers for their users to access. ISPs that do not operate their own servers directly will often offer their users an account from another provider that specifically operates newsfeeds. In early news implementations, the server and newsreader were a single program suite, running on the same system. Today, one uses separate newsreader client software, a program that resembles an email client but accesses Usenet servers instead. Some clients such as Mozilla Thunderbird and Outlook Express provide both abilities.

Not all ISPs run news servers. A news server is one of the most difficult Internet services to administer well because of the large amount of data involved, small customer base (compared to mainstream Internet services such as email and web access), and a disproportionately high volume of customer support incidents (frequently complaining of missing news articles that are not the ISP's fault). Some ISPs outsource news operation to specialist sites, which will usually appear to a user as though the ISP ran the server itself. Many sites carry a restricted newsfeed, with a limited number of newsgroups. Commonly omitted from such a newsfeed are foreign-language newsgroups and the alt.binaries hierarchy which largely carries software, music, videos and images, and accounts for over 99 percent of article data.

There are also Usenet providers that specialize in offering service to users whose ISPs do not carry news, or that carry a restricted feed.

See also news server operation for an overview of how news systems are implemented.

Newsreaders

Newsgroups are typically accessed with newsreaders: applications that allow users to read and reply to postings in newsgroups. These applications act as clients to one or more news servers. Although historically, Usenet was associated with the Unix operating system developed at AT&T, newsreaders are available for all major operating systems.[10] Modern mail clients or "communication suites" commonly also have an integrated newsreader. Often, however, these integrated clients are of low quality, compared to standalone newsreaders, and incorrectly implement Usenet protocols, standards and conventions. Many of these integrated clients, for example the one in Microsoft's Outlook Express, are disliked by purists because of their misbehavior.[11]

With the rise of the World Wide Web (WWW), web front-ends (web2news) have become more common. Web front ends have lowered the technical entry barrier requirements to that of one application and no Usenet NNTP server account. There are numerous websites now offering web based gateways to Usenet groups, although some people have begun filtering messages made by some of the web interfaces for one reason or another.[12][13] Google Groups[14] is one such web based front end and some web browsers can access Google Groups via news: protocol links directly.[15]

Moderated and unmoderated newsgroups

A minority of newsgroups are moderated, meaning that messages submitted by readers are not distributed directly to Usenet, but instead are emailed to the moderators of the newsgroup for approval. The moderator is to receive submitted articles, review them, and inject approved articles so that they can be properly propagated worldwide. Articles approved by a moderator must bear the Approved: header line. Moderators ensure that the messages that readers see in the newsgroup conform to the charter of the newsgroup, though they are not required to follow any such rules or guidelines.[16] Typically, moderators are appointed in the proposal for the newsgroup, and changes of moderators follow a succession plan.[17]

Historically, a mod.* hierarchy existed before Usenet reorganization.[18] Now, moderated newsgroups may appear in any hierarchy, typically with .moderated added to the group name.

Usenet newsgroups in the Big-8 hierarchy are created by proposals called a Request for Discussion, or RFD. The RFD is required to have the following information: newsgroup name, checkgroups file entry, and moderated or unmoderated status. If the group is to be moderated, then at least one moderator with a valid email address must be provided. Other information which is beneficial but not required includes: a charter, a rationale, and a moderation policy if the group is to be moderated.[19] Discussion of the new newsgroup proposal follows, and is finished with the members of the Big-8 Management Board making the decision, by vote, to either approve or disapprove the new newsgroup.

Unmoderated newsgroups form the majority of Usenet newsgroups, and messages submitted by readers for unmoderated newsgroups are immediately propagated for everyone to see. Minimal editorial content filtering vs propagation speed form one crux of the Usenet community. One little cited defense of propagation is canceling a propagated message, but few Usenet users use this command and some news readers do not offer cancellation commands, in part because article storage expires in relatively short order anyway. Almost all unmoderated Usenet groups have become collections of spam.[20][21][22]

Creation of moderated newsgroups often becomes a hot subject of controversy, raising issues regarding censorship and the desire of a subset of users to form an intentional community.

Technical details

Usenet is a set of protocols for generating, storing and retrieving news "articles" (which resemble Internet mail messages) and for exchanging them among a readership which is potentially widely distributed. These protocols most commonly use a flooding algorithm which propagates copies throughout a network of participating servers. Whenever a message reaches a server, that server forwards the message to all its network neighbors that haven't yet seen the article. Only one copy of a message is stored per server, and each server makes it available on demand to the (typically local) readers able to access that server. The collection of Usenet servers has thus a certain peer-to-peer character in that they share resources by exchanging them, the granularity of exchange however is on a different scale than a modern peer-to-peer system and this characteristic excludes the actual users of the system who connect to the news servers with a typical client-server application, much like an email reader.

RFC 850 was the first formal specification of the messages exchanged by Usenet servers. It was superseded by RFC 1036 and subsequently by RFC 5536 and RFC 5537.

In cases where unsuitable content has been posted, Usenet has support for automated removal of a posting from the whole network by creating a cancel message, although due to a lack of authentication and resultant abuse, this capability is frequently disabled. Copyright holders may still request the manual deletion of infringing material using the provisions of World Intellectual Property Organization treaty implementations, such as the United States Online Copyright Infringement Liability Limitation Act, but this would require giving notice to each individual news server administrator.

On the Internet, Usenet is transported via the Network News Transfer Protocol (NNTP) on TCP Port 119 for standard, unprotected connections and on TCP port 563 for SSL encrypted connections which is offered only by a few sites.

Organization

The "Big Nine" hierarchies of Usenet

The major set of worldwide newsgroups is contained within nine hierarchies, eight of which are operated under consensual guidelines that govern their administration and naming. The current Big Eight are:

See also the Great Renaming.

The alt.* hierarchy is not subject to the procedures controlling groups in the Big Eight, and it is as a result less organized. Groups in the alt.* hierarchy tend to be more specialized or specific—for example, there might be a newsgroup under the Big Eight which contains discussions about children's books, but a group in the alt hierarchy may be dedicated to one specific author of children's books. Binaries are posted in alt.binaries.*, making it the largest of all the hierarchies.

Many other hierarchies of newsgroups are distributed alongside these. Regional and language-specific hierarchies such as japan.*, malta.* and ne.* serve specific countries and regions such as Japan, Malta and New England. Companies and projects administer their own hierarchies to discuss their products and offer community technical support, such as the historical gnu.* hierarchy from the Free Software Foundation. Microsoft closed its newsserver in June 2010, providing support for its products over forums now.[23] Some users prefer to use the term "Usenet" to refer only to the Big Eight hierarchies; others include alt as well. The more general term "netnews" incorporates the entire medium, including private organizational news systems.

Informal sub-hierarchy conventions also exist. *.answers are typically moderated cross-post groups for FAQs. An FAQ would be posted within one group and a cross post to the *.answers group at the head of the hierarchy seen by some as a refining of information in that news group. Some subgroups are recursive—to the point of some silliness in alt.*.

Binary content

A visual example of the many complex steps required to prepare data to be uploaded to Usenet newsgroups. These steps must be done again in reverse to download data from Usenet.

Usenet was originally created to distribute text content encoded in the 7-bit ASCII character set. With the help of programs that encode 8-bit values into ASCII, it became practical to distribute binary files as content. Binary posts, due to their size and often-dubious copyright status, were in time restricted to specific newsgroups, making it easier for administrators to allow or disallow the traffic.

The oldest widely used encoding method for binary content is uuencode, from the Unix UUCP package. In the late 1980s, Usenet articles were often limited to 60,000 characters, and larger hard limits exist today. Files are therefore commonly split into sections that require reassembly by the reader.

With the header extensions and the Base64 and Quoted-Printable MIME encodings, there was a new generation of binary transport. In practice, MIME has seen increased adoption in text messages, but it is avoided for most binary attachments. Some operating systems with metadata attached to files use specialized encoding formats. For Mac OS, both Binhex and special MIME types are used.

Other lesser known encoding systems that may have been used at one time were BTOA, XX encoding, BOO, and USR encoding.

In an attempt to reduce file transfer times, an informal file encoding known as yEnc was introduced in 2001. It achieves about a 30% reduction in data transferred by assuming that most 8-bit characters can safely be transferred across the network without first encoding into the 7-bit ASCII space.

The most common method of uploading large binary posts to Usenet is to convert the files into RAR archives and create Parchive files for them. Parity files are used to recreate missing data when not every part of the files reaches a server.

Binary retention time

This is a list of some of the biggest binary groups. With 1341+ days retention, the (binary) Usenet storage (which binsearch.info indexes) is more than 33 petabytes 33000 terabytes).[24]

Each news server generally allocates a certain amount of storage space for post content in each newsgroup. When this storage has been filled, each time a new post arrives, old posts are deleted to make room for the new content. If the network bandwidth available to a server is high but the storage allocation is small, it is possible for a huge flood of incoming content to overflow the allocation and push out everything that was in the group before it. If the flood is large enough, the beginning of the flood will begin to be deleted even before the last part of the flood has been posted.

Binary newsgroups are only able to function reliably if there is sufficient storage allocated to a group to allow readers enough time to download all parts of a binary posting before it is flushed out of the group's storage allocation. This was at one time how posting of undesired content was countered; the newsgroup would be flooded with random garbage data posts, of sufficient quantity to push out all the content to be suppressed. This has been compensated by service providers allocating enough storage to retain everything posted each day, including such spam floods, without deleting anything.

The average length of time that posts are able to stay in the group before being deleted is commonly called the retention time. Generally the larger Usenet servers have enough capacity to archive several years of binary content even when flooded with new data at the maximum daily speed available. A good binaries service provider must not only accommodate users of fast connections (3 megabit) but also users of slow connections (256 kilobit or less) who need more time to download content over a period of several days or weeks.

Major NSPs have a retention time of more than 4 years.[25] This results in more than 33 petabytes (33000 terabytes) of storage.[26]

In part because of such long retention times, as well as growing Internet upload speeds, Usenet is also used by individual users to store backup data in a practice called Usenet backup, or uBackup.[27] While commercial providers offer more easy to use online backup services, storing data on Usenet is free of charge (although access to Usenet itself may not be). The method requires the user to manually select, prepare and upload the data. Because anyone can potentially download the backup files, the data is typically encrypted. After the files are uploaded, the uploader does not have any control over them; the files are automatically copied to all Usenet providers, so there will be multiple copies of it spread over different geographical locations around the world—desirable in a backup scheme.

While binary newsgroups can be used to distribute completely legal user-created works, open-source software, and public domain material, some binary groups are used to illegally distribute commercial software, copyrighted media, and obscene material.

ISP-operated Usenet servers frequently block access to all alt.binaries.* groups to both reduce network traffic and to avoid related legal issues. Commercial Usenet service providers claim to operate as a telecommunications service, and assert that they are not responsible for the user-posted binary content transferred via their equipment. In the United States, Usenet providers can qualify for protection under the DMCA Safe Harbor regulations, provided that they establish a mechanism to comply with and respond to takedown notices from copyright holders.[28]

Removal of copyrighted content from the entire Usenet network is a nearly impossible task, due to the rapid propagation between servers and the retention done by each server. Petitioning a Usenet provider for removal only removes it from that one server's retention cache, but not any others. It is possible for a special post cancellation message to be distributed to remove it from all servers, but many providers ignore cancel messages by standard policy, because they can be easily falsified and submitted by anyone.[29][30] For a takedown petition to be most effective across the whole network, it would have to be issued to the origin server to which the content has been posted, before it has been propagated to other servers. Removal of the content at this early stage would prevent further propagation, but with modern high speed links, content can be propagated as fast as it arrives, allowing no time for content review and takedown issuance by copyright holders.[31]

Establishing the identity of the person posting illegal content is equally difficult due to the trust-based design of the network. Like SMTP email, servers generally assume the header and origin information in a post is true and accurate. However, as in SMTP email, Usenet post headers are easily falsified so as to obscure the true identity and location of the message source.[32] In this manner, Usenet is significantly different from modern P2P services; most P2P users distributing content are typically immediately identifiable to all other users by their network address, but the origin information for a Usenet posting can be completely obscured and unobtainable once it has propagated past the original server.[33]

Also unlike modern P2P services, the identity of the downloaders is hidden from view. On P2P services a downloader is identifiable to all others by their network address. On Usenet, the downloader connects directly to a server, and only the server knows the address of who is connecting to it. Some Usenet providers do keep usage logs, but not all make this logged information casually available to outside parties such as the Recording Industry Association of America.[34][35] The existence of anonymising gateways to USENET also complicates the tracing of a postings true origin.

History

Newsgroup experiments first occurred in 1979. Tom Truscott and Jim Ellis of Duke University came up with the idea as a replacement for a local announcement program, and established a link with nearby University of North Carolina using Bourne shell scripts written by Steve Bellovin. The public release of news was in the form of conventional compiled software, written by Steve Daniel and Truscott.[5][37] In 1980, Usenet was connected to ARPANET through UC Berkeley which had connections to both Usenet and ARPANET. Mark Horton, the graduate student who set up the connection, began "feeding mailing lists from the ARPANET into Usenet" with the "fa" ("From ARPANET"[38]) identifier.[39] Usenet gained 50 member sites in its first year, including Reed College, University of Oklahoma, and Bell Labs,[5] and the number of people using the network increased dramatically; however, it was still a while longer before Usenet users could contribute to ARPANET.[40]

Network

UUCP networks spread quickly due to the lower costs involved, and the ability to use existing leased lines, X.25 links or even ARPANET connections. By 1983, thousands of people participated from more than 500 hosts, mostly universities and Bell Labs sites but also a growing number of Unix-related companies; the number of hosts nearly doubled to 940 in 1984. More than 100 newsgroups existed, more than 20 devoted to Unix and other computer-related topics, and at least a third to recreation.[41][5] As the mesh of UUCP hosts rapidly expanded, it became desirable to distinguish the Usenet subset from the overall network. A vote was taken at the 1982 USENIX conference to choose a new name. The name Usenet was retained, but it was established that it only applied to news.[42] The name UUCPNET became the common name for the overall network.

In addition to UUCP, early Usenet traffic was also exchanged with Fidonet and other dial-up BBS networks. Widespread use of Usenet by the BBS community was facilitated by the introduction of UUCP feeds made possible by MS-DOS implementations of UUCP, such as UFGATE (UUCP to FidoNet Gateway), FSUUCP and UUPC. In 1986, RFC 977 provided the Network News Transfer Protocol (NNTP) specification for distribution of Usenet articles over TCP/IP as a more flexible alternative to informal Internet transfers of UUCP traffic. Since the Internet boom of the 1990s, almost all Usenet distribution is over NNTP.[43]

Software

Early versions of Usenet used Duke's A News software, designed for one or two articles a day. Matt Glickman and Horton at Berkeley produced an improved version called B News that could handle the rising traffic (about 50 articles a day as of late 1983).[5] With a message format that offered compatibility with Internet mail and improved performance, it became the dominant server software. C News, developed by Geoff Collyer and Henry Spencer at the University of Toronto, was comparable to B News in features but offered considerably faster processing. In the early 1990s, InterNetNews by Rich Salz was developed to take advantage of the continuous message flow made possible by NNTP versus the batched store-and-forward design of UUCP. Since that time INN development has continued, and other news server software has also been developed.[44]

Public venue

Usenet was the first Internet community and the place for many of the most important public developments in the pre-commercial Internet. It was the place where Tim Berners-Lee announced the launch of the World Wide Web,[45] where Linus Torvalds announced the Linux project,[46] and where Marc Andreessen announced the creation of the Mosaic browser and the introduction of the image tag,[47] which revolutionized the World Wide Web by turning it into a graphical medium.

Internet jargon and history

Many jargon terms now in common use on the Internet originated or were popularized on Usenet.[48] Likewise, many conflicts which later spread to the rest of the Internet, such as the ongoing difficulties over spamming, began on Usenet.[49]

"Usenet is like a herd of performing elephants with diarrhea. Massive, difficult to redirect, awe-inspiring, entertaining, and a source of mind-boggling amounts of excrement when you least expect it."
Gene Spafford, 1992

Decline

Sascha Segan of PC Magazine said in 2008 that "Usenet has been dying for years".[50] Segan said that some people pointed to the Eternal September in 1993 as the beginning of Usenet's decline. Segan believes that when pornographers and software crackers began putting large (non-text) files on Usenet by the late 1990s, Usenet disk space and traffic increased correspondingly. Internet service providers questioned why they needed to host space for pornography and unauthorized software. When the State of New York opened an investigation on child pornographers who used Usenet, many ISPs dropped all Usenet access or access to the alt.* hierarchy.[51]

In response, John Biggs of TechCrunch said "As long as there are folks who think a command line is better than a mouse, the original text-only social network will live on".[52]

AOL discontinued Usenet access in 2005. In May 2010, Duke University, whose implementation had kicked off Usenet more than 30 years earlier, decommissioned its Usenet server, citing low usage and rising costs.[53][54] After 32 years, the Usenet news service link at the University of North Carolina at Chapel Hill (news.unc.edu) was retired on February 4, 2011.

Usenet traffic changes

Over time, the amount of Usenet traffic has steadily increased. As of 2010 the number of all text posts made in all Big-8 newsgroups averaged 1,800 new messages every hour, with an average of 25,000 messages per day.[55] However, these averages are minuscule in comparison to the traffic in the binary groups.[56] Much of this traffic increase reflects not an increase in discrete users or newsgroup discussions, but instead the combination of massive automated spamming and an increase in the use of .binaries newsgroups[55] in which large files are often posted publicly. A small sampling of the change (measured in feed size per day) follows:

Daily VolumeDaily PostsDateSource
4.5 GiB 1996 Dec Altopia.com
9 GiB 1997 Jul Altopia.com
12 GiB 554 k 1998 Jan Altopia.com
26 GiB 609 k 1999 Jan Altopia.com
82 GiB 858 k 2000 Jan Altopia.com
181 GiB 1.24 M 2001 Jan Altopia.com
257 GiB 1.48 M 2002 Jan Altopia.com
492 GiB 2.09 M 2003 Jan Altopia.com
969 GiB 3.30 M 2004 Jan Altopia.com
1.30 TB 2004-09-30 Octanews.net
1.38 TB 2004-12-31 Octanews.net
1.52 TiB 5.09 M 2005 Jan Altopia.com
1.34 TB 2005-01-01 Octanews.net
1.30 TB 2005-01-01 Newsreader.com
1.81 TB 2005-02-28 Octanews.net
1.87 TB 2005-03-08 Newsreader.com
2.00 TB 2005-03-11 Various sources
2.27 TiB 7.54 M 2006 Jan Altopia.com
2.95 TiB 9.84 M 2007 Jan Altopia.com
3.07 TiB 10.13 M 2008 Jan Altopia.com
3.80 TB 2008-04-16 Newsdemon.com
4.60 TB 2008-11-01 Giganews.com
4.65 TiB 14.64 M 2009 Jan Altopia.com
6.00 TB 2009 Dec Newsdemon.com
5.42 TiB 15.66 M 2010 Jan Altopia.com
8.00 TB 2010 Sep Newsdemon.com
7.52 TiB 20.12 M 2011 Jan Altopia.com
8.25 TB 2011 Oct Thecubenet.com
9.29 TiB 23.91 M 2012 Jan Altopia.com
11.49 TiB 28.14 M 2013 Jan Altopia.com
14.61 TiB 37.56 M 2014 Jan Altopia.com
15.50 TB 2014 Feb Newsdemon.com
17.50 TB 2015 Jan Newsdemon.com
17.87 TiB 44.19 M 2015 Jan Altopia.com
23.50 TB 2015 Nov Newsdemon.com
23.87 TiB 55.59 M 2016 Jan Altopia.com
27.80 TiB 64.55 M 2017 Jan Altopia.com

In 2008, Verizon Communications, Time Warner Cable and Sprint Nextel signed an agreement with Attorney General of New York Andrew Cuomo to shut down access to sources of child pornography.[57] Time Warner Cable stopped offering access to Usenet. Verizon reduced its access to the "Big 8" hierarchies. Sprint stopped access to the alt.* hierarchies. AT&T stopped access to the alt.binaries.* hierarchies. Cuomo never specifically named Usenet in his anti-child pornography campaign. David DeJean of PC World said that some worry that the ISPs used Cuomo's campaign as an excuse to end portions of Usenet access, as it is costly for the Internet service providers and not in high demand by customers. In 2008 AOL, which no longer offered Usenet access, and the four providers that responded to the Cuomo campaign were the five largest Internet service providers in the United States; they had more than 50% of the U.S. ISP marketshare.[58] On June 8, 2009, AT&T announced that it would no longer provide access to the Usenet service as of July 15, 2009.[59]

AOL announced that it would discontinue its integrated Usenet service in early 2005, citing the growing popularity of weblogs, chat forums and on-line conferencing.[60] The AOL community had a tremendous role in popularizing Usenet some 11 years earlier.[61]

In August 2009, Verizon announced that it would discontinue access to Usenet on September 30, 2009.[62][63] JANET(UK) announced it will discontinue Usenet service, effective July 31, 2010, citing Google Groups as an alternative.[64] Microsoft announced that it would discontinue support for its public newsgroups (msnews.microsoft.com) from June 1, 2010, offering web forums as an alternative.[65]

Primary reasons cited for the discontinuance of Usenet service by general ISPs include the decline in volume of actual readers due to competition from blogs, along with cost and liability concerns of increasing proportion of traffic devoted to file-sharing and spam on unused or discontinued groups.[66][67]

Some ISPs did not include pressure from Attorney General of New York Andrew Cuomo's aggressive campaign against child pornography as one of their reasons for dropping Usenet feeds as part of their services.[68] ISPs Cox and Atlantic Communications resisted the 2008 trend but both did eventually drop their respective Usenet feeds in 2010.[69][70][71]

Archives

Public archives of Usenet articles have existed since the early days of Usenet, such as the system created by Kenneth Almquist in late 1982.[72][73] Distributed archiving of Usenet posts was suggested in November 1982 by Scott Orshan, who proposed that "Every site should keep all the articles it posted, forever."[74] Also in November of that year, Rick Adams responded to a post asking "Has anyone archived netnews, or does anyone plan to?"[75] by stating that he was, "afraid to admit it, but I started archiving most 'useful' newsgroups as of September 18."[76] In June 1982, Gregory G. Woodbury proposed an "automatic access to archives" system that consisted of "automatic answering of fixed-format messages to a special mail recipient on specified machines."[77]

In 1985, two news archiving systems and one RFC were posted to the Internet. The first system, called keepnews, by Mark M. Swenson of The University of Arizona, was described as "a program that attempts to provide a sane way of extracting and keeping information that comes over Usenet." The main advantage of this system was to allow users to mark articles as worthwhile to retain.[78] The second system, YA News Archiver by Chuq Von Rospach, was similar to keepnews, but was "designed to work with much larger archives where the wonderful quadratic search time feature of the Unix ... becomes a real problem."[79] Von Rospach in early 1985 posted a detailed RFC for "archiving and accessing usenet articles with keyword lookup." This RFC described a program that could "generate and maintain an archive of Usenet articles and allow looking up articles based on the article-id, subject lines, or keywords pulled out of the article itself." Also included was C code for the internal data structure of the system.[80]

The desire to have a fulltext search index of archived news articles is not new either, one such request having been made in April 1991 by Alex Martelli who sought to "build some sort of keyword index for [the news archive]."[81] In early May, Mr. Martelli posted a summary of his responses to Usenet, noting that the "most popular suggestion award must definitely go to 'lq-text' package, by Liam Quin, recently posted in alt.sources."[82]

The huge site http://asstr.org archives and indexes erotic and pornographic stories posted to the Usenet group alt.sex.stories.

Today, the archiving of Usenet has led to a fear of loss of privacy.[83] An archive simplifies ways to profile people. This has partly been countered with the introduction of the X-No-Archive: Yes header, which is itself controversial.[84]

Archives by Google Groups and DejaNews

Web-based archiving of Usenet posts began in 1995 at Deja News with a very large, searchable database. In 2001, this database was acquired by Google.[85]

Google Groups hosts an archive of Usenet posts dating back to May 1981. The earliest posts, which date from May 1981 to June 1991, were donated to Google by the University of Western Ontario with the help of David Wiseman and others,[86] and were originally archived by Henry Spencer at the University of Toronto's Zoology department.[87] The archives for late 1991 through early 1995 were provided by Kent Landfield from the NetNews CD series[88] and Jürgen Christoffel from GMD.[89] The archive of posts from March 1995 onward was started by the company DejaNews (later Deja), which was purchased by Google in February 2001. Google began archiving Usenet posts for itself starting in the second week of August 2000.

Google has been criticized by Vice and Wired contributors as well as former employees for its stewardship of the archive and for breaking its search functionality.[90][91][92]

See also

References

  1. 1 2 From Usenet to CoWebs: interacting with social information spaces, Christopher Lueg, Danyel Fisher, Springer (2003), ISBN 1-85233-532-7, ISBN 978-1-85233-532-8
  2. The jargon file v4.4.7, Jargon File Archive.
  3. Chapter 3 - The Social Forces Behind The Development of Usenet, Netizens Netbook by Ronda Hauben and Michael Hauben.
  4. "USENET Newsgroup Terms – SPAM". Archived from the original on 2012-09-15.
  5. 1 2 3 4 5 Emerson, Sandra L. (October 1983). "Usenet / A Bulletin Board for Unix Users". BYTE. pp. 219–236. Retrieved 31 January 2015.
  6. "Invitation to a General Access UNIX Network", James Ellis and Tom Truscott, in First Official Announcement of USENET, NewsDemon (K&L Technologies, Inc), 1979
  7. Lehnert, Wendy G.; Kopec, Richard (2007). Web 101. Addison Wesley. p. 291. ISBN 9780321424679
  8. "Store And Forward Communication: UUCP and FidoNet". Archived from the original on 2012-06-30.. Carnegie Mellon School of Computer Science.
  9. Kozierok, Charles M. (2005). The TCP/IP guide: a comprehensive, illustrated Internet protocols reference. No Starch Press. p. 1401. ISBN 978-159327-047-6
  10. "Open Directory Usenet Clients". Dmoz.org. October 9, 2008. Archived from the original on 2012-07-30. Retrieved December 14, 2010.
  11. Jain, Dominik (July 30, 2006). "OE-QuoteFix Description". Archived from the original on 2012-09-21. Retrieved June 4, 2007.
  12. "Improve-Usenet". October 13, 2008. Archived from the original on 2012-07-13.
  13. "Improve-Usenet Comments". October 13, 2008. Archived from the original on April 26, 2008. Retrieved June 29, 2009.
  14. "Google Groups". Groups.google.com. Archived from the original on 2012-05-25. Retrieved December 14, 2010.
  15. "News: links to Google Groups". Archived from the original on 2012-07-12.
  16. "Who can force the moderators to obey the group charter?". Big-8.org. Archived from the original on 2012-08-04. Retrieved December 14, 2010.
  17. "How does a group change moderators?". Big-8.org. Archived from the original on 2012-07-19. Retrieved December 14, 2010.
  18. "Early Usenet Newsgroup Hierarchies". Livinginternet.com. October 25, 1990. Archived from the original on 2012-09-21. Retrieved December 14, 2010.
  19. "How to Create a New Big-8 Newsgroup". Big-8.org. July 7, 2010. Archived from the original on 2012-07-22. Retrieved December 14, 2010.
  20. "The Social Machine: Designs for Living Online". Today, Usenet still exists, but it is an unsociable morass of spam, porn, and pirated software
  21. "Unraveling the Internet’s oldest and weirdest mystery". Groups filled with spam, massive fights took place against spammers and over what to do about the spam. People stopped using their email addresses in messages to avoid harvesting. People left the net.
  22. "The American Way of Spam". ...many of the newsgroups have since been overrun with junk messages.
  23. Microsoft Responds to the Evolution of Communities, Announcement, undated. "Microsoft hitting 'unsubscribe' on newsgroups". Archived from the original on 2012-07-12., CNET, May 4, 2010.
  24. "Usenet storage is more than 33 petabytes (33000 terabytes)". binsearch.info. Archived from the original on 2012-09-21. Retrieved October 7, 2016.
  25. "Giganews FAQ – How long are articles available?". Giganews.com. Archived from the original on 2012-09-04. Retrieved October 23, 2012.
  26. "9 petabyte of usenet storage on giganews.com". giganews.com. Archived from the original on 2012-09-21. Retrieved February 14, 2012.
  27. "usenet backup (uBackup)". Wikihow.com. Archived from the original on 2012-09-18. Retrieved February 14, 2012.
  28. "Digital Millenium Copyright Act". Archived from the original on 2012-09-10.
  29. "Cancel Messages FAQ". Archived from the original on February 15, 2008. Retrieved June 29, 2009. ...Until authenticated cancels catch on, there are no options to avoid forged cancels and allow unforged ones...
  30. Microsoft knowledgebase article stating that many servers ignore cancel messages "Support.microsoft.com". Archived from the original on 2012-07-19.
  31. "Microsoft Word - Surmacz.doc" (PDF). Retrieved December 14, 2010.
  32. ...every part of a Usenet post may be forged apart from the left most portion of the "Path:" header... "By-users.co.uk". Archived from the original on 2012-07-23.
  33. "Better living through forgery". Newsgroup: news.admin.misc. 1995-06-10. Usenet: StUPidfuk01@uunet.uu.net. Archived from the original on 2012-07-24. Retrieved December 5, 2014.
  34. "Giganews Privacy Policy". Giganews.com. Archived from the original on 2012-07-31. Retrieved December 14, 2010.
  35. "Logging Policy". Aioe.org. June 9, 2005. Archived from the original on 2012-07-08. Retrieved December 14, 2010.
  36. "Quux.org". Archived from the original on 2012-07-14. Retrieved December 14, 2010.
  37. LaQuey, Tracy (1990). The User's directory of computer networks. Digital Press. p. 386. ISBN 978-1555580476
  38. "And So It Begins".
  39. "History of the Internet, Chapter Three: History of Electronic Mail".
  40. Hauben, Michael and Hauben, Ronda. "Netizens: On the History and Impact of Usenet and the Internet, On the Early Days of Usenet: The Roots of the Cooperative Online Culture". First Monday vol. 3 num.August 8, 3 1998
  41. Haddadi, H. (2006). "Network Traffic Inference Using Sampled Statistics". University College London.
  42. Horton, Mark (December 11, 1990). "Arachnet". Archived from the original on 2012-09-21. Retrieved 4 June 2007.
  43. Huston, Geoff (1999). ISP survival guide: strategies for running a competitive ISP. Wiley. p. 439.
  44. "Unix/Linux news servers". Newsreaders.com. Archived from the original on 2012-09-05. Retrieved December 14, 2010.
  45. Tim Berners-Lee (August 6, 1991). "WorldWideWeb: Summary". Newsgroup: alt.hypertext. Usenet: 6487@cernvax.cern.ch. Retrieved June 4, 2007.
  46. Torvalds, Linus. "What would you like to see most in minix?". Newsgroup: comp.os.minix. Usenet: 1991Aug25.205708.9541@klaava.Helsinki.FI. Retrieved September 9, 2006.
  47. Marc Andreessen (March 15, 1993). "NCSA Mosaic for X 0.10 available.". Newsgroup: comp.infosystems.wais, comp.infosystems, alt.hypertext, comp.windows.x comp.infosystems.gopher, comp.infosystems.wais, comp.infosystems, alt.hypertext, comp.windows.x Check |newsgroup= value (help). Usenet: MARCA.93Mar14225600@wintermute.ncsa.uiuc.edu. Retrieved 4 June 2007.
  48. Kaltenbach, Susan (December 2000). "The Evolution of the Online Discourse Community" (PDF). Verb Doubling: Doubling a verb may change its semantics, Soundalike Slang: Punning jargon, The -P convention: A LISPy way to form questions, Overgeneralization: Standard abuses of grammar, Spoken Inarticulations: Sighing and <*sigh*&rt;ing, Anthropomorphization: online components were named "Homunculi," daemons," etc., and there were also "confused" programs. Comparatives: Standard comparatives for design quality
  49. Campbell, K. K. (October 1, 1994). "Chatting With Martha Siegel of the Internet's Infamous Canter & Siegel". Electronic Frontier Foundation. Archived from the original on November 25, 2007. Retrieved September 24, 2010.
  50. Segan, Sascha (July 31, 2008). "R.I.P Usenet: 1980-2008". PC Magazine. Retrieved July 21, 2017.
  51. Sascha Segan (July 31, 2008). "R.I.P Usenet: 1980–2008". PC Magazine. p. 2. Archived from the original on 2012-09-09. Retrieved May 8, 2011.
  52. ""Reports of Usenet's Death Are Greatly Exaggerated". Archived from the original on 2012-07-16.." TechCrunch. August 1, 2008. Retrieved on May 8, 2011.
  53. Cara Bonnett (May 17, 2010). "A Piece of Internet History". Duke Today. Archived from the original on 2012-07-11. Retrieved May 24, 2010.
  54. Andrew Orlowski (May 20, 2010). "Usenet's home shuts down today". The Register. Archived from the original on 2012-09-21. Retrieved May 24, 2010.
  55. 1 2 "Top 100 text newsgroups by postings". NewsAdmin. Archived from the original on 2006-10-16. Retrieved December 14, 2010.
  56. "Top 100 binary newsgroups by postings". NewsAdmin. Archived from the original on 2006-10-16. Retrieved December 14, 2010.
  57. Rosencrance, Lisa. "3 top ISPs to block access to sources of child porn". Archived from the original on 2012-07-22.. Computer World. June 8, 2008. Retrieved on April 30, 2009.
  58. DeJean, David. "Usenet: Not Dead Yet." PC World. Tuesday October 7, 2008. "2". Archived from the original on 2012-09-21.. Retrieved on April 30, 2009.
  59. "ATT Announces Discontinuation of USENET Newsgroup Services". NewsDemon. June 9, 2009. Archived from the original on 2012-09-21. Retrieved June 18, 2009.
  60. Hu, Jim. ""AOL shutting down newsgroups". Archived from the original on 2012-07-23.." CNet. January 25, 2005. Retrieved on May 1, 2009.
  61. "AOL Pulls Plug on Newsgroup Service". Betanews.com. Archived from the original on 2012-07-22. Retrieved December 14, 2010.
  62. Bode, Karl. "Verizon To Discontinue Newsgroups September 30". Archived from the original on 2012-07-31.. DSLReports. August 31, 2009. Retrieved on October 24, 2009.
  63. ""Verizon Newsgroup Service Has Been Discontinued". Archived from the original on 2012-09-21." Verizon Central Support. Retrieved on October 24, 2009.
  64. Ukerna.ac.uk
  65. "Microsoft Responds to the Evolution of Communities". microsoft.com. Archived from the original on 2003-06-22. Retrieved September 1, 2011.
  66. "AOL shutting down newsgroups". cnet.com/. January 25, 2005. Archived from the original on 2008-08-29. Retrieved September 1, 2011.
  67. "Verizon To Discontinue Newsgroups". dslreports.com. August 31, 2009. Archived from the original on 2012-03-06. Retrieved September 1, 2011.
  68. "The Comcast Newsgroups Service Discontinued". dslreports.com. September 16, 2008. Archived from the original on December 6, 2014. Retrieved December 5, 2014.
  69. "Cox to Drop Free Usenet Service June 30th". Zeropaid.com. April 22, 2010. Archived from the original on 2012-09-21. Retrieved September 3, 2011.
  70. "Cox Discontinues Usenet, Starting In June". Geeknet, Inc. April 21, 2010. Archived from the original on 2012-09-21. Retrieved September 1, 2011.
  71. "Cox Communications and Atlantic Broadband Discontinue Usenet Access". thundernews.com. April 27, 2010. Archived from the original on 2012-09-12. Retrieved September 1, 2011.
  72. "How to obtain back news items". Archived from the original on 2012-07-10. Retrieved December 14, 2010.
  73. "How to obtain back news items (second posting)". Newsgroup: net.general. December 21, 1982. Retrieved December 5, 2014. message-id:bnews.spanky.138
  74. "Distributed archiving of netnews". Archived from the original on 2012-07-08. Retrieved December 14, 2010.
  75. "Archive of netnews". Archived from the original on 2012-07-24. Retrieved December 14, 2010.
  76. "Re: Archive of netnews". Archived from the original on 2012-07-15. Retrieved December 14, 2010.
  77. "Automatic access to archives". Archived from the original on 2012-07-12. Retrieved December 14, 2010.
  78. "keepnews – A Usenet news archival system". Archived from the original on 2012-07-17. Retrieved December 14, 2010.
  79. "YA News Archiver". Archived from the original on 2012-07-09. Retrieved December 14, 2010.
  80. "RFC usenet article archive program with keyword lookup". Archived from the original on 2012-07-15. Retrieved December 14, 2010.
  81. "Looking for fulltext indexing software for archived news". Archived from the original on 2012-09-21. Retrieved December 14, 2010.
  82. "Summary: search for fulltext indexing software for archived news". Archived from the original on 2012-07-08. Retrieved December 14, 2010.
  83. Segan, Sascha (January 1, 1970). "R.I.P Usenet: 1980–2008 – Usenet's Decline – Columns by PC Magazine". Pcmag.com. Archived from the original on 2012-09-09. Retrieved December 14, 2010.
  84. Strawbridge, Matthew (2006). Netiquette: Internet Etiquette in the Age of the Blog. Software Reference. p. 53. ISBN 978-0955461408
  85. Cullen, Drew (February 12, 2001). "Google saves Deja.com Usenet service". Archived from the original on 2012-09-21.. The Register.
  86. Wiseman, David. "Magi's NetNews Archive Involvement", csd.uwo.ca.
  87. Mieszkowski, Katharine. ""The Geeks Who Saved Usenet". Archived from the original on 2012-07-10.", archive.salon.com (January 7, 2002).
  88. Feldman, Ian. "Usenet on a CD-ROM, no longer a fable". Archived from the original on 2012-07-07., "TidBITS" (February 10, 1992)
  89. "Google Groups Archive Information". Archived from the original on 2012-07-09. (December 21, 2001)
  90. Poulsen, Kevin. "Google’s Abandoned Library of 700 Million Titles". Wired.
  91. Braga, Matthew. "Google, a Search Company, Has Made Its Internet Archive Impossible to Search". Motherboard.
  92. Edwards, Douglas (2011). I'm Feeling Lucky: The Confessions of Google Employee Number 59. Houghton Mifflin Harcourt. pp. 209–213. ISBN 978-0-547-41699-1.

Further reading

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.