This is a primer for producing documents in HTML, the hypertext markup language used on the World Wide
Web. This guide is intended to be an introduction to using HTML and creating files for the Web. You should
also check your local bookstore; there are many volumes about the Web and HTML that could be useful.
This guide is copyright by the Board of Trustees of the University of Illinois. All rights reserved. The guide
may not be reproduced without written permission from the National Center for Supercomputing
Applications and it may not be sold.
Getting Started
Terms to Know
What Isn't Covered
HTML Version
HTML Documents
What an HTML Document Is
HTML Editors
Getting Your Files on a Server
Tags Explained
The Minimal HTML Document
A Teaching Tool
Markup Tags
HTML
HEAD
TITLE
BODY
Headings
Paragraphs
Lists
Preformatted Text
Extended Quotations
Addresses
Forced Line Breaks/Postal Addresses
Horizontal Rules
Character Formatting
Logical Versus Physical Styles
Escape Sequences
Linking
Relative Pathnames Versus Absolute Pathnames
URLs
Links to Specific Sections
Mailto
Inline Images
Image Size Attributes
Aligning Images
Alternate Text for Images
Background Graphics
Background Color
External Images, Sounds, and Animations
Tables
Table Tags
General Table Format
Tables for Nontabular Information
Fill-out Forms
Troubleshooting
Avoid Overlapping Tags
Embed Only Anchors and Character Tags
Do the Final Steps
Commenting Your Files
For More Information
Style Guides
Other Introductory Documents
Additional Online References
Thanks
Getting Started
Terms to Know
WWW World Wide Web
Web World Wide Web
SGML Standard Generalized Markup Language--a standard for describing markup languages
DTD Document Type Definition--this is the formal specification of a markup language, written using SGML
HTML HyperText Markup Language--HTML is an SGML DTD
In practical terms, HTML is a collection of platform-independent styles (indicated by markup tags) that
define the various components of a World Wide Web document. HTML was invented by Tim
Berners-Lee while at CERN, the European Laboratory for Particle Physics in Geneva.
What Isn't Covered
This primer assumes that you:
know how to use NCSA Mosaic or some other Web browser
have a general understanding of how Web servers and client browsers work
have access to a Web server (or that you want to produce HTML documents for personal use in
local-viewing mode)
HTML Version
This guide reflects the most current specification--HTML Version 2.0-- plus some additional features that have
been widely and consistently implemented in browsers. Future versions and new features for HTML are under
development.
HTML Documents
What an HTML Document Is
HTML documents are plain-text (also known as ASCII) files that can be created using any text editor (e.g.,
Emacs or vi on UNIX machines; BBEdit on a Macintosh; Notepad on a Windows machine). You can also use
word-processing software if you remember to save your document as "text only with line breaks."
HTML Editors
Some WYSIWYG editors are available (e.g., HotMetal, which is available for several platforms or Adobe
PageMill for Macintoshes). You may wish to try one of them after you learn some of the basics of HTML
tagging. It is useful to know enough HTML to code a document before you determine the usefulness of a
WYSIWYG editor.
If you haven't already selected your software, refer to an online listing of HTML editors (organized by
platform) to help you in your search for appropriate software.
Getting Your Files on a Server
If you have access to a Web server at school or work, contact your webmaster (the individual who maintains
the server) to see how you can get your files on the Web. If you do not have access to a server at work or
school, check to see if your community operates a FreeNet, a community-based network that provides free
access to the Internet. Lacking a FreeNet, you may need to contact a local Internet provider that will post your
files on a server for a fee. (Check your local newspaper for advertisements or with your Chamber of
Commerce for the names of companies.)
Tags Explained
An element is a fundamental component of the structure of a text document. Some examples of elements are
heads, tables, paragraphs, and lists. Think of it this way: you use HTML tags to mark the elements of a file for
your browser. Elements can contain plain text, other elements, or both.
To denote the various elements in an HTML document, you use tags. HTML tags consist of a left angle
bracket (<), a tag name, and a right angle bracket (>). Tags are usually paired (e.g., <H1> and </H1>) to start
and end the tag instruction. The end tag looks just like the start tag except a slash (/) precedes the text within
the brackets. HTML tags are listed below.
Some elements may include an attribute, which is additional information that is included inside the start tag.
For example, you can specify the alignment of images (top, middle, or bottom) by including the appropriate
attribute with the image source HTML code. Tags that have optional attributes are noted below.
NOTE: HTML is not case sensitive. <title> is equivalent to <TITLE> or <TiTlE>. There are a few
exceptions noted in Escape Sequences below.
Not all tags are supported by all World Wide Web browsers. If a browser does not support a tag, it (usually)
just ignores it.
The Minimal HTML Document
Every HTML document should contain certain standard HTML tags. Each document consists of head and
body text. The head contains the title, and the body contains the actual text that is made up of paragraphs, lists,
and other elements. Browsers expect specific information because they are programmed according to HTML
and SGML specifications.
Required elements are shown in this sample bare-bones document:
<html>
<head>
<TITLE>A Simple HTML Example</TITLE>
</head>
<body>
<H1>HTML is Easy To Learn</H1>
<P>Welcome to the world of HTML.
This is the first paragraph. While short it is
still a paragraph!</P>
<P>And this is the second paragraph.</P>
</body>
</html>
The required elements are the <html>, <head>, <title>, and <body> tags (and their corresponding end tags).
Because you should include these tags in each file, you might want to create a template file with them. (Some
browsers will format your HTML file correctly even if these tags are not included. But some browsers won't!
So make sure to include them.)
Click to see the formatted version of the example. A longer example is also available but you should read
through the rest of the guide before you take a look. This longer-example file contains tags explained in the
next section.
A Teaching Tool
To see a copy of the file that your browser reads to generate the information in your current window, select
View Source (or the equivalent) from the browser menu. The file contents, with all the HTML tags, are
displayed in a new window.
This is an excellent way to see how HTML is used and to learn tips and constructs. Of course, the HTML
might not be technically correct. Once you become familiar with HTML and check the many online and
hard-copy references on the subject, you will learn to distinguish between "good" and "bad" HTML.
Remember that you can save a source file with the HTML codes and use it as a template for one of your Web
pages or modify the format to suit your purposes.
Markup Tags
HTML
This element tells your browser that the file contains HTML-coded information. The file extension .html also
indicates this an HTML document and must be used. (If you are restricted to 8.3 filenames (e.g.,
LeeHome.htm, use only .htm for your extension.)
HEAD
The head element identifies the first part of your HTML-coded document that contains the title. The title is
shown as part of your browser's window (see below).
TITLE
The title element contains your document title and identifies its content in a global context. The title is displayed
somewhere on the browser window (usually at the top), but not within the text area. The title is also what is
displayed on someone's hotlist or bookmark list, so choose something descriptive, unique, and relatively
short. A title is also used during a WAIS search of a server.
For example, you might include a shortened title of a book along with the chapter contents: NCSA Mosaic
Guide (Windows): Installation. This tells the software name, the platform, and the chapter contents, which is
more useful than simply calling the document Installation. Generally you should keep your titles to 64
characters or fewer.
BODY
The second--and largest--part of your HTML document is the body, which contains the content of your
document (displayed within the text area of your browser window). The tags explained below are used within
the body of your HTML document.
Headings
HTML has six levels of headings, numbered 1 through 6, with 1 being the most prominent. Headings are
displayed in larger and/or bolder fonts than normal body text. The first heading in each document should be
tagged <H1>.
The syntax of the heading element is:
<Hy>Text of heading </Hy>
where y is a number between 1 and 6 specifying the level of the heading.
Do not skip levels of headings in your document. For example, don't start with a level-one heading (<H1>)
and then next use a level-three (<H3>) heading.
Paragraphs
Unlike documents in most word processors, carriage returns in HTML files aren't significant. So you don't
have to worry about how long your lines of text are (better to have them fewer than 72 characters long
though). Word wrapping can occur at any point in your source file, and multiple spaces are collapsed into a
single space by your browser.
In the bare-bones example shown in the Minimal HTML Document section, the first paragraph is coded as
<P>Welcome to the world of HTML.
This is the first paragraph.
While short it is
still a paragraph!</P>
In the source file there is a line break between the sentences. A Web browser ignores this line break and starts
a new paragraph only when it encounters another <P> tag.
Important: You must indicate paragraphs with <P> elements. A browser ignores any indentations or blank
lines in the source text. Without <P> elements, the document becomes one large paragraph. (One exception is
text tagged as "preformatted," which is explained below.) For example, the following would produce identical
output as the first bare-bones HTML example:
<H1>Level-one heading</H1>
Getting Started
Terms to Know
WWW World Wide Web
Web World Wide Web
SGML Standard Generalized Markup Language--a standard for describing markup languages
DTD Document Type Definition--this is the formal specification of a markup language, written using SGML
HTML HyperText Markup Language--HTML is an SGML DTD In practical terms, HTML is a collection of platform-independent styles (indicated by markup tags) that define the various components of a World Wide Web document. HTML was invented by Tim Berners-Lee while at CERN, the European Laboratory for Particle Physics in Geneva.
What Isn't Covered This primer assumes that you: know how to use NCSA Mosaic or some other Web browser have a general understanding of how Web servers and client browsers work have access to a Web server (or that you want to produce HTML documents for personal use in local-viewing mode)
HTML Version This guide reflects the most current specification--HTML Version 2.0-- plus some additional features that have been widely and consistently implemented in browsers. Future versions and new features for HTML are under development.
HTML Documents
What an HTML Document Is HTML documents are plain-text (also known as ASCII) files that can be created using any text editor (e.g., Emacs or vi on UNIX machines; BBEdit on a Macintosh; Notepad on a Windows machine). You can also use word-processing software if you remember to save your document as "text only with line breaks."
HTML Editors Some WYSIWYG editors are available (e.g., HotMetal, which is available for several platforms or Adobe PageMill for Macintoshes). You may wish to try one of them after you learn some of the basics of HTML tagging. It is useful to know enough HTML to code a document before you determine the usefulness of a WYSIWYG editor. If you haven't already selected your software, refer to an online listing of HTML editors (organized by platform) to help you in your search for appropriate software.
Getting Your Files on a Server If you have access to a Web server at school or work, contact your webmaster (the individual who maintains the server) to see how you can get your files on the Web. If you do not have access to a server at work or school, check to see if your community operates a FreeNet, a community-based network that provides free access to the Internet. Lacking a FreeNet, you may need to contact a local Internet provider that will post your files on a server for a fee. (Check your local newspaper for advertisements or with your Chamber of Commerce for the names of companies.)
Tags Explained An element is a fundamental component of the structure of a text document. Some examples of elements are heads, tables, paragraphs, and lists. Think of it this way: you use HTML tags to mark the elements of a file for your browser. Elements can contain plain text, other elements, or both. To denote the various elements in an HTML document, you use tags. HTML tags consist of a left angle bracket (<), a tag name, and a right angle bracket (>). Tags are usually paired (e.g., <H1> and </H1>) to start and end the tag instruction. The end tag looks just like the start tag except a slash (/) precedes the text within the brackets. HTML tags are listed below. Some elements may include an attribute, which is additional information that is included inside the start tag. For example, you can specify the alignment of images (top, middle, or bottom) by including the appropriate attribute with the image source HTML code. Tags that have optional attributes are noted below.
NOTE: HTML is not case sensitive. <title> is equivalent to <TITLE> or <TiTlE>. There are a few exceptions noted in Escape Sequences below. Not all tags are supported by all World Wide Web browsers. If a browser does not support a tag, it (usually) just ignores it.
The Minimal HTML Document Every HTML document should contain certain standard HTML tags. Each document consists of head and body text. The head contains the title, and the body contains the actual text that is made up of paragraphs, lists, and other elements. Browsers expect specific information because they are programmed according to HTML and SGML specifications. Required elements are shown in this sample bare-bones document:
<html>
<head>
<TITLE>A Simple HTML Example</TITLE>
</head>
<body>
<H1>HTML is Easy To Learn</H1>
<P>Welcome to the world of HTML.
This is the first paragraph. While short it is
still a paragraph!</P>
<P>And this is the second paragraph.</P>
</body>
</html> The required elements are the <html>, <head>, <title>, and <body> tags (and their corresponding end tags). Because you should include these tags in each file, you might want to create a template file with them. (Some browsers will format your HTML file correctly even if these tags are not included. But some browsers won't! So make sure to include them.) Click to see the formatted version of the example. A longer example is also available but you should read through the rest of the guide before you take a look. This longer-example file contains tags explained in the next section.
A Teaching Tool To see a copy of the file that your browser reads to generate the information in your current window, select View Source (or the equivalent) from the browser menu. The file contents, with all the HTML tags, are displayed in a new window. This is an excellent way to see how HTML is used and to learn tips and constructs. Of course, the HTML might not be technically correct. Once you become familiar with HTML and check the many online and hard-copy references on the subject, you will learn to distinguish between "good" and "bad" HTML. Remember that you can save a source file with the HTML codes and use it as a template for one of your Web pages or modify the format to suit your purposes.
Markup Tags
HTML This element tells your browser that the file contains HTML-coded information. The file extension .html also indicates this an HTML document and must be used. (If you are restricted to 8.3 filenames (e.g.,
LeeHome.htm, use only .htm for your extension.)
HEAD The head element identifies the first part of your HTML-coded document that contains the title. The title is shown as part of your browser's window (see below).
TITLE The title element contains your document title and identifies its content in a global context. The title is displayed somewhere on the browser window (usually at the top), but not within the text area. The title is also what is displayed on someone's hotlist or bookmark list, so choose something descriptive, unique, and relatively short. A title is also used during a WAIS search of a server. For example, you might include a shortened title of a book along with the chapter contents: NCSA Mosaic
Guide (Windows): Installation. This tells the software name, the platform, and the chapter contents, which is more useful than simply calling the document Installation. Generally you should keep your titles to 64 characters or fewer.
BODY The second--and largest--part of your HTML document is the body, which contains the content of your document (displayed within the text area of your browser window). The tags explained below are used within the body of your HTML document.
Headings HTML has six levels of headings, numbered 1 through 6, with 1 being the most prominent. Headings are displayed in larger and/or bolder fonts than normal body text. The first heading in each document should be tagged <H1>. The syntax of the heading element is:
<Hy>Text of heading </Hy> where y is a number between 1 and 6 specifying the level of the heading. Do not skip levels of headings in your document. For example, don't start with a level-one heading (<H1>) and then next use a level-three (<H3>) heading.
Paragraphs Unlike documents in most word processors, carriage returns in HTML files aren't significant. So you don't have to worry about how long your lines of text are (better to have them fewer than 72 characters long though). Word wrapping can occur at any point in your source file, and multiple spaces are collapsed into a single space by your browser. In the bare-bones example shown in the Minimal HTML Document section, the first paragraph is coded as
<P>Welcome to the world of HTML.
This is the first paragraph.
While short it is
still a paragraph!</P> In the source file there is a line break between the sentences. A Web browser ignores this line break and starts a new paragraph only when it encounters another <P> tag.
Important: You must indicate paragraphs with <P> elements. A browser ignores any indentations or blank lines in the source text. Without <P> elements, the document becomes one large paragraph. (One exception is text tagged as "preformatted," which is explained below.) For example, the following would produce identical output as the first bare-bones HTML example:
<H1>Level-one heading</H1>