# Specification file for the subset of HTML to be permitted in RFCs (DRAFT)
#
# $Id: permitted,v 1.2 1997/07/28 14:34:41 esr Exp $
#
# The theory is that rendering the HTML to ASCII via lynx -dump has to
# not lose information.
#
# To generate this set, we started with HTML 3.2. First, we threw out every
# form of interactive markup (applets, forms, isindex, etc). Then we tossed
# out IMG and AREA tags, so that every RFC will remain just one document
# without a requirement for auxilliary images to be hauled around. Then we
# chucked the deprecated tags LISTING and XMP.
#
# Then we threw out the physical style tags B, I, TT, U, FONT, BIG, SMALL,
# and STRIKE to enforce strctural markup.
#
# The BODY attributes ALINK BGCOLOR LINK TEXT VLINK that affect HTML
# presentation only over the whole document are theoretically OK, but
# we exclude them so HTML renderings will have an unfussy, uniform look.
#
# We have retained all the structure tags CITE, CODE, DFN, EM, KBD, SAMP,
# STRONG and VAR in spite of the fact that lynx -dump does not render them
# all as distinct (CITE, EM, and STRONG are rendered with underline; CODE, DFN,
# KBD, SAMP, and VAR have no effects). This is to assist in Postscript
# rendering. Authors must be warned that changes between indistinguishable
# fonts may not be used to convey information.
#
# Table markup ought to be permitted, and will be as soon as lynx or any
# other HTML-to-text converter is able to handle tables. (Hmmm...it's
# been pointed out that tables are problematic for non-visual rendering.)
#
# Allow the following HTML tags and attributes. These are a subset of 3.2:
#
A HREF NAME REL REV TITLE
ADDRESS
BLOCKQUOTE
BODY
BR CLEAR
CENTER
CITE # Lynx renders as underline
CODE # No effect under Lynx
DD
DFN # No effect under Lynx
DIR
DIV ALIGN
DL COMPACT
DT
EM # Lynx renders as underline
H1 ALIGN
H2 ALIGN
H3 ALIGN
H4 ALIGN
H5 ALIGN
H6 ALIGN
HEAD
HR ALIGN NOSHADE WIDTH
HTML VERSION
KBD # No effect under Lynx
LI
LINK HREF REL REV TITLE
META CONTENT CHARSET NAME # CHARSET?
OL COMPACT START TYPE
P ALIGN
PLAINTEXT
PRE WIDTH
SAMP # No effect under Lynx
STRONG # Lynx renders as underline
TITLE
UL COMPACT STYLE
VAR # No effect under Lynx
#
# The caption/table/td/th/tr elements are a problem. In theory they
# should be fine, but in practice I don't know of any translator that
# can compile them to text
#
#CAPTION ALIGN VALIGN
#TABLE ALIGN BORDER CELLPADDING CELLSPACING WIDTH
#TD ALIGN COLSPAN HEIGHT NOWRAP ROWSPAN VALIGN WIDTH
#TH ALIGN COLSPAN HEIGHT NOWRAP ROWSPAN VALIGN WIDTH
#TR ALIGN VALIGN
#
# Allow HTML 3.2 entities that have printable rendering in 7-bit ASCII
#
" # quotation mark
& # ampersand
< # less than
> # greater than
# only proposed in 3.2, not officially standard