Click to See Complete Forum and Search --> : html character tags (I think)


David Harrison
01-10-2003, 05:01 AM
I know that &lt; is <
and &gt; is >
but how would I found out the rest

Charles
01-10-2003, 05:13 AM
You need to know the base 10 number of the character in the character set that you are using. Then you would use &amp#160; for a non-breaking space, for example. However, the HTML 4.01 DTD creates three sets of mnemonic entities for characters in the iso-8859-1 set. To use them you need to use a correct and full 4.01 DOCTYPE and declare that you are using that character set:

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
"http://www.w3.org/TR/html4/strict.dtd">
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">

You can find the list of mnemonic character entities at http://www.w3.org/TR/html4/sgml/entities.html.

Robert Wellock
01-10-2003, 05:14 AM
You could look up the HTML Character Entity References or visit a resource like Unicode: http://www.unicode.org/

Rick Bull
01-10-2003, 08:45 AM
I have a list of all the entities in the XHTML 1.0 DTD at http://www.rickbull.co.uk/tutorials/HTML/Useful%20Information/entities.php

jeffmott
01-10-2003, 03:14 PM
Charles
You need to know the base 10 number of the character in the character set that you are using...mnemonic entities for characters in the iso-8859-1 set...and declare that you are using that character set
Actually, character references are independent of the specific document's encoding. There is a difference between the document character set and the character encoding.

HTML Document Representation
SGML requires that each application (including HTML) specify its document character set...HTML uses the much more complete character set called the Universal Character Set (UCS), defined in [ISO10646]...[ISO10646] is character-by-character equivalent to Unicode ([UNICODE])...authors may use SGML character references. Character references are a character encoding-independent mechanism for entering any character from the document character set.
Meaning you would need to know the numerical code position in UNICODE of the desired character, rather than the character set you are using.

Charles
01-11-2003, 03:00 PM
Originally posted by jeffmott
Actually, character references are independent of the specific document's encoding. There is a difference between the document character set and the character encoding.


Meaning you would need to know the numerical code position in UNICODE of the desired character, rather than the character set you are using. Having poured over the spec for a while, I stand corrected.

David Harrison
01-12-2003, 08:06 AM
Thanks to everyone for your interest, I now have what I need.