JavaScript - Domparse

DOM parsing script

This is a Version 5 script.

The attributes are not shown by Ice Browser.

Explorer 5 on Mac has a bug in numbering the LI's in the nodemap: every OL starts with the number following the number of the previous LI, regardless of in which OL this LI is situated.

The question for 2003: What shall we do with the W3C DOM?

This developer script prints out (part of the) document structure for you. This is particularly useful when you're writing your own scripts to generate parts of an HTML document and something goes wrong.

When you're writing your own scripts to generate HTML and something goes wrong (nothing shows up, for example) and you desparately want to see the HTML code your script has generated, you can use the script below. In this situation 'View Source' doesn't help because it only shows the original HTML. This script, however, views the source for you.

First some words about text nodes between HTML tags, followed by an example of the nodemap and then on to the actual script.

Text nodes

When I tested this script, I discovered that the output of Netscape and Explorer (surprise!) don't quite match. Netscape and Explorer 5 on Mac have a lot of empty text nodes wherever there's whitespace between a closing and the next opening tag.

For instance, take this bit of HTML:

<H2>The title</H2>

<P>The first paragraph</P>

What about the space between the </H2> and the <P>?

Explorer 5 on Windows completely disregards the space between the tags and does not consider it a text node.
Explorer 5 on Mac considers it a text node which always has a length of 1, regardless of the actual whitespace. The content is one space.
Netscape 6 (M17) considers each line break a character, so that in the example above the length of the text node is 2. The content consists of \n's.

Only when you place the tags like

<H2>The title</H2><P>The first paragraph</P>

the text node between the tags disappears.

Another way to make the text node disappear is not closing your tag. If you do

<P>The first paragraph

<P>The second paragraph</P>

there's no more text node between the paragraphs.

To the script below I added a routine that hides empty text nodes from the nodemap.

Example

Even with the empty text node problem solved, there are plenty of incompatibilities in the document structure, the strangest of which is that Explorer refuses to print the values of the form fields. Anyway, load this page in Explorer 5 on Windows, Explorer 5 on Mac and Netscape 6 (any platform), view the nodemap of thirdtest (the form) and have fun puzzling out the differences.

Below you see the form that rules the script. Fill in the ID of the element where the script should start, check whatever you want to check and press the button. For testing purposes, I gave several elements in this page an ID.
You can fill in id firsttest to view the nodemap of this paragraph, id secondtest for the nodemap of special DIV that contains the document up to this paragraph and id thirdtest for the nodemap of the P containing the form.
If you don't enter anything or the ID doesn't exist, the script takes the document as root.

The nodemap

The script

How to use the script

Copy the script into the head of the page. Copy the FORM and the DIV to wherever you want. Then use the form to generate the nodemap inside the DIV.

First, assign a readroot. This is an element with an ID of your choice. Fill in the ID in the text field and use the form. Now you get a map of the node and you can (hopefully) find out what goes wrong where.

Each node is inside a <SPAN>. Text nodes get CLASS="text" and attributes get CLASS="attr" so you can improve the output by writing a style sheet for the two (or copying mine).