Because PureHTML parses HTML internally, it can keep track of line breaks
and white space. Using this information, PureHTML produces a text layout
that is highly accessible. Where possible, PureHTML also uses the 'Alt' and
'Title' attributes to provide alternative descriptions for images and other
multimedia elements.
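
The idea can be sketched in a few lines of Python. This is not PureHTML's actual code: the class `TextOnlyParser`, the function `html_to_text`, and the `BLOCK_TAGS` set are hypothetical names chosen for illustration, and the sketch assumes that block-level tags become line breaks and that an image is replaced by its 'Alt' text, then its 'Title' text, then a generic marker.

```python
import re
from html.parser import HTMLParser

# Tags treated as line-breaking in this sketch (an assumption, not PureHTML's list).
BLOCK_TAGS = {"p", "div", "br", "li", "tr", "h1", "h2", "h3"}


class TextOnlyParser(HTMLParser):
    """Hypothetical converter: collects text, line breaks, and image descriptions."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "img":
            # Prefer 'alt', fall back to 'title', otherwise use a generic marker.
            label = attrs.get("alt") or attrs.get("title") or "image"
            self.parts.append(f" [{label}] ")
        elif tag in BLOCK_TAGS:
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag in BLOCK_TAGS:
            self.parts.append("\n")

    def handle_data(self, data):
        # Source newlines are not layout; only block tags create line breaks.
        self.parts.append(re.sub(r"\s+", " ", data))


def html_to_text(html):
    """Return a plain-text rendering with one line per block of content."""
    parser = TextOnlyParser()
    parser.feed(html)
    parser.close()
    raw = "".join(parser.parts)
    # Collapse repeated spaces on each line and drop empty lines.
    lines = (" ".join(line.split()) for line in raw.split("\n"))
    return "\n".join(line for line in lines if line)


if __name__ == "__main__":
    page = '<p>News <b>today</b></p><img src="a.gif" alt="BBC logo">'
    print(html_to_text(page))
```

Because the text is produced from parsed tags rather than by pattern matching on the raw markup, line breaks follow the page structure and every image leaves some description behind.
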
To illustrate the advances, we set up PureHTML and compared it to BETSIE.
BETSIE has been a popular HTML-to-text converter, but it suffers from
problems caused by poor page-parsing methods. We have taken screenshots
that illustrate PureHTML's intelligent parsing.
| bbc.co.uk image parsed with BETSIE | bbc.co.uk image parsed with PureHTML |