[PyKDE] Creating a DOM from HTML source with PyKDE

Alan Ezust alan.ezust at gmail.com
Fri Jul 14 21:00:17 BST 2006


On 6/7/06, Robin Haswell <rob at digital-crocus.com> wrote:
> Hey there
>
> I'm trying to use PyKDE to grab a DOM from HTML source for a
> privoxy-like project. I was wondering if anyone could get me started on
> this? I'm using PyKDE because I'm struggling to find an HTML parser I
> can use which won't choke on the majority of HTML out there.
> BeautifulSoup is unsuitable and I shudder at building PyXPCOM. Any help
> would be very appreciated :-)

Have you tried xml.parsers.sgmlop? It's a newer parser library that
might also work for your purposes. It provides an SGML parser and an
XML parser that support the same interface.
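
To give a rough idea of what an event-driven parser interface looks
like when you want a tree out of messy HTML, here is a minimal sketch.
It does not use sgmlop. It uses the standard library's
html.parser.HTMLParser as a stand-in, since that follows a similar
handler-callback style. The names Node, TreeBuilder, parse_html and the
VOID set are hypothetical, made up for this illustration, and the tree
is a plain Python structure, not a real W3C DOM.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag (a partial list, for illustration).
VOID = {"br", "hr", "img", "input", "meta", "link"}


class Node:
    """Hypothetical minimal tree node; not a W3C DOM node."""

    def __init__(self, tag, attrs=None, parent=None):
        self.tag = tag
        self.attrs = dict(attrs or [])
        self.parent = parent
        self.children = []
        self.text = ""


class TreeBuilder(HTMLParser):
    """Builds a Node tree from parser events, tolerating sloppy markup."""

    def __init__(self):
        super().__init__()
        self.root = Node("#document")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs, self.current)
        self.current.children.append(node)
        if tag not in VOID:
            # Descend only into elements that can hold content.
            self.current = node

    def handle_endtag(self, tag):
        # Walk up to the nearest open element with this tag name,
        # implicitly closing anything left open inside it.
        node = self.current
        while node is not self.root and node.tag != tag:
            node = node.parent
        if node is not self.root:
            self.current = node.parent
        # Otherwise it is a stray closing tag, so ignore it.

    def handle_data(self, data):
        self.current.text += data


def parse_html(source):
    builder = TreeBuilder()
    builder.feed(source)
    builder.close()  # flush any buffered trailing text
    return builder.root
```

Calling parse_html() on a string returns the root node, and unclosed or
stray tags are absorbed rather than raising an error. Real-world HTML
needs many more recovery rules than this, so treat it as a starting
point only.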



