PyXML died a neglectful death a ways back now. I failed to notice. And I was too much in denial to listen to the many many kind individuals who tried to tell me. We've been relying on PyXML for the python extensions. Now we need to replace it. I'm sure anyone who has ever worked with extensions will be happier when it is gone. The major requirements I can think of are:
- Unicode support - Excellent namespace handling - Excellent XPath support (more than just basic support, many XPath features are too useful to be without) - documentation for new extension devs - in distribution or easy to obtain
anything else?
If I was left to my own devices and I had to do something right now, I would start with LXML. (Not a big vote of confidence that the front page of the website is offline http://codespeak.net/lxml/ check the google cache :-)) LXML uses the ElementTree api as much as possible, this API is coming standard with python 2.5 so I expect people will be getting familiar with it. It is a wrapper around libxml2 so it has a lot of power behind the scenes (adds things like xpath and xslt support). I'm not sure how things sit with the easy to obtain issue, but there are binaries available for win32. Not sure about OSX. And everyone on linux compiles everything anyway. :-)
Aaron Spike