Hi, I am pleased to announce Harvest-1.9.13. This ia a development version. What's new: * Improved soif2xml.pl. soif2xml.pl uses a smarter approach to keep the XML tree in sync with the SOIF tree. It also should create fewer invalid XML files. * Improved zquery.pl. * Essence quits immediately when it can't compile quick-sum.cf. Essence crashed when a broken regular expression was used to summarize a candidate with essence's interneal regex based summarizer, causing loss of all data gathered until the crash. The new behaviour should avoid data loss and make it easier to write quick-sum.cf. * Broker doesn't eliminate duplicate data from a gatherer based on MD5. (Suggested by jwaite@ti.com) Former behaviour of the broker was to avoid duplicate documents. This was done by comparing MD5s of documents. If a gatherer sent summaries of two to identical documents "a.html" and "b.html", only one of them was kept in the broker. This behaviour was changed. It is now possible to have identical documents under different names. * Updated to curl 7.10.8. Please, test the new Zebra based search system by pointing your browser to: http://your.server/Harvest/brokers/zquery/ I appreciate any feedback. I would like to invite everybody interested in developing, using or simply keeping an eye on the future development of Harvest, to join the mailinglist. To join the mailinglist, please use: https://lists.sourceforge.net/lists/listinfo/harvest-devel For more information about Harvest, please visit: http://harvest.sourceforge.net/ Thank you. kj