[sword-devel] OSIS huh?

Greg Hellings greg.hellings at gmail.com
Thu Jul 10 12:43:21 MST 2008


On Thu, Jul 10, 2008 at 1:00 PM, David Haslam <d.haslam at ukonline.co.uk> wrote:
>
> http://www.oxygenxml.com/xml_diff_and_merge.html
>
> Not free, but Oxygen was recommended by one of my SIL contacts.
>
> -- David Haslam

I've downloaded that since then. Unfortunately, either the XML diff
portion is disabled for the trial license or I don't know how to make
it work.  If it doesn't work any better than the free ones, I'm not
willing to risk $400 on it based on what I've seen so far.  It does
look like nice software, though: one of the few click-through
installers for Linux I've seen that actually works.

As to Ryan's suggestion of xmldiff: I've been running that for the
past 12 hours on the two documents.  Unfortunately, the algorithms it
uses are atrociously expensive.  It's up to 2.9 GB of RAM usage and has
pegged one of the processors on the machine for 12 straight hours.
They claim that any document with more than 100 nodes is bound to take
a long time to process, and the KJV files seem to have that count beat
by a few tens of thousands.  I haven't counted the nodes, but there are
upwards of 65k lines in the file, many of them with more than one element.
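
For a rough node count, a streaming parse avoids loading a whole
document into memory.  The following is only a minimal sketch using
Python's standard library; the function name count_elements and the
sample markup are made up for illustration and are not part of SWORD
or xmldiff.

```python
import io
import xml.etree.ElementTree as ET


def count_elements(source):
    """Count element nodes in an XML document using a streaming parse.

    `source` may be a filename or a binary file-like object.
    """
    count = 0
    for _event, elem in ET.iterparse(source, events=("end",)):
        count += 1
        # Drop the text and children of each finished element; the
        # emptied element itself stays attached to its parent.
        elem.clear()
    return count


# Hypothetical sample, only loosely shaped like OSIS markup.
sample = io.BytesIO(
    b"<osis><div>"
    b"<verse osisID='Gen.1.1'>In the beginning</verse>"
    b"<verse osisID='Gen.1.2'/>"
    b"</div></osis>"
)
print(count_elements(sample))
```

Passing a path instead of the in-memory sample should work the same
way for a large file.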

But the modifications I have made are at least producing valid XML
output.  Next, I'll check the module -> mod2osis -> XML -> osis2mod ->
module round trip by comparing the modules.  As long as the XML files
are close enough to the original to produce the same results on
import, I'll be happy for now.  Since SWORD moves some things around
on import (specifically the titles/headers), an exact replication is
beyond what's possible at this time, as I see it.  However, an
equivalent export should be possible.
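
One way to get past the order-of-attributes noise described in the
quoted message below might be to canonicalize both documents first and
then run an ordinary line diff.  This is a sketch under assumptions,
not what xmldiff or Oxygen do: it relies on
xml.etree.ElementTree.canonicalize (Python 3.8 or later), which writes
attributes in a fixed sorted order, and the helper names and sample
verse markup are invented for illustration.

```python
import difflib
from xml.etree.ElementTree import canonicalize  # requires Python 3.8+


def normalized_lines(xml_text):
    """Return canonical XML split into roughly one tag per line."""
    # Canonical (C14N 2.0) output sorts attributes, so two documents
    # that differ only in attribute order serialize identically.
    canon = canonicalize(xml_data=xml_text)
    # In canonical output a literal "><" only occurs between tags,
    # because "<" and ">" inside text are escaped.
    return canon.replace("><", ">\n<").splitlines()


def attribute_order_insensitive_diff(old_text, new_text):
    """Unified diff of two XML strings, ignoring attribute order."""
    return list(
        difflib.unified_diff(
            normalized_lines(old_text),
            normalized_lines(new_text),
            lineterm="",
        )
    )


# Hypothetical fragments that differ only in attribute order.
old = '<verse osisID="Gen.1.1" sID="v1">In the beginning</verse>'
new = '<verse sID="v1" osisID="Gen.1.1">In the beginning</verse>'
print(attribute_order_insensitive_diff(old, new))  # prints []
```

For whole files, canonicalize also accepts a from_file argument, so
the two exports could be normalized to disk and compared with plain
diff.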

I'll send along a patch, if applicable, when I'm happy with the
re-import into SWORD.

--Greg

>
>
> Greg Hellings wrote:
>>
>>
>> The lack of an XML-aware diff-like tool keeps me from being able to
>> look at the content of the New Testament because I have not been able
>> to construct a regex to convince diff to ignore the
>> order-of-attributes differences.  However, I see no differences in the
>> actual OT content except those two mentioned above.  If anyone knows of
>> an XML-aware diff-like tool, I would love to hear about it, as it
>> would make this much, much easier.
>>
>> --Greg
>>
>
> --
> View this message in context: http://www.nabble.com/OSIS-huh--tp18372016p18388539.html
> Sent from the SWORD Dev mailing list archive at Nabble.com.
>
>
> _______________________________________________
> sword-devel mailing list: sword-devel at crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page
>


