I'm trying to compare a ~65k auto-generated text file for difference in a couple sections while I iterate the generation process.
My problem is that a text comparison gets confused fairly quickly about section layouts. Which is to be expected, I'm not expecting it to parse arbitrary formats perfectly.
There are several unique sections that I can manually "Align With..." using text searches for for the unique strings to fix this up and get a much better diff. But as soon as I generate new output for comparison. I lose all those manual alignments.
I'd like to insert some firm anchoring points as hints to beyond compare to shortcut the manual "Align With..." steps. I can guarantee that my anchors are a unique line in each file that perfectly matches a single line in the other.
I've been trying to use the "Everything Else" file format grammar options to accomplish this. I've got a grammar rule correctly identifying what I call "Complex Text Comparison" formats. But if I copy the relevant matching rule down to "Line weights" and give it a priority of 5. I can't seem to affect any difference in the automatic comparison.
In my test file. The first relevant anchor in each file is separated by ~5k lines. Is there a maximum distance at play here? Is there a super secret hinting system I can leverage to make rock solid unique anchoring points?
My problem is that a text comparison gets confused fairly quickly about section layouts. Which is to be expected, I'm not expecting it to parse arbitrary formats perfectly.
There are several unique sections that I can manually "Align With..." using text searches for for the unique strings to fix this up and get a much better diff. But as soon as I generate new output for comparison. I lose all those manual alignments.
I'd like to insert some firm anchoring points as hints to beyond compare to shortcut the manual "Align With..." steps. I can guarantee that my anchors are a unique line in each file that perfectly matches a single line in the other.
I've been trying to use the "Everything Else" file format grammar options to accomplish this. I've got a grammar rule correctly identifying what I call "Complex Text Comparison" formats. But if I copy the relevant matching rule down to "Line weights" and give it a priority of 5. I can't seem to affect any difference in the automatic comparison.
In my test file. The first relevant anchor in each file is separated by ~5k lines. Is there a maximum distance at play here? Is there a super secret hinting system I can leverage to make rock solid unique anchoring points?
Comment