If this is your first visit, be sure to
check out the FAQ by clicking the
link above. You may have to register
before you can post: click the register link above to proceed. To start viewing messages,
select the forum that you want to visit from the selection below.
One of our grammar types is Lines, which can start at line 1, and go for x lines. Or you can define something more specific using Delimited or a RegEx.
The key step is, once the grammar is defined, it also needs to be marked as Unimportant. By default, the defined grammar is still Important until it is changed to Unimportant in the Session Settings. Then, Ignore Unimportant Differences should be enabled in the toolbar.
The default Session Settings (editable on the Home screen) control the importance of a "Rules-based scan" in the Folder Compare, when comparing multiple files.
Is the page header exactly that specific text every time? You could create a Basic grammar for the first line of the header that is a copy/paste of that line exactly, and another Basic grammar that is a copy/paste of line2 of the header, etc. Then these lines, regardless of where they appear, would be unimportant.
A page number or timestamp could be assigned a RegEx mask as part of the Basic grammar definition. Did you get a chance to review the above video link? To ignore a page header, we would need to identify it. If it occurs at the beginning of the file we could use the Lines grammar, but if it occurs multiple times at random points throughout the file, we need to find an expression that finds that text and matches on it. Once defined, it can be marked as Unimportant.
In the above screenshot you provided, is that whole bit of text the header, or is just the first line the header?
Per the attached screen shot, each page of the report contains a 'heading' that consists of four lines. The first line will be different on each page as it contains the date and time.
RegEx such as "[0-9][0-9]" will work for the for "Date: 01/30/15 16:39." However, the expression is also cause issues within the body of the report.
Again, I am seeking a solution that will work on a page heading and not cause other issues / conflicts with the data provide within the body of the report
The file input into Beyond Compare is a single .txt that is a single, long scroll of multiple pages, correct? I was hoping to get a larger example in order to see what the other lines in the file are, as that would help determine the type of definition that can match on the PageHeader but not match on other data.
Given the above screenshot, I would use something like, a Lines grammar defined as:
^.*Date: \d\d/\d\d/\d\d.*Page:.*$
and next 3 lines
Regular Expression Enabled
Given the complexity of Regular Expressions and the wide variety of text that might appear in your file, it is difficult to create an exact solution without access to more information. If you can send in a full example file, with line numbers of the Page Headers you need to ignore, that would help. You can attach here to the forum thread, or email [email protected] and please include a link back to this forum thread in the email for our reference.
Comment