Parses an HTML log file them into an event sequence and a GUI model.
The parsing process can be parameterized. This allows to ignore ids or indexes of GUI elements in the log files. If they are ignored, the GUI model is more harmonized and GUI elements are considered equal although they are not. This may be helpful, e.g., if you have a table where each row is semantically the same. Without ignoring indexes or ids of the rows, each row is treated separately. But with ignored indexes or ids, all rows are considered the same.
To ignore the indexes, add -clearIndex= as parameter to the command call. To ignore ids, add -clearId= to the command call. The path to the GUI element is written using the HTML tag names and either their index or their id as identification. E.g., to denote all rows in a table where the table has the id "table_1" you can specify "table(htmlId=table_1)/tbody/tr". To denote e.g. all divs being the child of a div with an index 1, you specify "div[1]/div".
$USAGE$
path to the file to be parsed
[] array of sequences into which the parsed events shall be stored
[] used to define GUI elements of which the ids shall be ignored
[] used to define GUI elements of which the indexes shall be ignored
Example(s):
parseDirHTML /path/to/file.log
parseDirHTML /path/to/file.log sequences -clearId=table(htmlId=overview)/tbody[0]/tr