Displays the data extracted using automatic structure recognition algorithms.
The guess view tries to understand the structure of the data found in the current page, if any.
Note: The program analyzes the available HTML source code of the page. It looks for labels and field/record separators using many different strategies, then rates each possible structure it finds and selects the best candidate, if any. The challenge for these algorithms is to understand even non-tabulated data. We will keep making them more efficient, but the very nature of the problem makes it impossible to ever get close to a 100% success rate.
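To make the "try several strategies, rate each, keep the best" idea concrete, here is a minimal Python sketch. It is not the program's actual algorithm, which is not documented here. The candidate separators, the scoring rule, and the names rate_separator and guess_structure are all assumptions made for illustration. Each candidate separator is rated by how consistently it splits the lines into the same number of fields.

```python
from collections import Counter

# Hypothetical list of field separators to try (an assumption, not the program's list).
CANDIDATE_SEPARATORS = ["\t", ",", ";", "|", ":"]


def rate_separator(lines, sep):
    """Rate one separator: share of non-empty lines that split into the
    most common field count. Returns (score, field_count)."""
    counts = [len(line.split(sep)) for line in lines if line.strip()]
    if not counts:
        return 0.0, 0
    width, freq = Counter(counts).most_common(1)[0]
    if width < 2:
        # The separator does not actually split most lines: not a structure.
        return 0.0, width
    return freq / len(counts), width


def guess_structure(text):
    """Try every candidate separator and keep the best-rated one, if any."""
    lines = text.splitlines()
    best = None
    for sep in CANDIDATE_SEPARATORS:
        score, width = rate_separator(lines, sep)
        if score > 0 and (best is None or score > best[0]):
            best = (score, sep, width)
    if best is None:
        return None  # no structure recognized
    score, sep, width = best
    # Keep only the lines that match the winning field count.
    records = [
        [field.strip() for field in line.split(sep)]
        for line in lines
        if len(line.split(sep)) == width
    ]
    return {"separator": sep, "score": score, "records": records}
```

With this sketch, a text made of "label: value" lines is recognized with the ":" separator, while a line of plain prose returns None. That mirrors the "if any" in the description above: sometimes no structure is found.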
If the scraped data is critical to your workflow and you cannot afford a failed automatic recognition, it may not be a good idea to rely on automatic features like this one. In such cases, you should probably define a scraper and use the right-click menu option 'Apply Scraper to Selected URLs'. Provided you have thoroughly tested the scraper you designed, the process will then be reliable and reproducible, at least as long as the online source is not altered and remains accessible.
Source URL: As in the datasheets of the other views, the Source URL is placed in the first column and is hidden by default.
Note: Use the column picker at the top right corner to show or hide columns. When a row is moved to the Catch, the Source URL is always included in the moved data, even if its column is hidden in the view datasheet. All other hidden columns are ignored in the transfer.
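The transfer rule in the note above can be summarized in a few lines of Python. This only illustrates the rule and is not the program's code. The function name rows_for_catch and the representation of rows as dictionaries are assumptions.

```python
def rows_for_catch(rows, columns, hidden, source_col="Source URL"):
    """Return the data that would be moved: visible columns plus the
    Source URL column, which is kept even when it is hidden."""
    kept = [c for c in columns if c == source_col or c not in hidden]
    return [{c: row[c] for c in kept} for row in rows]


# Example: "Source URL" and "Price" are hidden; only "Price" is dropped.
columns = ["Source URL", "Name", "Price"]
rows = [{"Source URL": "http://example.com/a", "Name": "Item A", "Price": "10"}]
moved = rows_for_catch(rows, columns, hidden={"Source URL", "Price"})
```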
As in the other views, the content of this view can be filtered and sorted to extract specific data, using the filter controls and right-click functions in the datasheet. You can also move the content to the Catch or export it to a file.