Chapter 4. Best Practices, Tips, and Tricks
This chapter describes best practices for improving your extractors, handling complex cases, and getting the data you want from websites.
Topic 1. Chaining extractors tutorial
This topic uses the chain workflow and describes how to extract data from websites where pages with lists of items link to subsequent pages containing details for each item.
Topic 2. Working with infinite scroll
This topic describes how to extract data from webpages that load more content each time you scroll down.
Topic 3. Getting data from behind a login
This topic describes how to extract data from websites that require you to log in.
Topic 4. Working with hidden elements
This topic describes how to display hidden elements (for example, dropdown lists, multiple tabs, and hidden buttons) in the editor, so you can select them using the point-and-click interface.
Topic 5. Using manual XPath
This topic describes how to write XPath expressions manually, with the help of your browser's inspection tool, to extract data.
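As a rough illustration of what an XPath expression selects, the following minimal sketch runs two expressions against a small, made-up page fragment. The markup, class names, and values are assumptions invented for this example, and Python's standard xml.etree.ElementTree module supports only a subset of XPath, so the expressions you copy from a browser's inspection tool may need adjusting.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed page fragment used only for illustration.
page = """
<html><body>
  <div class="product">
    <h2>Blue kettle</h2>
    <span class="price">19.99</span>
  </div>
  <div class="product">
    <h2>Red toaster</h2>
    <span class="price">24.50</span>
  </div>
</body></html>
"""

root = ET.fromstring(page)

# Select the heading inside every div whose class attribute is "product".
names = [h.text for h in root.findall(".//div[@class='product']/h2")]

# Select the price span inside the same kind of div.
prices = [s.text for s in root.findall(".//div[@class='product']/span[@class='price']")]

print(names)
print(prices)
```

In a browser, the equivalent expressions would typically start with // instead of .// (for example, //div[@class='product']/h2).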
Topic 6. Understanding regular expressions
This topic explains regular expressions and provides regular expression substitution examples.
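To give a sense of what a regular expression substitution does, here is a minimal sketch using Python's standard re module. The sample values and the helper names clean_price and reorder_date are hypothetical, chosen only for this illustration, and the substitution syntax in your own tool may differ from Python's.

```python
import re

# Hypothetical raw values, as they might appear in an extracted column.
raw_prices = ["$1,299.00", "USD 45", "Price: 7.50"]

def clean_price(text):
    # Remove every character that is not a digit or a decimal point.
    return re.sub(r"[^0-9.]", "", text)

def reorder_date(text):
    # Capture day, month, and year, then rewrite them as year-month-day.
    return re.sub(r"(\d{2})/(\d{2})/(\d{4})", r"\3-\2-\1", text)

cleaned = [clean_price(p) for p in raw_prices]
print(cleaned)
print(reorder_date("Posted 31/12/2020"))
```

The first function shows substitution used to strip unwanted characters; the second shows capture groups used to rearrange parts of a match.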
Topic 7. Sharing extractors
This topic describes how to share extractors with other users.