Import.io User Guide

Chapter 4. Best Practices, Tips, and Tricks


This chapter describes best practices for enhancing your extractors, handling non-straightforward cases, and increasing your ability to obtain the desired data from websites.

Topic 1. Chaining extractors tutorial

This topic uses the chain workflow and describes how to extract data from websites where pages with lists of items link to subsequent pages containing details for each item.

Topic 2. Handling URLs that don’t change with pagination (“infinite scroll”)

This topic describes how to extract data from webpages that appear to scroll down forever.

Topic 3. Getting data from behind a login

This topic describes how to extract data from websites which require login details.

Topic 4. Working with hidden elements

This topic describes how to display hidden elements (for example, dropdown lists, multiple tabs, and hidden buttons) in the editor, so you can select them using the point-and-click interface.

Topic 5. Using manual XPath

This topic describes how to use manual XPath to extract data using the inspection tool in your browser.

Topic 6. Understanding regular expressions

This topic explains regular expressions and provides regular expression substitution examples.

Topic 7. Exchanging extractors with other users

This topic describes how to exchange extractors with other users.