Bring Web Data into Alteryx with the InterWorks Import.io Connector

Data

Bring Web Data into Alteryx with the InterWorks Import.io Connector

Import.io is the best tool for scraping structured data off the web. Alteryx is the best tool for self-service data analysis. It doesn’t take long to realize that we should put the two together.

Import.io > Alteryx” src=”/wp-content/uploads/sites/default/files/blog-content/AlteryxImportioPatrick.jpg” /></p><p>Using the Import.io API, the InterWorks Import.io connector allows you to read structured data from the web and bring it directly into your Alteryx workflow.</p><p><strong>Note:</strong> If you want to achieve similar results with Tableau, my colleague <a href=Robert Rouse built a handy Import.io web connector of his own

Setup

To install the InterWorks macro, unzip the attached folder to the location on your computer where you want the macros to be permanently saved. I suggest: DocumentsMy Alteryx MacrosInterWorks Macros Then, run the installer wizard Install.yxwz and choose Install. Once you restart Alteryx, the macros will now be available on your toolbar just like any other tool:

Alteryx - InterWorks Macros

You will also need to set up a free account at http://import.io. Once you’ve signed up, the account page will show your API Key:

Import.io API Key

Automatically Extract Web Data

Import.io provides easy access to web data through their Magic API. On the homepage, enter any URL and the Magic API will attempt to extract structured data. As an example, try using the InterWorks People page

Import.io Extraction Page

Import.io will return a structured table of the InterWorks employee directory. To do the same thing in Alteryx, use the InterWorks connector, enter your API Key, select Magic API in the tool configuration and enter the URL. The tool will the return the JSON data extracted from the webpage right into your Alteryx module:

JSON dat in Alteryx Module

Using the Connector API in Alteryx

The Magic API works wonders, but sometimes it is unable to find exactly what you are looking for. In this case, you may need to train an extractor or connector. An extractor allows you to build a custom tool to scrape data from similarly structured web pages. A connector is an extractor with a macro attached. This will allow you to record actions such as using page searches before extracting data. Building a connector is easy; but for this example, we will use one that has already been created.

Robert Rouse, in another blog, has provided a connector that allows us to pull data from a formatted Wikipedia table:

Wikipedia colors

To use this connector in Alteryx, enter your API key, the connector ID and the input variable:

How to use the connector in Alteryx

Try It Out

We’re excited to bring together the simple data scraping capabilities with our favorite data analytics tools. InterWorks is always looking for feedback and ways that we can improve our tools to better help our clients. Create a connector, try out the macro and tell us what you think!

More About the Author

Michael Treadwell

Data Engineering Lead
Resources from the ‘Advanced Analytics Done Right’ Webinar Predictive analytics is more than just dragging and dropping tools in to your workflow. Predictive analytics drive business critical ...
Scraping Web Data with Alteryx: Session Recap I’d like to say a big Thank you! to everyone that came to my session at Alteryx Inspire today! I always enjoy having the opportunity to ...

See more from this author →

Subscribe to our newsletter

  • I understand that InterWorks will use the data provided for the purpose of communication and the administration my request. InterWorks will never disclose or sell any personal data except where required to do so by law. Finally, I understand that future communications related topics and events may be sent from InterWorks, but I can opt-out at any time.
  • This field is for validation purposes and should be left unchanged.

InterWorks uses cookies to allow us to better understand how the site is used. By continuing to use this site, you consent to this policy. Review Policy OK

×

Interworks GmbH
Ratinger Straße 9
40213 Düsseldorf
Germany
Geschäftsführer: Mel Stephenson

Kontaktaufnahme: markus@interworks.eu
Telefon: +49 (0)211 5408 5301

Amtsgericht Düsseldorf HRB 79752
UstldNr: DE 313 353 072