Data

The hottest stories about data science, analytics and the industry

big data

The best podcasts for people who love all things data

Written by:

In the quest to absorb all we can the data world, reading blog after blog can get a little tiring. As we’ve explored the world of data podcasts, we’ve come across several big ones that we’d love to share here (with a major h/t to @mattfogel on Medium, who has showcased many of the same […]

Love in lyrics

Written by:

Love in Lyrics Inspired by Valentines Day, this small team decided to use lyrics data to see how music has changed over the decades when it comes to love. Did you know the 90s sang about love most? Well, you do now! loveinlyrics.com

What do Startups, Video Game Developers and Canadian Public Servants have in common?

Written by:

Answer: they’re all the subject of projects that use data collected via import.io!

At import, we love data – like, a lot. But we know that data is only as fun or as interesting as what you choose to do with it. That’s why we believe in celebrating the cool uses of data that people, just like you, are doing everyday. Here are the top three most interesting data stories we’ve been sent this week…

Who’s winning the social media battle in the UK election and does it matter?

Written by:

I’ve recently moved to a new apartment. When you’re moving, there are lots of things to sort out. Internet being one of them. After spending more than 5 hours on the phone over the course of 2 weeks with an internet provider, and still not having internet, I decided to take my frustrations public. I sent them an angry tweet! And lo-and-behold within 2 hours I had the internet up and running. This really showed me the power of social media and it got me thinking – is it going to be a deciding factor in this years election.

With election fever well and truly sweeping the UK, I thought it would be fun to take a look at who’s winning the battle on social media. To do this, I used import.io to build APIs to the Facebook and Twitter pages to each party to gather the amount of likes, followers and tweets each party has. Then I used Tableau to give you a visual representation of the winners and losers on social media.

14 fantastic examples of complex data visualized

Written by:

By 2015, the global annual rate of data production is expected to reach 5.6 zettabytes, double the rate of growth in 2012, according to IDC. That’s a lot of information to digest, especially in black-and-white typeface or an unattractive, 50-page PDF. Visuals are processed 60,000 times faster in the brain than text which makes it […]

Most popular sources on growth hackers

Written by:

So, I was playing around with our new Bulk Extract feature after my webinar on it earlier this week, and I came up with this super cool use case that I just had to share. The great thing about Bulk is that it helps you get 1,000s of rows of data without breaking a sweat. 

Like many people, I always enjoy taking a break to check out Growth Hackers to see what the latest trending news is. Which got me thinking: Is there one source that gets posted more than any other? 

ThatGift

Written by:

ThatGift When finding a present for your partner gets tough, do what this guy did and build your own affiliate site! ThatGift lets you search all the best unique gift sites at one time. that-gift.co.uk

Marketing espionage with Ron Gat

Written by:

Last Monday’s User Meetup included a brilliant talk by Ron Gat, Co-founder of Launchalyzer, about how he uses data for Marketing Espionage. Just like the stellar name implies, marketing espionage is about finding what successful marketers in your area are doing, taking the best ideas and incorporating them into your own strategy. 

A data analysis of music samples

Written by:

This is a guest post by I’m Emanuele Conti, a freelance graphic designer and co-founder of Tuxtax.

Creating a Music Infographic

For this self-initiated project, I wanted to combine my passions for music and data by creating an infographic for music. 

I used www.whosampled.com as the source for my analysis. This site has well-structured pages, with a lot of informations about sampled songs, so it was the right choice for this project. Import.io was extremely helpful in collecting the data I needed.

How to use import.io in academic research

Written by:

This is a guest post by Dr Neal Haddaway (@nealhaddaway), Project Manager at MISTRA EviEM.

What is grey literature?

Typically when policy-makers and practitioners need to understand how something works, for example a painkiller or a particular method of conserving an endangered species, they would commission a piece of research to investigate and provide an answer. This research can be very time-consuming and expensive, and would likely just be one case study. I work in an area of research called secondary synthesis. Essentially this means getting the most out of research we already have, rather than commissioning new research. By looking at what has already been done and published we can combine lots of smaller studies into one large experiment, using data that exists in research articles.

A conference strategy built with Silk and import.io

Written by:

Silk’s CEO Salar al Khafaji was invited to attend the DLD confab this year. It’s a great conference with entrepreneurs, scientists, and politician. Salar likes to organize his conference forays in advance and connect with people he hopes to meet for coffee or a chat. This isn’t as straightforward as it sounds. Often, the directories that conferences provider are static and hard to filter. This can make viewing speakers and figuring out who to contact cumbersome.

Using import and infogram to analyze jobs data

Written by:

Guest post by Saurav, Co-Founder of Factlabs.in

Motivation

While doing a Job search, we came across many aspirants who have less idea about the prospects and trends about the openings in their areas of interest, so we at factlabs.in thought of displaying it in an Infographic using Infogr.am and extracting data using import.io Magic API as primary tool.

Who posts the most interesting links on Growth Hackers? (hint: it’s not Sean Ellis!)

Written by:

Because the evenings are closing in and its getting dark earlier, some I find myself looking for indoor activities to keep me busy. Because I belong to the Data Addicts Anonymous club (support group here) I thought I would try to set a record for some kind of data insight… and this is what I came up with:

  1. Extract the first 20 pages of data from Growth Hackers 
  2. Slap it into a pivot table
  3. Remove the outliers
  4. Play with the numbers
  5. Make a Chart/Viz

The spooky data project

Written by:

Halloween is rapidly approaching, and panic is setting in. What are you going to go as this year? Will it be the scary zombie, the topical GoT character, the fun filled group costume or something that shows off a little leg? This weeks webinar is for those of you who haven’t quite decided what to be for Halloween yet. I’ve got a very special (return) guest with me, the wonderful and talented Jewel Loree from Tableau, who is going to help me build a Halloween costume dashboard. As you can see, Jewel and I already have our costumes sorted; but we know how hard it can be to decide. Which is why we’ve turned to the trusty data to help you figure out whether to go scary or sexy this year!

The future of wearables

Written by:

Import.io was once again super excited to host another Big Data Debate. We managed to secure a bigger venue this time (Central Foundation Boys School), which meant we had space for even more data lovers. And good thing too, because this event was packed! Which may have had a little something to do with that night’s subject: wearables.

How much does prostitution contribute to the UK economy?

Written by:

How many prostitutes are there in the UK?  According to the Office of National Statistics the answer is 60,879.  In figures due to be released next week this number is being used to add £5.314bn to the official size of the UK economy.  But the ONS only attempted to measure the number of female prostitutes.  If male prostitutes are included in the count then the contribution that prostitution makes to the UK economy rises to £8.856bn.  

Become a data scientist in less than an hour

Written by:

We had a very special guest presenter for this last webinar; Louis Dorard, author of the popular book Bootstrapping Machine Learning, was here to show us how everyone can do data science with the help of a few tools. To prove this he showed a simple three step way to analyse the relationship between price and all the other attributes of the house (square footage, number of bathrooms, etc). In practical terms, doing this should allow us to predict how much a house might sell for based on it’s characteristics. Pretty useful if you’re looking to sell your place soon!

Real estate data infographic

Written by:

At import we love a good data Visualisation, but over the last few weeks import.io has been working with our friends at NeoMAM studios to explore the role of web data in custom/bespoke content infographics. 

Data Viz and infographics are two different things; A common use of infographics to appeal to a wider audience and to drive traffic to the creator, and often to drive a message. In our mind a data viz presents Data in an easy to understand way where as and infographic might better present an idea or concept.

So we decided to look at some data, and see if we could derive an idea. We wanted to see if we could make some entertaining and interesting visual infographic style content based on web data… and here is the result.

The World Cup of data

Written by:

Today’s webinar is all about my favorite thing – Football (or “Soccer” for our American viewers).

With the World Cup in full swing, I thought it would be fun to show you how Chris created this ultimate football stats web app which updates live.

The boom and the bust

Written by:

A new devastating housing crash is in the making, the International Monetary Fund warned on Wednesday. Its global house price index has shown that in many countries property prices are well above their historical average. The IMF called on governments and central banks to take quick action on booms, which are a threat to global economic stability.

The world’s most expensive housing market is Canada, where housing is 33% above the long-run historical average relative to incomes and 87% relative to rents. An average Canadian house is now worth $409,708, up 7.6% in the past year.

Can we ever get enough of music festivals?

Written by:

One thing is certain: if you are planning to go to a festival this summer, Queens of the Stone Age is the one band you will not miss. They are set to appear in 15 major music events across Europe, always as headliners. Boring, right?

This is what 14% of gig-goers think ahead of the new festival season. A third will not attend one this year, and the main reason is that too many events are staging the same acts. Meanwhile, they are getting too expensive.

Analysing Designer News with import.io

Written by:

Personal background

Hi, I’m Artiom Dashinsky, a young UI/UX Designer and entrepreneur from sunny Tel Aviv who likes to define himself as maker. I’d like to share with you the latest thing I’ve made using import.io.

Designer News

Every field has it’s go-to source for all the new and hot information you should know from around the industry. Tech-people have Hacker News created by Y-Combinator. But for designers, that site has to be Designer News.

Data mining vs data collection

Written by:

Data is the hot new thing, and as such it has spawned a bunch of new terms and jargon, which can be pretty hard to keep track of. To help you sound like a data guru instead of a data noob, I’ll be taking you through some of the terms people tend to get a bit confused about.

One of the most common phrases I hear being used incorrectly is Data Mining. There is a very important distinction between Data Mining and Data Collection. I know they sound like they’d be the same thing, but they’re actually very different.

Big data in the media

Written by:

New month, new debate, new panel talking some serious #DataProblems. The big difference? This was our first Big Data Debate (BDD) as sole organisers of the event.

Cue the panic and excitement!

We first got involved in the BDD to meet people equally as interested in harnessing the power of data on the web as we all are at import.io! From day one we loved co-organising this event. Creating a relaxed environment for everyone to gather and discuss common issues and, most importantly, indulge in beer and pizza proved a pretty great way to spend the evening.

One dataset can start something big

Written by:

Here at import•io we often talk of the opportunities a web of data holds for instigating social and environmental change. It is our conviction that today’s data can prevent problems tomorrow and that in democratising access to data, our technology can allow people to hold their governments accountable in a way that hasn’t been previously possible.

A Different Perspective

It is easy for all of us to look at datasets without context, seeing it as no more than well-structured words and numbers. Following a dataset through from its creation to the point where its impact can be measured requires both the right technology, and a new way of seeing the possibilities of data on the web. Spotting something on a web page and turning it into a table of rows and columns to do something different is what our recent work with Oxfam has shown is possible: that one dataset can start something big.  

Dram-atic insights

Written by:

You may or may not know this about me, but I consider myself to be something of a whisky connoisseur. I even have my own personal whisky blog where I put up my own reviews and thoughts on whisky. Anyway, the other day I was looking at a few other whisky review sites and it got me thinking about if I could use data to determine once and for all what the best dram is – in case you hadn’t noticed, I’m also a bit of a data nerd.

So I did what I usually do in these situations and took to the web to collect some data. Using a series of Crawlers and Connectors I built using import•io, I collected data from the top 7 whisky review sites that had 1/100 scales for ranking whisky.

The Roadless traveled

Written by:

There is so much that can be done by augmenting data that is found on the web today. Presenting existing data in a new format, enriched with content from multiple other sources, can itself become a business.

If you’re using import•io, you don’t need a huge amount of coding knowledge to get data from the web and feed it into your app or website. And, since using our tool is free, it’s a pretty cost effective way to aggregate and normalise a huge amount of data very quickly!

One re-Direction

Written by:

Using import•io to get structured data from websites is generally pretty straightforward. Every site is different though and occasionally we run into a data problem that requires a little old-school ingenuity to solve. When that happens, we put our best man – that would be me – on the case! And, since I’m such a generous guy, I’m sharing these with you so if you find yourself in a similar situation you’ll know just what to do.

Animation inspiration in 87seconds

Written by:

At import•io the data our users are extracting isn’t always what you’d think. I know sometimes our vision of data is static spreadsheets full of numbers and dates. But, did you know you can use import to extract more rich and visually stimulating data like images and videos? Well, you can!

Getting Inspired

87seconds is a startup which helps companies come up with fun ways to explain their products through animated videos. One of their biggest challenges is to continuously renew their design styles. So they created an inspiration feed of interesting animation videos from around the web.

Big data will rule in 2015

Written by:

Last night we had the pleasure of co-hosting yet another Big Data Debate. They are getting to be pretty popular so we had to move to a bigger venue for this one. The lovely people at Shoreditch Works let us rent out the Shoreditch Village Hall for the evening and were a great help in arranging the obligatory beer and pizza. 

Let’s Debate!

We kicked off the evening with the usual 10 minute presentations. First up was Alex Denis from educational planning company Pearson, followed by James Hodge from Splunk who showed us an adorable picture of his new baby!

Fashion as data

Written by:

Inspired by my recent experience at London Fashion Week, I was curious about the biggest players in the entertainment and fashion industries.

To find out, I extracted data from a bunch of different websites using import•io and fed the resulting data set into PiktoChart to make a super cool viz. 

Remix your data!

Written by:

One of the best things about running the User Success team at import•io is all the cool use cases you come across on a daily basis. The innovative ways our users find to benefit from and integrate data they get from our platform never cease to amaze me. It seemed a bit selfish to keep all these awesome ideas to myself, so I’m going to be showcasing some use cases that catch my attention.

Hopefully, these use cases will give you a little inspiration for how you too can use data to do some pretty cool stuff. After all, it’s easy to use the import•io browser to extract data, but what happens next? What do you do with the data?