The Benefits of Using Source Data Over a Traditional Scrape

James Curtis

Here are some key benefits of using source data and associated plugins against a traditional scrape tool. We hope this helps explain the difference and enables you to improve your data quality.

1. Better control over which data is outputted

While in development stages, scrapes or spiders, are engineered to match what is currently live on the site. While trying to be generic enough to ensure retrieving maximum data, that will only depend on the site structure at setup time. Plugins such as Magento or Demandware, however, allow merchants to have total control over the data that will be exported, providing merchants the security of making sure all the inventory is exported, regardless of navigation changes that may happen on the site.

2. Develop one export every time

When a spider is developed, engineers have to specify selectors to pick up data. Be it Xpath selectors or CSS selectors, each one of them is dependant on the current site setup. If the website interface is altered, be it a change in the CSS stylesheet or in the template structure, engineers will have to reproduce these changes. This will obviously impact feed generation, as it requires some time to be allocated for the change to be replicated. With an associated plugin, as it is directly sourcing the information from the internal database of the site, changes made on the UI will have no impact, and will therefore ensure a feed export is constantly generated with the most accurate data possible.

3. Full data exported almost instantly

As spiders have to navigate on each product page, the total running time can be quite long. We try to retrieve sites information within a 24h window to make sure prices are as accurate as possible, and stock information are matching what a user would see on site. However, site navigation relies on multiple factors, such as web server response time, page loads, and count. In other words, the busier a webserver is, and the higher the page count, the slower and longer it will take for the data to refresh in the system. While we try to ensure we retrieve 100% of the product information available on merchants' sites, it may very well happen that discrepancies will appear, as pages can fail to load, or not respond within the timeout thresholds. Plugins, such as Magento or Demandware, export straight from the site database, meaning that all these limitations will be removed, leading to a quicker turn around for refreshing the data in product feeds.

If you want to find out more on how we can help you better manage your data, don't hesitate to contact us. We would be more than happy to help.

>>> {{cta('daf779e3-9e21-4fe9-990b-bbab1a6ac742')}}

Want to Learn More?

Discover the true power of the IR platform - book your demo today

Book a Demo

Intelligent Reach Feed Specification & Product Feed FAQs

By Guy Sneesby

Guides

Onboarding with the Intelligent Reach product data management platform

By Guy Sneesby

Guides

What are YouTube Shorts Ads and how are they used in E-commerce?

By Henry Fosdike

The Award-Winning PLatform, Loved by OUR Customers...

Globas Business Excellence Awards Outstanding New Product Service

The Award-Winning PLatform, Loved by OUR Customers...

Contact Us

We are passionate about high quality product data. So passionate, in fact, that we offer a completely free feed review to highlight the ways you could improve your feed as quickly and as easily as possible.

Get in Touch

Searchspring acquires Intelligent Reach

Reach Modules

Data Management Module

Marketplace Module

Local Module

Intelligence Modules

Data Connector Module

Experiments Module

Book Your Demo

Solutions by Role

E-commerce Teams

Digital Marketers

Digital Agencies

Solutions by Need

Manage Marketplaces

Sell on Marketplaces

Optimise Google Shopping

Increase Profitability

Drive Online to Offline

Let's chat

Seraphine

PrettyLittleThing

Motorpoint

Clarins

Pets at Home

River Island

Ego

DISCOVER GREAT STORIES

News

Guides

How to Sell on...

Hints & Tips

Webinars

e-Books

E-commerce Insights

About Us

Our Team

Our Partners

Beyond Customer Success

Pricing

Join Us

Get in Touch

Reach Modules

Data Management Module

Marketplace Module

Local Module

Intelligence Modules

Data Connector Module

Experiments Module

Book Your Demo

Solutions by Role

E-commerce Teams

Digital Marketers

Digital Agencies

Solutions by Need

Manage Marketplaces

Sell on Marketplaces

Optimise Google Shopping

Increase Profitability

Drive Online to Offline

Let's chat

Seraphine

PrettyLittleThing

Motorpoint

Clarins

Pets at Home

River Island

Ego

DISCOVER GREAT STORIES

News

Guides

How to Sell on...

Hints & Tips

Webinars

e-Books

E-commerce Insights

About Us

Our Team

Our Partners

Beyond Customer Success