Sample Reports | ProWebScraper Data Examples

A sporting goods retailer opens this file every Monday at 7am. 2,000 SKUs across 6 competitors. They filter by price gap, reprice 340 SKUs before lunch, and close the file. Here is what that file looks like.

Key questions this answers:

What are competitors charging right now (total landed price)?

Who offers free shipping vs paid shipping?

Where am I overpriced vs underpriced?

Which markets have the highest price variance?

2,000

SKUs tracked

6

competitor sites

98.2%

completeness

99.1%

accuracy

Competitor Price Report — Jan 13, 2023

| Your SKU | Product | Competitor | Your Price | Comp Price | Δ% | Stock | URL | Timestamp |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| TG-BPD21 | Babolat Pure Drive 2021 | TennisOnly | $339.00 | $349.95 | +3.2% | In Stock | tennisonly.com.au/BPD21 | 2023-01-13T06:14Z |
| TG-BPD21 | Babolat Pure Drive 2021 | TennisDirect | $339.00 | $369.99 | +9.1% | In Stock | tennisdirect.com.au/babolat-pd | 2023-01-13T06:18Z |
| TG-BPD21 | Babolat Pure Drive 2021 | Strungout | $339.00 | $349.95 | +3.2% | In Stock | strungout.com.au/babolat-pd | 2023-01-13T06:22Z |
| TG-BPDL | Babolat Pure Drive Lite | TennisOnly | $339.00 | $319.00 | −8.6% | In Stock | tennisonly.com.au/BPDL21 | 2023-01-13T06:14Z |
| TG-BPDL | Babolat Pure Drive Lite | TennisWarehouse | $339.00 | $319.00 | −8.6% | In Stock | tenniswarehouse.com.au/bpdl | 2023-01-13T06:19Z |
| TG-BPDL | Babolat Pure Drive Lite | Strungout | $339.00 | $329.95 | −5.5% | In Stock | strungout.com.au/bpdl | 2023-01-13T06:22Z |
| TG-BPDT | Babolat Pure Drive Team | TennisOnly | $339.00 | $329.00 | −5.7% | Limited | tennisonly.com.au/BPD21T | 2023-01-13T06:14Z |
| TG-BPDT | Babolat Pure Drive Team | TennisWarehouse | $339.00 | $329.00 | −5.7% | In Stock | tenniswarehouse.com.au/bpdt | 2023-01-13T06:20Z |
| TG-BPDTR | Babolat Pure Drive Tour | TennisOnly | $339.00 | $349.95 | +3.2% | In Stock | tennisonly.com.au/BPDT21 | 2023-01-13T06:14Z |
| TG-BPDTR | Babolat Pure Drive Tour | TennisWarehouse | $339.00 | $349.00 | +2.9% | In Stock | tenniswarehouse.com.au/bpdtr | 2023-01-13T06:20Z |
| TG-BPD26 | Babolat Pure Drive JR 26 | TennisOnly | $169.00 | $189.95 | +12.4% | In Stock | tennisonly.com.au/BPDJ26 | 2023-01-13T06:15Z |
| TG-BPD26 | Babolat Pure Drive JR 26 | TennisWarehouse | $169.00 | $189.95 | +12.4% | In Stock | tenniswarehouse.com.au/jr26 | 2023-01-13T06:20Z |
| TG-BPDSL | Babolat Pure Drive Super Lite | TennisOnly | $349.00 | $339.94 | −2.6% | In Stock | tennisonly.com.au/BPSP21 | 2023-01-13T06:15Z |
| TG-BPDSL | Babolat Pure Drive Super Lite | TennisDirect | $349.00 | $349.99 | +0.3% | In Stock | tennisdirect.com.au/bpdsl | 2023-01-13T06:18Z |
| TG-BPAR | Babolat Pure Aero Rafa | TennisOnly | $375.00 | $389.95 | +4.0% | In Stock | tennisonly.com.au/BPAR | 2023-01-13T06:15Z |
| TG-BPAR | Babolat Pure Aero Rafa | TennisDirect | $375.00 | $389.99 | +4.0% | Backordered | tennisdirect.com.au/aero-rafa | 2023-01-13T06:18Z |
| TG-BPD110 | Babolat Pure Drive 110 | TennisWarehouse | $339.00 | — | Not found on site | | tenniswarehouse.com.au/bd110 | 2023-01-13T06:20Z |
| TG-BPDP | Babolat Pure Drive Plus | Strungout | $339.00 | $0.00 | Anomaly | — | strungout.com.au/bpdp | 2023-01-13T06:22Z |

↑ Row 7: Limited stock flagged — not just "In Stock" vs "Out." Row 16: Backordered — different from out-of-stock. Row 17: Product not found — flagged explicitly, never blank. Row 18: $0.00 anomaly — flagged as probable scraper error, excluded from analysis, queued for same-day fix.
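The flags described above can be sketched as a small classification step. This is an illustrative sketch only, not the production pipeline: the function names `delta_pct` and `classify` are hypothetical, and the rule that any price at or below zero counts as an anomaly is an assumption based on the $0.00 row.

```python
from decimal import Decimal
from typing import Optional


def delta_pct(your_price: Decimal, comp_price: Decimal) -> Decimal:
    """Competitor price relative to your price, in percent, rounded to 0.1."""
    return ((comp_price - your_price) / your_price * 100).quantize(Decimal("0.1"))


def classify(your_price: Decimal, comp_price: Optional[Decimal]) -> str:
    """Return the Δ% cell for one row, flagging rows that must not be averaged."""
    if comp_price is None:
        # The product page was not found: flag it explicitly, never leave it blank.
        return "Not found on site"
    if comp_price <= 0:
        # Assumption: a zero or negative price is treated as a probable scraper error.
        return "Anomaly"
    return f"{delta_pct(your_price, comp_price):+}%"


print(classify(Decimal("339.00"), Decimal("349.95")))  # Pure Drive 2021 at TennisOnly
print(classify(Decimal("339.00"), None))               # Pure Drive 110 at TennisWarehouse
print(classify(Decimal("339.00"), Decimal("0.00")))    # Pure Drive Plus at Strungout
```

Using `Decimal` rather than floats keeps currency arithmetic exact, which matters when a 0.1-point difference decides whether a SKU gets repriced.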

Delivery

Delivery confirmation — what hits your inbox with every run

Run Summary

Run completed: 2,000 products from 6 sites

Completeness: 98.2% | Accuracy: 99.1%

Flagged for review:

12 products not found on TennisWarehouse (likely delisted)

1 anomaly: Pure Drive Plus at Strungout showing $0.00
(scraper error — excluded from file, queued for fix)

Strungout.com.au returned 503 at 06:22Z — retried at 06:45Z, succeeded.
4 products collected on retry, no data loss.

Delivery Details

Files delivered to: s3://your-bucket/pricing/2023-01-13/

Schema: v2.3 (unchanged since Nov 2022)

Next run: Jan 28, 2023

The $0 anomaly was caught before it reached your BI dashboard. The 503 error was retried automatically — no data loss, no manual intervention. In a self-service tool, that row corrupts your average and the site outage means missing data until someone notices.
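To see why one $0.00 row matters, here is a minimal sketch that averages the three Pure Drive 2021 competitor prices from the report with and without a zero reading. The fourth reading is hypothetical (the real $0.00 row belongs to a different SKU), and `clean` is an illustrative helper, not part of any delivered tooling.

```python
from decimal import Decimal
from statistics import mean

# Three real competitor prices from the report plus one hypothetical $0.00 reading.
readings = [Decimal("349.95"), Decimal("369.99"), Decimal("349.95"), Decimal("0.00")]


def clean(prices):
    """Drop non-positive prices before any aggregate is computed."""
    return [p for p in prices if p > 0]


naive_avg = mean(readings)         # the zero drags the average down
clean_avg = mean(clean(readings))  # the anomaly is excluded first

print(f"naive average: ${naive_avg:.2f}")
print(f"clean average: ${clean_avg:.2f}")
```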

Why This Matters

Multi-tier pricing — when one price is not enough

Real data from a NZ pet retailer. Same products, but each site offers different pricing tiers:

| Product | Site | One-Time | Repeat Delivery | Old Price | On Sale? |
| --- | --- | --- | --- | --- | --- |
| Black Hawk Cat Chicken 12kg | Animates | $125.59 | $133.44 | $156.99 | Yes |
| Pro Plan Dog Weight 15.4kg | Animates | $178.39 | $189.54 | $222.99 | Yes |
| Royal Canin Digestive Care | PetStock | $3.13 | $2.66 | — | |
| Royal Canin Maxi Puppy 4kg | PetStock | $65.90 | $56.02 | — | |

PetStock shows $65.90 on-page. The subscription price is $56.02. If you track headline prices only, you are comparing against the wrong number.
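One way to avoid comparing against the wrong number is to keep every tier on the record and choose the comparison price explicitly. The sketch below is an assumption about how such a record could look. `TieredPrice` and `lowest_available` are hypothetical names, not a delivered schema.

```python
from dataclasses import dataclass
from decimal import Decimal
from typing import Optional


@dataclass
class TieredPrice:
    product: str
    site: str
    one_time: Decimal
    repeat_delivery: Optional[Decimal] = None  # subscription tier, if the site has one
    old_price: Optional[Decimal] = None        # struck-through price, if shown

    def lowest_available(self) -> Decimal:
        """Lowest price a shopper can actually pay across the captured tiers."""
        tiers = [p for p in (self.one_time, self.repeat_delivery) if p is not None]
        return min(tiers)


maxi_puppy = TieredPrice("Royal Canin Maxi Puppy 4kg", "PetStock",
                         Decimal("65.90"), Decimal("56.02"))
black_hawk = TieredPrice("Black Hawk Cat Chicken 12kg", "Animates",
                         Decimal("125.59"), Decimal("133.44"), Decimal("156.99"))

print(maxi_puppy.lowest_available())  # the repeat-delivery tier is lower here
print(black_hawk.lowest_available())  # the one-time tier is lower here
```

The two rows show why the tier cannot be hard-coded: at PetStock the repeat-delivery price is the lower one, while in the Animates rows the one-time price is.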

Edge Case

How "product not found" works (never blank)

When TennisWarehouse delists a product mid-cycle, the row does not disappear or show blank fields. It is flagged: "Not found on site." We distinguish "delisted," "out of stock," and "scraper error" — three different situations requiring three different responses from your pricing team.

A rug manufacturer needed to know which retailers were undercutting RRP — and which were compliant. We tracked 4,729 SKUs across 8 UK retailers for 8 months. This is from their actual January 2026 enforcement report.

Key questions this answers:

Which sellers are breaking MAP right now?

How much are they undercutting by (exact amount)?

Is this a one-time or chronic pattern?

Do I have evidence for enforcement action?

£754

largest gap found

8,988

price changes

8

retailers tracked

4,729

SKUs monitored

Violation Report — January 2026 (selected products)

| Product | Size | RRP | Retailer | Price | vs RRP | URL | Timestamp | Screenshot |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Form Wool — Green | 200×290 | £1,399 | BM | £644.76 | −54% | beddingmill.co.uk/form-green | 2026-01-27T08:20Z | S3 link |
| Form Wool — Green | 200×290 | £1,399 | RugShop | £805.95 | −42% | therugshopuk.co.uk/form-green | 2026-01-27T08:22Z | S3 link |
| Form Wool — Green | 200×290 | £1,399 | LandOfRugs | £862.99 | −38% | landofrugs.co.uk/form-green | 2026-01-27T08:24Z | S3 link |
| Form Wool — Green | 200×290 | £1,399 | John Lewis | £1,399.00 | At RRP | johnlewis.com/form-green | 2026-01-27T08:25Z | S3 link |
| Gatsby — Blue | 200×290 | £1,109 | BM | £633.07 | −43% | beddingmill.co.uk/gatsby-blue | 2026-01-27T08:26Z | S3 link |
| Gatsby — Blue | 200×290 | £1,109 | LandOfRugs | £704.99 | −36% | landofrugs.co.uk/gatsby-blue | 2026-01-27T08:28Z | S3 link |
| Gatsby — Blue | 200×290 | £1,109 | Furniture Village | £1,109.00 | At RRP | furniturevillage.co.uk/gatsby | 2026-01-27T08:29Z | S3 link |
| Tate — Grey | 200×290 | £1,059 | M&S | £478.80 | −55% | marksandspencer.com/tate-grey | 2026-01-27T08:30Z | S3 link |
| Tate — Grey | 200×290 | £1,059 | BM | £608.93 | −43% | beddingmill.co.uk/tate-grey | 2026-01-27T08:32Z | S3 link |
| Tate — Grey | 200×290 | £1,059 | Heal's | £1,059.00 | At RRP | heals.com/tate-grey | 2026-01-27T08:33Z | S3 link |
| Aurora Galaxy — Gold | 160×230 | £289 | BM | £147.99 | −49% | beddingmill.co.uk/aurora-gold | 2026-01-27T08:34Z | S3 link |
| Aurora Galaxy — Gold | 160×230 | £289 | John Lewis | £289.00 | At RRP | johnlewis.com/aurora-gold | 2026-01-27T08:35Z | S3 link |

↑ Highlighted: £754 gap — Form Wool Green at BM (£644.76) vs RRP (£1,399). Same product, same month, same report. Compliant retailers like John Lewis and Heal's are tracked alongside violators — the report shows the full picture, not just problems.
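The "vs RRP" column and the £754 gap follow from simple arithmetic. A minimal sketch, assuming percentages are rounded to whole numbers as they appear in the report (the function name `violation` is hypothetical):

```python
from decimal import Decimal, ROUND_HALF_UP


def violation(rrp: Decimal, advertised: Decimal):
    """Return (amount below RRP in £, percentage vs RRP rounded to a whole number)."""
    amount = rrp - advertised
    pct = ((advertised - rrp) / rrp * 100).quantize(Decimal("1"), rounding=ROUND_HALF_UP)
    return amount, pct


amount, pct = violation(Decimal("1399"), Decimal("644.76"))  # Form Wool Green at BM
print(f"£{amount} below RRP ({pct}%)")

at_rrp_amount, at_rrp_pct = violation(Decimal("1399"), Decimal("1399.00"))  # John Lewis
print("At RRP" if at_rrp_amount == 0 else f"{at_rrp_pct}%")
```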

Screenshot

Screenshot evidence — 3 examples from this report

Every violation includes a timestamped screenshot showing the advertised price, URL bar, and capture time:

beddingmill.co.uk/asiatic-form-wool-green-200x290

Asiatic Form Wool Rug — Green 200x290cm

£644.76

£1,399.00

−54% vs RRP

Captured: 2026-01-27T08:32:14Z

ID: MAP-2026-0251

URL: beddingmill.co.uk

therugshopuk.co.uk/asiatic-gatsby-blue-200x290

Asiatic Gatsby Rug — Blue 200x290cm

£791.34

£1,109.00

−29% vs RRP

Captured: 2026-01-27T08:34:22Z

ID: MAP-2026-0263

URL: therugshopuk.co.uk

marksandspencer.com/asiatic-tate-grey-200x290

Asiatic Tate Rug — Grey 200x290cm

£478.80

£1,059.00

−55% vs RRP

Captured: 2026-01-27T08:36:48Z

ID: MAP-2026-0270

URL: marksandspencer.com

Retailers cannot claim "we never charged that." This evidence has been used in actual enforcement actions.

Evidence

Evidence packet — what gets forwarded to legal

Retailer: Name + storefront ID
Product: Code + name + size
RRP: Manufacturer price
Advertised price: After all discounts
Violation amount: Exact £ below RRP
Product URL: Live link
UTC timestamp: ISO 8601
Screenshot: S3 link, timestamped
History: First-time vs chronic
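As a rough illustration, the packet fields listed above map naturally onto a single immutable record. The class name, field names, storefront ID, product code, and S3 path below are all hypothetical placeholders. Only the prices, URL, and timestamp come from the report.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from decimal import Decimal


@dataclass(frozen=True)
class EvidencePacket:
    retailer: str
    storefront_id: str       # placeholder value below, not a real ID
    product_code: str        # placeholder value below, not a real code
    product_name: str
    size: str
    rrp: Decimal
    advertised_price: Decimal
    product_url: str
    captured_at: datetime
    screenshot_uri: str      # placeholder value below, not a real S3 path
    history: str

    @property
    def violation_amount(self) -> Decimal:
        """Exact amount below RRP, in £."""
        return self.rrp - self.advertised_price

    def timestamp_iso(self) -> str:
        """Capture time as an ISO 8601 UTC string."""
        return self.captured_at.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")


packet = EvidencePacket(
    retailer="BM",
    storefront_id="STOREFRONT-ID-PLACEHOLDER",
    product_code="PRODUCT-CODE-PLACEHOLDER",
    product_name="Form Wool, Green",
    size="200×290",
    rrp=Decimal("1399"),
    advertised_price=Decimal("644.76"),
    product_url="beddingmill.co.uk/form-green",
    captured_at=datetime(2026, 1, 27, 8, 20, tzinfo=timezone.utc),
    screenshot_uri="s3://example-bucket/placeholder.png",
    history="Chronic ×8",
)
print(packet.violation_amount, packet.timestamp_iso())
```

Freezing the dataclass is a deliberate choice here: evidence forwarded to legal should not be mutable after capture.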

Edge Case

Violation history: first-time vs chronic

| Month | BM Price | RRP | Violation | Status |
| --- | --- | --- | --- | --- |
| Jun 2025 | £918.72 | £1,399 | −34% | Violation |
| Jul 2025 | £626.40 | £1,399 | −55% | Violation |
| Aug 2025 | £626.40 | £1,399 | −55% | Violation |
| Sep 2025 | £657.72 | £1,399 | −53% | Violation |
| Oct 2025 | £829.91 | £1,399 | −41% | Violation |
| Nov 2025 | £801.99 | £1,399 | −43% | Violation |
| Dec 2025 | £560.63 | £1,399 | −60% | Violation |
| Jan 2026 | £644.76 | £1,399 | −54% | Chronic ×8 |

8 consecutive months below RRP. The price fluctuates (£560.63 in December, £829.91 in October) but never reaches RRP. This pattern turns "we will look into it" into a cease-and-desist.
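The "×8" count is a streak of consecutive months below RRP, ending at the most recent month. A minimal sketch of that count, using the BM prices from the table (the function name is hypothetical, and the point at which a streak is labelled "chronic" is not stated in the report, so none is assumed here):

```python
from decimal import Decimal


def consecutive_months_below(prices, rrp):
    """Count how many of the most recent months in a row were priced below RRP."""
    streak = 0
    for price in reversed(prices):  # walk back from the latest month
        if price >= rrp:
            break
        streak += 1
    return streak


# Jun 2025 through Jan 2026, oldest first.
bm_history = [Decimal(p) for p in (
    "918.72", "626.40", "626.40", "657.72", "829.91", "801.99", "560.63", "644.76",
)]
streak = consecutive_months_below(bm_history, Decimal("1399"))
print(f"{streak} consecutive months below RRP")
```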

How It Works

Unauthorized seller detection

| Marketplace | Seller | Seller ID | Products | Lowest Price | vs MAP | First Seen |
| --- | --- | --- | --- | --- | --- | --- |
| Amazon UK | BargainRugsDirect | A3K8F2M1X9 | | £89.99 | −40% | Nov 2025 |
| eBay UK | homedeals_clearance | hdc-7821 | | £72.50 | −51% | Oct 2025 |
| Amazon UK | QualityHomeGoods | A1P7R3N5K2 | | £134.00 | −8% | Jan 2026 |

One workwear brand found 700 unauthorized sellers over 4 years. They were hiding in the 40% coverage gap their previous vendor could not reach.

A luxury marketplace's account team uses this before every seller meeting. Instead of "please add more products" they say: "You have 29 Gucci products. Average seller has 258. Here's which categories you're missing."

Key questions this answers:

Which sellers have the biggest assortment gaps?

Where are category-level gaps (bags, shoes, accessories)?

Which brands are underrepresented across my network?

Which sellers suddenly dropped products this week?

174

sellers

209

brands

365K

products/wk

2,554

largest gap

Seller Ranking — Gucci Depth

| Seller | Gucci Products | vs Avg (258) | Status |
| --- | --- | --- | --- |
| VITKAC | 2,555 | +2,297 | BM Leader |
| Level Shoes | 859 | +601 | Strong |
| Fashion Clinic | 509 | +251 | Above Avg |
| BENCHMARK AVERAGE | 258 | | |
| Boutique Tricot | 29 | −229 | Below |
| Leigh's | 6 | −252 | Below |
| NIDA | 1 | −257 | Below |

↑ Highlighted: Boutique Tricot — 29 vs 258 average. That gap is the conversation your account team now has with exact numbers.

Why This Matters

Category gaps — 876 clothing items, zero bags

Seller

Brand

Clothing

Shoes

Bags

Gap

Apranga Lithuania

Max Mara

876

Bags + Shoes

The Mint Company

Polo Ralph Lauren

256

Bags missing

Tessabit Group

Stone Island

195

Bags + Shoes

Marais

Alexander McQueen

164

Bags missing

876 clothing items, zero bags. Not a strategy decision — a category expansion opportunity. One marketplace used this data in seller meetings: assortment went from 50% to 98% completion.

Scale

Missing brands across your network

| Brand | Sellers NOT Carrying | Out of 174 | Gap |
| --- | --- | --- | --- |
| Prada | 135 | 174 | 77.6% |
| Gucci | 119 | 174 | 68.4% |
| Fendi | 119 | 174 | 68.4% |
| Balenciaga | 109 | 174 | 62.6% |

Edge Case

Early warning — 4,253 products to zero in one week

| Seller | This Week | Last Week | Change | Priority |
| --- | --- | --- | --- | --- |
| TOPS! Lithuania | 0 | 4,253 | −100% | Critical |
| Colognese 1882 | 252 | 1,766 | −86% | Critical |
| Dell'Oglio Group | 2,064 | 3,288 | −37% | Warning |

Without monitoring, weeks can pass before anyone notices that TOPS! Lithuania vanished. With it, your account team knows Monday morning.
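A week-over-week check like this can be sketched in a few lines. The thresholds below (Critical at −50% or worse, Warning at −25% or worse) are assumptions chosen to be consistent with the three rows shown. The report does not state the actual cut-offs, and the function names are hypothetical.

```python
def weekly_change_pct(this_week: int, last_week: int) -> int:
    """Percentage change in product count, rounded to a whole number.

    Assumes last_week > 0; a seller with no products last week needs separate handling.
    """
    return round((this_week - last_week) / last_week * 100)


def priority(change_pct: int, critical_at: int = -50, warning_at: int = -25) -> str:
    """Map a change to an alert level. Both thresholds are illustrative assumptions."""
    if change_pct <= critical_at:
        return "Critical"
    if change_pct <= warning_at:
        return "Warning"
    return "OK"


counts = {
    "TOPS! Lithuania": (0, 4253),
    "Colognese 1882": (252, 1766),
    "Dell'Oglio Group": (2064, 3288),
}
for seller, (this_week, last_week) in counts.items():
    change = weekly_change_pct(this_week, last_week)
    print(f"{seller}: {change}% ({priority(change)})")
```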

A luxury fashion marketplace needed to match 2,068 Farfetch products against the Isabel Marant US catalog. 6.8 million candidate pairs were scanned. 3,795 were evaluated with text + image verification. 632 exact matches were confirmed. Here is what that process looks like.

Key questions this answers:

Which of my products are sold by competitors?

How confident is each match?

Where did automated matching fail (and human QA caught it)?

Which matches are color variants rather than exact matches?

6.8M

pairs scanned

3,795

candidates evaluated

632

exact matches

99%

accuracy after QA

Match Results — Farfetch → Isabel Marant US

| Farfetch Product | Candidate Match | Score | Method | Decision |
| --- | --- | --- | --- | --- |
| Lisia crochet dress (white, $2,850) | LISIA DRESS — Ecru ($2,850) | 100% | Text + Image | Confirmed |
| Tess asymmetrical shirt (beige, $706) | TESS SHIRT — Off White ($820) | 100% | Text + Attr | Confirmed |
| Nya leather slingback sandals (black, $575) | Nya Sandals — Black ($345) | 98% | Text + Image | Confirmed |
| Silao bucket bag (black, $1,290) | SILAO BAG — Black ($903) | 98% | Text + Image | Confirmed |
| Izae ruffled trim cardigan (ecru, $650) | IZAE CARDIGAN — Ecru ($650) | 100% | Text + Image | Confirmed |
| Oskan Moon suede shoulder bag (camel, $840) | OSKAN MOON — Taupe ($990) | 90% | Text + Image | Variant |
| 55mm Dalby draped leather boots (camel, $990) | DALBY BOOTS — Brown ($990) | 95% | Text + Image | Variant |
| 40mm Duerto cowboy boots (black, $663) | Duerto Cowboy Boots — Black ($690) | 60% | Text + Image | Rejected |
| Madinea floral-print blouse (purple, $1,118) | LIRHETTA TOP — Crushed Berry ($285) | 20% | Text + Image | Different |
| Zael wool-blend midi dress (black, $995) | Meloe dress — Ecru ($654) | 20% | Text only | No match |

↑ Highlighted: Oskan Moon at 90% — same product line, different color (camel vs taupe). Dalby Boots at 95% — same silhouette, different shade. Both classified as color variants, not exact matches. Row 8: Duerto boots at 60% — same model name but different material and design details. Automated systems accept 60%+ matches. Human QA catches the difference.

4-method matching process

1. Text

Product names, brand, model. Fast but cannot distinguish accessories from base products or color variants.

2. Image

Visual product photo comparison. Catches color and model variants that text matching misses.

3. Attributes

Size, weight, specs, material. Catches near-misses like suede vs leather versions of the same bag.

4. Human QA

Every match scoring 60–98% goes to human review. The 90% Oskan Moon was correctly classified as a variant here, not a false exact match.
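To illustrate why text similarity alone is not enough, the sketch below combines a text score with an attribute comparison. It uses Python's standard `difflib.SequenceMatcher` as a stand-in scorer. The actual matching models are not described in this report, the 0.85 threshold is an assumption, and image comparison is left out entirely.

```python
from difflib import SequenceMatcher


def text_similarity(a: str, b: str) -> float:
    """Case-insensitive similarity in [0, 1] using difflib (a stand-in scorer)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def classify_pair(name_a, name_b, attrs_a, attrs_b, threshold=0.85):
    """Text decides whether a pair is a candidate; attributes decide exact vs variant."""
    if text_similarity(name_a, name_b) < threshold:
        return "no match"
    if attrs_a == attrs_b:
        return "exact"
    return "variant"


farfetch = {"color": "camel", "material": "suede"}
official = {"color": "taupe", "material": "leather"}

score = text_similarity("Oskan Moon suede shoulder bag", "OSKAN MOON shoulder bag")
decision = classify_pair("Oskan Moon suede shoulder bag", "OSKAN MOON shoulder bag",
                         farfetch, official)
print(f"text similarity {score:.2f} -> {decision}")
```

The names score high on text alone, yet the attribute check keeps the pair out of the exact-match bucket.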

How It Works

Matching pipeline — from 6.8M pairs to 632 matches

Step 1: Scan all possible pairs

2,068 × ~3,300 = 6.8M candidate combinations

Step 2: Text + image filter

6.8M → 3,795 candidates above similarity threshold

Step 3: Attribute verification

Color, material, size, design details checked

Step 4: Human QA on 60–98% matches

→ 632 exact matches confirmed (30.6%)

→ 393 classified as color/size variants (19.0%)

→ 1,043 no match on Isabel Marant US (50.4%)

50.4% had no match — products on Farfetch that Isabel Marant US doesn't carry. That gap is itself a competitive insight: which products are available through resellers but not the official US store.
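The funnel percentages and the human-QA band can be checked with a short sketch. The 60–98% review band comes from the process described above. What happens below 60% (auto-reject) and above 98% (auto-accept) is an assumption, and `route` is a hypothetical name.

```python
def route(score_pct: int) -> str:
    """Decide where a scored candidate goes next."""
    if score_pct < 60:
        return "auto-reject"   # assumption: below the review band
    if score_pct <= 98:
        return "human QA"      # stated band: every match 60-98% is reviewed
    return "auto-accept"       # assumption: above the review band


farfetch_products = 2068
outcomes = {"exact match": 632, "color/size variant": 393, "no match": 1043}

# Every Farfetch product ends up in exactly one bucket.
assert sum(outcomes.values()) == farfetch_products

shares = {label: round(count / farfetch_products * 100, 1)
          for label, count in outcomes.items()}
for label, share in shares.items():
    print(f"{label}: {share}%")
```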

Edge Case

Why automated matching fails at scale

"Oskan Moon suede shoulder bag" and "OSKAN MOON shoulder bag" score 90% on text similarity. Automated systems call that a match. But the Farfetch version is camel brown suede. The Isabel Marant US version is taupe leather. Different color, different material — a variant, not an exact match. If your pricing decisions depend on correct matches, automated matching silently corrupts your price comparisons.

A workwear brand tracked Buy Box ownership across Amazon US. The question is not just "who is winning" — it is "why are they winning, and what is the price gap I need to close?"

Key questions this answers:

Who is winning the Buy Box on each product?

What is the exact price gap I need to close?

Are unauthorized sellers stealing my Buy Box?

products tracked

seller listings

$0.84

smallest winning gap

unauthorized sellers

Buy Box Ownership — Amazon US

| Product | ASIN | Seller | Price | Buy Box | Gap to Win |
| --- | --- | --- | --- | --- | --- |
| Portwest FR94 Coverall | B07K2K1CMQ | Amazon.com | $79.83 | Winner | — |
| Portwest FR94 Coverall | B07K2K1CMQ | Sheffield Supply | $80.60 | — | −$0.77 |
| Portwest FR94 Coverall | B07K2K1CMQ | WorkwearDirect | $78.99 | — | +$0.84 |
| Portwest US440 Jacket | B074N92T9X | Amazon.com | $19.99 | Winner | — |
| Portwest US440 Jacket | B074N92T9X | Kilronan Safety | $29.99 | — | −$10.00 |
| Portwest UH445 Hi-Vis | B01M66681Y | Kilronan Safety | $48.99 | Winner | — |
| Portwest UH445 Hi-Vis | B01M66681Y | Atlantic Safety | $47.99 | — | +$1.00 |
| Bizweld FR Jacket | B07DEF9012 | Sheffield Supply | $89.99 | Winner | — |
| Bizweld FR Jacket | B07DEF9012 | Amazon.com | $92.50 | — | −$2.51 |

↑ Highlighted: WorkwearDirect is $0.84 cheaper but does not hold the Buy Box. Price is not the only factor — seller rating, fulfillment method, and stock history matter too.
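In the table, "Gap to Win" is the Buy Box winner's price minus the seller's price, so a positive gap means the seller is already cheaper and still not winning. A minimal sketch using the FR94 Coverall rows (the function and variable names are hypothetical):

```python
from decimal import Decimal

# (seller, price, holds Buy Box) for Portwest FR94 Coverall, ASIN B07K2K1CMQ
offers = [
    ("Amazon.com", Decimal("79.83"), True),
    ("Sheffield Supply", Decimal("80.60"), False),
    ("WorkwearDirect", Decimal("78.99"), False),
]


def gaps_to_win(offers):
    """Winner's price minus each non-winning seller's price."""
    winner_price = next(price for _, price, wins in offers if wins)
    return {seller: winner_price - price for seller, price, wins in offers if not wins}


gaps = gaps_to_win(offers)
# Sellers priced below the winner: price alone does not explain the outcome.
cheaper_but_losing = [seller for seller, gap in gaps.items() if gap > 0]

print(gaps)
print(cheaper_but_losing)
```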

Why This Matters

What Buy Box tracking reveals over time

A single snapshot tells you who is winning. Weekly tracking tells you patterns: which sellers consistently undercut, when ownership shifts, and whether unauthorized sellers are gaining ground. One brand found that 3 unauthorized sellers were rotating Buy Box wins by cycling prices — a pattern only visible with historical data.

A wholesale apparel distributor tracks size-level pricing across 11 competitor sites. Same product, same color, different sizes — and prices change at the size break. If you track at product level, you miss the variant-level competition.

Key questions this answers:

What is the price for each size/color variant?

Which variants are out of stock at which sites?

Where are size-level pricing premiums inconsistent?

11

competitor sites

size variants

$7.07

largest size premium

Monthly

delivery cycle

Variant Pricing — Bella+Canvas 3719 Pullover Hoodie (Black)

| Site | Style | S–XL | 2XL | 3XL |
| --- | --- | --- | --- | --- |
| JiffyShirts | 3719 | $26.25 | $30.29 | $33.32 |
| ShirtMax | 3719 | $26.54 | $29.50 | $32.42 |
| BlankStyle | 3719 | $23.02 | $25.58 | $28.12 |
| ShirtSpace | 3719 | $23.85 | $26.92 | $29.59 |

↑ Highlighted: BlankStyle is cheapest at every size — $23.02 (base) vs JiffyShirts $26.25. That is a $3.23 gap at S–XL that widens to $5.20 at 3XL.

Variant Pricing — Bella+Canvas 3739 Full-Zip Hoodie (Black)

| Site | Style | S–XL | 2XL | 3XL |
| --- | --- | --- | --- | --- |
| JiffyShirts | 3739 | $30.29 | $33.32 | — |
| ShirtMax | 3739 | $28.20 | $31.16 | — |
| BlankStyle | 3739 | $24.46 | $27.02 | $29.56 |
| ShirtSpace | 3739 | $25.36 | $28.01 | $30.64 |

JiffyShirts and ShirtMax do not carry 3XL in this style. BlankStyle and ShirtSpace do. If you sell 3XL, only 2 of 4 competitors offer it — that is pricing power.

Why This Matters

Why variant-level tracking matters

Product-level pricing shows "Pullover Hoodie: $26.25." But the real competitive picture is: $26.25 at S–XL, $30.29 at 2XL, $33.32 at 3XL. A competitor who is cheaper at S–M but expensive at 2XL+ has a different strategy than one who is cheaper across all sizes. Variant tracking surfaces this — product-level tracking hides it.
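The size-level picture can be reproduced from the pullover hoodie prices above. This sketch computes each site's premium over its S–XL price and the gap between two sites at each size. The dictionary layout and function names are illustrative assumptions, not a delivered schema.

```python
from decimal import Decimal

# Bella+Canvas 3719 Pullover Hoodie (Black), prices per size band.
pullover_3719 = {
    "JiffyShirts": {"S-XL": Decimal("26.25"), "2XL": Decimal("30.29"), "3XL": Decimal("33.32")},
    "ShirtMax": {"S-XL": Decimal("26.54"), "2XL": Decimal("29.50"), "3XL": Decimal("32.42")},
    "BlankStyle": {"S-XL": Decimal("23.02"), "2XL": Decimal("25.58"), "3XL": Decimal("28.12")},
    "ShirtSpace": {"S-XL": Decimal("23.85"), "2XL": Decimal("26.92"), "3XL": Decimal("29.59")},
}


def size_premium(site: str, size: str, base: str = "S-XL") -> Decimal:
    """How much more a site charges for `size` than for its base size band."""
    return pullover_3719[site][size] - pullover_3719[site][base]


def gap_between(site_a: str, site_b: str, size: str) -> Decimal:
    """Price difference between two sites at the same size."""
    return pullover_3719[site_a][size] - pullover_3719[site_b][size]


largest_premium = max(size_premium(site, "3XL") for site in pullover_3719)
print(largest_premium)
print(gap_between("JiffyShirts", "BlankStyle", "S-XL"),
      gap_between("JiffyShirts", "BlankStyle", "3XL"))
```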

How It Works

Quantity tier pricing (captured per variant)

Some wholesalers offer quantity discounts per size. Real data from Needen:

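| Product | Size | Qty 1 | Qty 12 | Qty 72 | Qty 144 | Qty 288 | Qty 576 |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Adult Contrast Hoodie | | $16.74 | $16.23 | $15.72 | $15.22 | $14.71 | $14.46 |
| Adult Contrast Hoodie | | $16.74 | $16.23 | $15.72 | $15.22 | $14.71 | $14.46 |
| Adult Contrast Hoodie | XL | $17.90 | $17.35 | $16.80 | $16.25 | $15.70 | $15.45 |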

XL premium is $1.16 at qty 1 but only $0.99 at qty 576. We capture every tier the site offers.
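The XL premium at each quantity tier is the XL price minus the price of the smaller sizes at the same tier. A short sketch with the figures above (the size labels for the two lower-priced rows are not shown in the sample, so they are called `base_row` here as an assumption):

```python
from decimal import Decimal

tiers = (1, 12, 72, 144, 288, 576)

# Adult Contrast Hoodie: lower-priced size row and the XL row, per quantity tier.
base_row = [Decimal(p) for p in ("16.74", "16.23", "15.72", "15.22", "14.71", "14.46")]
xl_row = [Decimal(p) for p in ("17.90", "17.35", "16.80", "16.25", "15.70", "15.45")]

premium_by_qty = {qty: xl - base for qty, base, xl in zip(tiers, base_row, xl_row)}

for qty, premium in premium_by_qty.items():
    print(f"qty {qty}: XL premium ${premium}")
```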

Every deliverable includes

Schema Stability

You define the fields. Column names do not change without sign-off. 3+ business days notice. Version tracking on every change.
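On the receiving side, a schema guarantee is easy to verify before a file is loaded. The sketch below compares a delivered CSV header against an agreed column list. The column names are taken from the competitor price report above, while `schema_diff` and `read_header` are hypothetical helpers, not part of any delivered tooling.

```python
import csv
import io

# Agreed column names, taken from the competitor price report sample.
EXPECTED_COLUMNS = ["Your SKU", "Product", "Competitor", "Your Price", "Comp Price",
                    "Δ%", "Stock", "URL", "Timestamp"]


def read_header(csv_text: str) -> list:
    """Return the first row of a CSV document as a list of column names."""
    return next(csv.reader(io.StringIO(csv_text)))


def schema_diff(header, expected=EXPECTED_COLUMNS) -> dict:
    """List columns that are missing from, or unexpected in, a delivered file."""
    return {
        "missing": [col for col in expected if col not in header],
        "unexpected": [col for col in header if col not in expected],
    }


good = "Your SKU,Product,Competitor,Your Price,Comp Price,Δ%,Stock,URL,Timestamp\n"
renamed = "Your SKU,Product,Competitor,Your Price,Competitor Price,Δ%,Stock,URL,Timestamp\n"

print(schema_diff(read_header(good)))
print(schema_diff(read_header(renamed)))
```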

Delivery Formats

CSV · Excel · JSON · Google Sheets · S3 · BigQuery · Snowflake · Redshift · SFTP · API

Response Times

4hr

Routine

2hr

Data

1hr

Critical

What you will actually receive


Request sample data from your actual sites