WebCategorize

Workflow Integration

Automatically classify the content of every page you capture. After each screenshot job, ScreenshotCenter sends the page's HTML to WebCategorize and stores the AI-powered classification result alongside the capture.

What is WebCategorize?

WebCategorize is a real-time web content classification API. It analyses HTML or plain text and returns standardised topic categories, a confidence score, the detected language, and an IAB-style content taxonomy — all in a single API call.

It is designed for developers building content moderation, ad targeting, audience segmentation, brand safety checks, and research pipelines where knowing what a page is about is as important as having a visual snapshot of it.

Visit webcategorize.com

How it works

  1. Request a screenshot with html=1

     Pass html=1 and your WebCategorize app ID in the apps parameter. ScreenshotCenter captures the page HTML alongside the screenshot image.

  2. HTML is submitted to WebCategorize

     Once the screenshot job completes, the captured HTML and page URL are forwarded to the WebCategorize /html/ endpoint using your API key.

  3. Classification result is stored in the screenshot response

     The full WebCategorize response (categories, confidence scores, language, and predictions) is available in apps[].output_data on every screenshot object.

Quick start

Request a screenshot and classify the page in one API call:

curl -G "https://api.screenshotcenter.com/api/v1/screenshot/create" \
  --data-urlencode "key=YOUR_API_KEY" \
  --data-urlencode "url=https://example.com" \
  --data-urlencode "html=1" \
  --data-urlencode "apps=my-webcategorize-app"
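
The same request can be assembled in Python. This is a minimal sketch that assumes only the endpoint and parameters shown in the curl example; build_create_url is a hypothetical helper name, and the sketch only builds the URL without sending it.

```python
from urllib.parse import urlencode

# Endpoint taken from the curl example above.
CREATE_ENDPOINT = "https://api.screenshotcenter.com/api/v1/screenshot/create"


def build_create_url(api_key: str, page_url: str, app_id: str) -> str:
    """Return the screenshot/create URL with html=1 and the app ID set.

    Hypothetical helper: it only builds the URL and performs no request.
    """
    params = {
        "key": api_key,
        "url": page_url,  # percent-encoded by urlencode
        "html": 1,        # ask ScreenshotCenter to capture the page HTML
        "apps": app_id,   # ID of the WebCategorize app from the dashboard
    }
    return f"{CREATE_ENDPOINT}?{urlencode(params)}"
```

Calling build_create_url("YOUR_API_KEY", "https://example.com", "my-webcategorize-app") yields a URL that can be passed to any HTTP client. Encoding the target URL keeps its own query string, if it has one, from being mixed up with the API parameters.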

The classification result appears under apps[].output_data in the response once the job finishes:

{
  "apps": [{
    "app_id": "my-webcategorize-app",
    "type": "webcategorize",
    "status": "finished",
    "output_data": {
      "status": "Finished",
      "language": "en",
      "predictions": [
        { "id": "WEB4-1", "name": "Technology > Software", "score": 0.94 }
      ]
    }
  }]
}
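
Reading the result out of a parsed response could look like the sketch below. It assumes the response has already been decoded into a Python dict with the shape shown above, and that an app "status" of "finished" means output_data is ready; other status values are not documented here, so they are simply treated as "no result yet". The function names are hypothetical.

```python
from typing import Optional


def top_prediction(app: dict) -> Optional[dict]:
    """Return the highest-scoring prediction for one apps[] entry, or None."""
    # Assumption: only "finished" apps carry a usable output_data.
    if app.get("status") != "finished":
        return None
    predictions = (app.get("output_data") or {}).get("predictions") or []
    return max(predictions, key=lambda p: p.get("score", 0.0), default=None)


def classifications(screenshot: dict) -> dict:
    """Map each WebCategorize app_id to its top prediction (or None)."""
    return {
        app["app_id"]: top_prediction(app)
        for app in screenshot.get("apps", [])
        if app.get("type") == "webcategorize"
    }
```

For the sample response above, classifications(response)["my-webcategorize-app"]["name"] is "Technology > Software".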

Use cases

Brand safety monitoring

Verify that ad placements and partner pages belong to expected content categories before publishing.
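
A check of this kind might be sketched as follows. The allowlist, the 0.5 score threshold, and the function name are illustrative assumptions, and the "Parent > Child" category naming is inferred from the single sample response in the quick start rather than from a documented taxonomy.

```python
def is_brand_safe(output_data: dict, allowed_top_level: set, min_score: float = 0.5) -> bool:
    """True when the best prediction's top-level category is in the allowlist.

    Assumes category names look like "Technology > Software", as in the
    sample response; the threshold is an arbitrary illustrative default.
    """
    predictions = output_data.get("predictions") or []
    if not predictions:
        return False  # no classification available: treat as not verified
    best = max(predictions, key=lambda p: p.get("score", 0.0))
    # Keep only the part before the first ">" as the top-level category.
    top_level = best.get("name", "").split(">")[0].strip()
    return best.get("score", 0.0) >= min_score and top_level in allowed_top_level
```

With the sample output_data, is_brand_safe(output_data, {"Technology"}) returns True, while an allowlist of {"Finance"} returns False. Treating a missing classification as unsafe is a conservative choice; a pipeline could instead queue such pages for manual review.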

Content moderation pipelines

Flag adult, violent, or off-topic pages automatically during large-scale crawl and archival jobs.

SEO and competitive research

Classify competitor pages and SERP results to understand topical authority and content gaps at scale.

Audience segmentation

Enrich lead and prospect data with the category and language of their website for smarter targeting.

Setup

  1. Create a free account at webcategorize.com and generate an API key.
  2. In the ScreenshotCenter dashboard, go to Apps and add a new WebCategorize integration. Paste your API key (or leave it blank to use the platform default).
  3. When requesting screenshots, include html=1 and the apps=<your-app-id> parameter.
  4. Read the classification from apps[].output_data in the screenshot response.