Threat Intelligence

Find what the search-indexed open web exposes about you — before an attacker does

Exposed files, forgotten subdomains, phishing pages, and leaked references get indexed by search engines. cloro is the open-web reconnaissance layer for defenders: automate Google dorking and SERP recon on your own domains and brand terms, get structured JSON back, and feed it into your detection pipeline. It sees what Google and the AI engines surface — not the dark web, breach databases, or your network telemetry.

4.8 · 33 reviews

Try 500 credits for free

Security and appsec teams running open-web exposure detection on cloro

99.99% uptime

1,000,000,000 monthly API calls

What is open-web threat intelligence?

Open-web threat intelligence is the practice of finding what public, search-indexed sources reveal about your organization's exposure — misconfigured hosts, forgotten documents, phishing pages impersonating you, and references leaked into indexed content. It is a subset of OSINT — in effect an OSINT API scoped to what search engines and AI answers can surface — and it is fundamentally a reconnaissance discipline: run the queries an attacker would, first, on your own scope.

It is deliberately one layer of a larger stack. It does not include the dark web, breach and credential databases, or network telemetry — those need dedicated tools. What it does cover, the search-indexed surface, is both the cheapest place for attackers to start and the one most defensive programs monitor least. cloro is the API for that layer, and it is careful to say where it stops.

Related guide

Google search operators

The operator syntax — site:, filetype:, inurl:, intitle: — that turns a search box into an exposure scanner.

See the operators

Related guide

Enumerate every URL on a domain

How to discover a domain's full search-indexed footprint — the first step of an attack-surface audit.

Read the guide

The open-web recon layer — one honest surface, clearly bounded

cloro covers exactly one thing well: what search engines and AI answers have indexed. Google organic results (with the full operator syntax), Google News, and all 7 AI answer engines — as structured JSON, on your cadence. It is not a dark-web feed, a breach-credential database, or a network sensor. Combine it with those tools; don't mistake it for them.

Why security teams build open-web recon on cloro

Attackers start reconnaissance where it's cheapest: a search box. Exposed configs, staging hosts, and phishing kits impersonating you are often one dork away — and already indexed. Defenders should run the same queries first, on a schedule, as structured data. That's the workload cloro is built for, and it stops exactly where the search index does.

Google site: operator results auditing a domain's search-indexed footprint, with Google's own 'do you own this domain?' prompt

Your attack surface includes everything Google has indexed

A site: query against your own domain is the fastest external-attack-surface audit there is — it shows every subdomain, path, and document Google has indexed, including the staging host and the PDF nobody meant to publish. Run it as an API call on a schedule, diff against last week, and a newly indexed sensitive path becomes an alert instead of a breach post-mortem. This is EASM from the search-index side. See how to enumerate every indexed URL on a domain.

Google results for an operator-scoped technical query returned as ranked, parseable entries

Google dorking is reconnaissance both sides already run

Search operators — filetype:, intitle:, inurl:, site: — turn Google into an exposure scanner. Attackers use them to find exposed assets; defenders should run the same queries first, on their own scope. cloro executes operator queries via /v1/monitor/google and returns parsed results, so a defensive dorking playbook becomes a scheduled job with structured output instead of manual searching. Authorized scope only — see the operator reference.

AI Overview answering an "is this domain legit" style query with cited sources

Exposure and reputation signals surface in AI answers too

When someone asks an AI engine "is <domain> safe?" or "what is <company> known for," the answer is assembled from live web search and cites its sources. That is a reputation surface security and comms teams now have to watch — a phishing domain that gets cited, or a breach rumor an engine repeats. cloro returns the full AI answers with their citations across ChatGPT, Perplexity, Gemini, Copilot, Grok, AI Overview, and AI Mode.

Structured JSON results array with position, URL, and label fields — open-web signals as pipeline input

An honest feed beats an over-claiming one

cloro returns open-web signals — the SERP and AI-answer presence of a domain, URL, or brand term — as structured JSON your pipeline can score. It does not pretend to be a scored reputation database, a dark-web monitor, a credential-leak service, or a threat-intel platform. Treat it as a high-signal input alongside those systems: a domain newly ranking for phishing-adjacent terms is a lead your enrichment stack confirms, not a verdict cloro issues.

From dork playbook to exposure alerts

The core recipe: your authorized scope (your domains, your brand terms) → a set of defensive dork queries → scheduled SERP snapshots → diff for newly indexed exposures → alert. Structured JSON out, your detection logic in the middle. Run it only against assets you own or are authorized to test.

Scheduled exposed-asset sweep on your own domain

python

import requests
from urllib.parse import urlparse

# AUTHORIZED SCOPE ONLY: your own domains / assets you're permitted to test.
DOMAIN = "example.com"

# Defensive dork playbook — surface exposures before an attacker does.
# Sanitized set; tune to your stack. Runs against YOUR domain only.
dorks = [
    f"site:{DOMAIN} -www",                          # subdomains + stray hosts
    f"site:{DOMAIN} filetype:pdf",                  # indexed documents
    f"site:{DOMAIN} inurl:staging OR inurl:dev",    # non-prod hosts indexed
    f"site:{DOMAIN} intitle:\"index of\"",          # open directory listings
]

findings = []
for dork in dorks:
    response = requests.post(
        "https://api.cloro.dev/v1/monitor/google",
        headers={
            "Authorization": "Bearer sk_live_your_api_key_here",
            "Content-Type": "application/json",
        },
        json={"query": dork, "country": "US", "device": "desktop"},
    )
    for r in response.json()["result"]["organicResults"]:
        findings.append({
            "dork": dork,
            "url": r["link"],
            "host": urlparse(r["link"]).netloc,
            "title": r.get("title"),
        })

# Persist to your warehouse; diff against last run for NEWLY indexed exposures,
# then route new findings to your triage queue (Jira / TheHive / Slack).
for f in findings:
    print(f)

javascript

import { URL } from "node:url";

// AUTHORIZED SCOPE ONLY: your own domains / assets you're permitted to test.
const DOMAIN = "example.com";

// Defensive dork playbook — surface exposures before an attacker does.
const dorks = [
  `site:${DOMAIN} -www`,                       // subdomains + stray hosts
  `site:${DOMAIN} filetype:pdf`,               // indexed documents
  `site:${DOMAIN} inurl:staging OR inurl:dev`, // non-prod hosts indexed
  `site:${DOMAIN} intitle:"index of"`,         // open directory listings
];

const findings = [];
for (const dork of dorks) {
  const res = await fetch("https://api.cloro.dev/v1/monitor/google", {
    method: "POST",
    headers: {
      Authorization: "Bearer sk_live_your_api_key_here",
      "Content-Type": "application/json",
    },
    body: JSON.stringify({ query: dork, country: "US", device: "desktop" }),
  });
  const result = (await res.json()).result;
  for (const r of result.organicResults) {
    findings.push({ dork, url: r.link, host: new URL(r.link).hostname, title: r.title });
  }
}

// Persist; diff against last run for NEWLY indexed exposures → triage queue.
console.log(findings);

curl

# One defensive dork against YOUR OWN domain — wrap in your scheduler.
# Authorized scope only.
curl -X POST https://api.cloro.dev/v1/monitor/google \
  -H "Authorization: Bearer sk_live_your_api_key_here" \
  -H "Content-Type: application/json" \
  -d '{
    "query": "site:example.com filetype:pdf",
    "country": "US",
    "device": "desktop"
  }' | jq '[.result.organicResults[] | {url: .link, title: .title}]'
# Diff the URL set against your last run → newly indexed exposures.

Response example

200 OK application/json

{
  "success": true,
  "result": {
    "organicResults": [],
    "relatedSearches": []
  }
}

Try 500 credits for free View documentation

Use cases

Four defensive open-web recon programs security teams run on cloro data — authorized scope only.

External attack-surface audit

Scheduled site: sweeps of your own domains surface every indexed subdomain, path, and document — diffed for newly exposed assets before an attacker finds them.

SERP API site: dorks Weekly diff

Best for: Appsec and blue teams

Phishing-page surfacing

Monitor brand-plus-login and typosquat queries for pages impersonating you — the search-visible slice of phishing, routed straight to takedown.

Brand queries Allowlist diff Takedown queue

Best for: SOC and brand security

Defensive dorking automation

Turn a manual dork playbook into a scheduled job: operator queries against your scope, parsed results, deltas into your triage queue.

Operator queries Scheduler SOAR / triage

Best for: Detection engineering

Open-web layer for EASM / TI vendors

Per-tenant keys and async webhook batching make cloro the search-indexed collection source under your attack-surface or threat-intel platform.

Per-tenant keys Async + webhooks Your platform

Best for: Threat-intel and EASM vendors

Pricing that scales with you

Pick a plan that fits your volume. Price per credit drops as you scale.

Hobby

$0.40

per 1,000 credits

$100/mo
250,000 credits
20 concurrent jobs
Email support

Starter

$0.39

per 1,000 credits

$250/mo
650,000 credits
50 concurrent jobs
Email support

Estimate your monthly cost and plan

ChatGPT (full response)

7 credits each

ChatGPT (web search)

5 credits each

Perplexity

3 credits each

Grok

4 credits each

Copilot

5 credits each

AI Mode

4 credits each

Gemini

4 credits each

Google Search

AI Overview

3 credits / 1 page

Google News

3 credits / 1 page

Monthly requests

Credits needed

Recommended plan:

Open-web threat intelligence, answered

What exactly does cloro cover — and where does it stop?+

cloro covers the search-indexed open web: what Google Search, Google News, and the AI answer engines surface for a query. That makes it a strong fit for Google dorking at scale, exposed-asset discovery, phishing-page surfacing, and open-web reputation signals. It explicitly does not cover the dark web, breach or credential-leak databases, network or endpoint telemetry, or scored threat-intel feeds. If a workload needs any of those, cloro is a complementary input — not the source. We would rather draw that line clearly than sell you a capability we don't have.

Is this legal, and what are the rules for using it?+

Querying a public search engine for content it has already indexed is a normal, legal activity — but what you do with the results is governed by authorization and law. Run reconnaissance only against assets you own or are explicitly authorized to test (your domains, an engagement in scope), respect the CFAA and your local equivalents, never access exposed data you find beyond confirming the exposure, and follow responsible-disclosure practice for anything you surface about a third party. cloro is data infrastructure; the authorization and compliance obligations sit with you. See the legal landscape for public-web data collection.

How do I automate a defensive Google dorking playbook?+

Encode your dork set as queries and run them through /v1/monitor/google on a schedule against your authorized scope. Operators like site:, filetype:, inurl:, and intitle: all work in the query string, and you get parsed results back instead of HTML to scrape (see the operator reference). Persist each run, diff against the last for newly indexed exposures, and route deltas to your triage queue. The point of automating it is coverage and cadence — a human dorking once a quarter misses what an attacker finds the week you ship a misconfiguration.

You mention a "domain reputation API" and "IOC feed" — is that a scored database?+

No — and the distinction matters. cloro returns open-web signals, not scores: the SERP and AI-answer presence of a domain or URL, as structured data. That is a genuine input to a reputation, IOC, or malicious-URL-detection pipeline — a domain suddenly ranking for phishing-adjacent brand terms, or a lookalike URL cited by an AI engine in a scam context, is a real signal — but cloro does not assign a reputation score or maintain a curated indicator list. Feed the signals into your own scoring, or into a dedicated reputation/IOC provider that does. Calling it a "feed input" is honest; calling it a "reputation database" would not be.

How does this fit alongside my existing threat-intel and EASM stack?+

As the open-web collection layer. Attack-surface-management and threat-intel platforms correlate many sources — passive DNS, certificate transparency, breach data, network scans; cloro adds the one most of them under-cover: what search engines and AI answers have indexed about your assets and brand. Pipe cloro findings into your SIEM, SOAR, or ASM platform as another enrichment source. It is deliberately a component, not a replacement — which is why it embeds cleanly rather than competing with your platform of record.

How does this relate to brand protection and adverse media?+

Same open-web infrastructure, different query sets and intent. Brand protection watches brand and typosquat terms for counterfeit and impersonation — heavy overlap with phishing-page detection here. Adverse media screening shares the security/risk persona and the entity-monitoring pattern. And the AI-answer reconnaissance angle is the defensive mirror of AI visibility tracking. One API key and one credit pool cover all of them.

What does a realistic open-web recon program cost?+

Pay-per-call: a Google SERP call is 3 credits (n=10), +2 per additional results page. A defensive program of 200 dork queries across your domains, run daily, is 200 × 30 × 3 = 18k credits/month — comfortably inside the Hobby plan ($100/month, 250k credits). Add a daily AI-answer reputation check on a set of monitored domains (ChatGPT 5 + Perplexity 3 credits per domain) and you're still well under it. Vendors running recon across many client scopes scale into Growth ($500/month, 1.35M credits) with per-tenant keys and async webhook delivery. Full breakdown on the pricing page.

We're a threat-intel or EASM vendor. Can we embed cloro?+

Yes — that's the intended shape. Issue per-tenant API keys so each customer's recon usage meters separately, and drive collection through the async endpoints (`POST /v1/monitor/google/async` and per-engine `/v1/monitor//async`) with webhook delivery — at scale, sync polling burns concurrency you don't need. You own the correlation, scoring, and analyst UI; cloro owns open-web collection at 99.99% uptime. We sell the data layer, not a competing platform.

Google search operators cheat sheet and example queries

Technical Guides

Google Search Operators: Complete List and Examples

Use Google search operators to find exact phrases, PDFs, indexed pages, mentions, and competitor content. Includes examples and power-search workflows.

Network graph showing URL discovery on a domain

Technical Guides

How to Find All URLs on a Domain: 5 Proven Methods

From sitemaps to Python crawlers. Learn every method to discover every single page on a website, including hidden endpoints and orphan pages.

Is web scraping legal — 2026 guide cover

Research

Is Web Scraping Legal? 2026 Rules (US + EU)

Yes — scraping public web data is legal in the US and EU when you respect CFAA, GDPR, robots.txt, and rate limits. 2026 guide with landmark cases, jurisdiction matrix, and EU AI Act rules.

Run the open-web recon an attacker would — first

Automate defensive dorking and SERP + AI-answer reconnaissance on the scope you're authorized to monitor. Structured JSON, your detection logic, one credit pool. The search-indexed layer, honestly bounded.

Find what the search-indexed open web exposes about you — before an attacker does

What is open-web threat intelligence?

Google search operators

Enumerate every URL on a domain

The open-web recon layer — one honest surface, clearly bounded

Why security teams build open-web recon on cloro

Your attack surface includes everything Google has indexed

Google dorking is reconnaissance both sides already run

Exposure and reputation signals surface in AI answers too

An honest feed beats an over-claiming one

From dork playbook to exposure alerts

Scheduled exposed-asset sweep on your own domain

Response example

Use cases

External attack-surface audit

Phishing-page surfacing

Defensive dorking automation

Open-web layer for EASM / TI vendors

Pricing that scales with you

Estimate your monthly cost and plan

Open-web threat intelligence, answered

Google Search Operators: Complete List and Examples

How to Find All URLs on a Domain: 5 Proven Methods

Is Web Scraping Legal? 2026 Rules (US + EU)

Run the open-web recon an attacker would — first

Google SERP API

ChatGPT

Perplexity

Gemini

Grok

Copilot

Google AI Mode

Find what the search-indexed open web exposes about you — before an attacker does

What is open-web threat intelligence?

Google search operators

Enumerate every URL on a domain

The open-web recon layer — one honest surface, clearly bounded

Why security teams build open-web recon on cloro

Your attack surface includes everything Google has indexed

Google dorking is reconnaissance both sides already run

Exposure and reputation signals surface in AI answers too

An honest feed beats an over-claiming one

From dork playbook to exposure alerts

Scheduled exposed-asset sweep on your own domain

Response example

Use cases

External attack-surface audit

Phishing-page surfacing

Defensive dorking automation

Open-web layer for EASM / TI vendors

Pricing that scales with you

Estimate your monthly cost and plan

Open-web threat intelligence, answered

Deep dives on open-web reconnaissance

Google Search Operators: Complete List and Examples

How to Find All URLs on a Domain: 5 Proven Methods

Is Web Scraping Legal? 2026 Rules (US + EU)

Run the open-web recon an attacker would — first