URL Scanner
To better understand Internet usage around the world, use Cloudflare's URL Scanner. With Cloudflare's URL Scanner, you have the ability to investigate the details of a domain, IP, URL, or ASN. Cloudflare's URL Scanner is available in the Security Center of the Cloudflare dashboard, Cloudflare Radar ↗ and the Cloudflare API.
To make your first URL scan using the API, you must obtain a URL Scanner specific API token. Create a Custom Token with Account > URL Scanner in the Permissions group, and select Edit as the access level.
Once you have the token, and you know your account_id
, you are ready to make your first request to the API at https://api.cloudflare.com/client/v4/accounts/{account_id}/urlscanner/
.
To submit a URL to scan, the only required information is the URL to be scanned in the POST
request body:
By default, the report will have a Public
visibility level, which means it will appear in the recent scans ↗ list and in search results. It will also include a single screenshot with desktop resolution.
A successful response will have a status code of 200
and be similar to the following:
The uuid
property in the response above identifies the scan and will be required when fetching the scan report.
Here's an example request body with some custom configuration options:
Above, the visibility level is set as Unlisted
, which means that the scan report won't be included in the recent scans ↗ list nor in search results. In effect, only users with knowledge of the scan ID will be able to access it.
There will also be three screenshots taken of the webpage, one per target device type. The User-Agent
↗ will be set as "XXX-my-user-agent". Note that you can set any custom HTTP header, including Authorization ↗.
Once the URL Scan submission is made, the current progress can be checked by calling https://api.cloudflare.com/client/v4/accounts/{account_id}/urlscanner/v2/result/{scan_id}
. The scan_id
will be the uuid
value returned in the previous response.
While the scan is in progress, the HTTP status code will be 404
; once it is finished, it will be 200
. Cloudflare recommends that you poll every 10-30 seconds.
The response will include, among others, the following top properties:
task
- Information on the scan submission.page
- Information pertaining to the primary response, for example IP address, ASN, server, and page redirect history.data.requests
- Request chains involved in the page load.data.cookies
- Cookies set by the page.data.globals
- Non-standard JavaScript global variables.data.console
- Console logs.data.performance
- Timings as given by thePerformanceNavigationTiming
↗ interface.meta
- Meta processors output including detected technologies, domain and URL categories, rank, geolocation information, and others.lists.ips
- IPs contacted.lists.asns
- AS Numbers contacted.lists.domains
- Hostnames contacted, includingdns
record information.lists.hashes
- Hashes of response bodies, of the main page HTML structure, screenshots, and favicons.lists.certificates
- TLS certificates of HTTP responses.verdicts
- Verdicts on malicious content.
Some examples of more specific properties include:
task.uuid
- ID of the scan.task.url
- Submitted URL of the scan. May differ from final URL (page.url
) if there are HTTP redirects.task.success
- Whether scan was successful or not. Scans can fail for various reasons, including DNS errors.task.status
- Current scan status, for example,Queued
,InProgress
, orFinished
.meta.processors.domainCategories
- Cloudflare categories of the main hostname contacted.meta.processors.phishing
- What kind of phishing, if any, was detected.meta.processors.radarRank
- Cloudflare Radar Rank ↗ of the main hostname contacted.meta.processors.wappa
- The kind of technologies detected as being in use by the website, with the help of Wappalyzer ↗.page.url
- URL of the primary request, after all HTTP redirects.page.country
- GeoIP country name of the main IP address contacted.page.history
- Main page history, including any HTTP redirects.page.screenshot
- Various hashes of the main screenshot. Can be used to search for sites with similar screenshots.page.domStructHash
- HTML structure hash. Use it to search for sites with similar structure.page.favicon.hash
- MD5 hash of the favicon.verdicts.overall.malicious
- Whether the website was considered malicious at the time of the scan. Please check the remaining properties for each subsystem(s) for specific threats detected.
The Get URL Scan API endpoint documentation contains the full response schema.
To fetch the scan's screenshots or full network log refer to the corresponding endpoints' documentation.
Use a subset of ElasticSearch Query syntax to filter scans. Search results will include Public
scans and your own Unlisted
scans.
To search for scans to the hostname google.com
, use the query parameter q=page.domain:"google.com"
:
If, instead, you wanted to search for scans that made at least one request to the hostname cdnjs.cloudflare.com
, for example sites that use a JavaScript library hosted at cdnjs.cloudflare.com
, use the query parameter hostname=cdnjs.cloudflare.com
:
Some other example queries:
task.url:"https://google.com" OR task.url:"https://www.google.com"
: Search for scans whose submitted URL was eithergoogle.com
orwww.google.com
. URLs must be enclosed in quotes.page.url:"https://google.com" AND NOT task.url:"https://google.com"
: Search for scans togoogle.com
whose submitted URL was notgoogle.com
(that is, sites that redirected to google.com).page.domain:microsoft AND verdicts.malicious:true AND NOT page.domain:microsoft.com
: Malicious scans whose hostname starts withmicrosoft
. Would match domains likemicrosoft.phish.com
.apikey:me AND date:[2024-01 TO 2024-10]
: Your scans from January 2024 to October 2024.page.domain:(blogspot OR www.blogspot)
: Searches for scans whose main domain starts withblogspot
or withwww.blogspot
.date:>now-7d AND path:okta-sign-in.min.js
: Scans from the last seven days with any request path that ends withokta-sign-in.min.js
.page.asn:AS24940 AND hash:-557369673
: Websites hosted in AS24940 where a resource with the given hash was retrieved.hash:8f662c2ce9472ba8d03bfeb8cdae112dbc0426f99da01c5d70c7eb4afd5893ca
: Using the hash atpage.domStructHash
search for other scans with the same HTML structure hash.
Go to Search URL scans in the API documentation for the full list of available options.
Alternatively, you can search in the Security Center:
- Log in to the Cloudflare dashboard ↗ and select your account.
- Go to Security Center > Investigate.
- Enter your query and select Search.
In the Security Center, you can retrieve information already pre-filtered by:
- Similar screenshot
- Identical favicon
- Similar favicon
- Similar HTML structure
- Identical ASN
- Identical IP
- Identical domain
- Identical final URL (after all redirections)