The Whitespace Analysis page provides an overview and insights for the search criteria you enter. Analysis information displayed on this page can be searched and sorted in multiple ways, providing a dynamic and flexible interface well suited to AI/ML prior art searches, competitive landscape monitoring, identifying underexplored technology areas conducive to R&D innovation, and more.
Analysis and insights available on this page are generated based on user-defined focus keywords, CPC filters, and date ranges. The overview section provides a high-level summary of key metrics, while the tables and charts allow for in-depth exploration of patent filings relevant to the specified criteria. For example, key metrics include subject matter saturation, patent and publication activity rates and momentum, and CPC trends for specific search criteria and semantically similar concepts.
For any combination of focus keywords, CPC filters, and date range, the overview performs the following steps:
At least one focus input (keywords or CPC) is required. The default window covers the past 24 months, anchored to the end date.
Use comma-separated words and/or phrases that describe the AI/ML subject matter of interest. The analysis reflects keyword, key phrase, and semantic search matches (when enabled) found in the title, abstract, and claims of patents and publications.
Example: foundation models, multi-modal reasoning, retrieval augmented generation
Concentrate the target search set in a specific technology area with CPC classification codes. Supports partial codes such as G06N and full designations like G06F17/30.
Example: G06N20/00, A61B5, G06V, G06K9/00
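As an illustration of how partial and full CPC codes could be matched against a document's classification, here is a minimal Python sketch. The function names and the matching rules are assumptions made for this example, not Patent Scout's actual logic: codes of up to four characters are treated as prefixes, a main group such as A61B5 matches only codes in that group, and a full designation must match exactly.

```python
def normalize_cpc(code: str) -> str:
    """Strip whitespace and upper-case a CPC code, e.g. 'g06n 20/00' -> 'G06N20/00'."""
    return "".join(code.split()).upper()


def matches_cpc(document_cpc: str, cpc_filter: str) -> bool:
    """Return True when a document's CPC code falls under the filter (assumed rules)."""
    doc = normalize_cpc(document_cpc)
    flt = normalize_cpc(cpc_filter)
    if "/" in flt:
        # Full designation such as G06F17/30: assume an exact match is required.
        return doc == flt
    if len(flt) > 4:
        # Subclass plus main group such as A61B5: match A61B5/... but not A61B50/...
        return doc.startswith(flt + "/")
    # Section, class, or subclass such as G06N: plain prefix match.
    return doc.startswith(flt)


def matches_any(document_cpc: str, filters: str) -> bool:
    """Check a document code against a comma-separated filter string."""
    return any(matches_cpc(document_cpc, f) for f in filters.split(",") if f.strip())
```

The main-group check matters because a naive prefix test would let A61B5 also match the unrelated group A61B50.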
Restrict the result set to a specific time range corresponding to patent grant date or publication date. Empty fields fall back to the full data set in Patent Scout's database. When only an end date is provided, the start defaults to 23 months earlier.
Example: From 2023-07-01, To 2025-06-30
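The 23-month default can be worked through in a few lines of Python. This is a sketch only: the function name `default_start` is hypothetical, and snapping the start to the first day of its month is an assumption inferred from the example dates above.

```python
from datetime import date


def default_start(end: date, months_back: int = 23) -> date:
    """Return the first day of the month that lies `months_back` months before `end`."""
    # Count months from year 0 so the subtraction can cross year boundaries.
    month_index = end.year * 12 + (end.month - 1) - months_back
    # Assumption: the window starts on the first day of the resulting month.
    return date(month_index // 12, month_index % 12 + 1, 1)


start = default_start(date(2025, 6, 30))
print(start)  # 2023-07-01
```

Counting the start month and the end month together, this yields a 24-month window, which matches the default window described earlier.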
When enabled, whitespace analysis finds semantic nearest neighbors (based on an embedding index) and merges them with exact keyword and phrase matches.
Default: Enabled
Loads more complex, weighted whitespace signals calculated per assignee. Includes a context graph, assignee signal cards, and a Sigma visualization beneath the overview.
Default: Disabled
Exact, semantic, and total distinct publication counts inside the window.
Average filings per month plus the observed min/max band.
Slope of the monthly time series and CAGR over the window.
Highest volume CPC codes among matched filings.
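The overview metrics above can be approximated with a short Python sketch. Everything here is an assumption for illustration rather than Patent Scout's actual calculation: the function names are hypothetical, the slope is an ordinary least-squares fit against the month index, and CAGR is annualized from the first and last monthly counts.

```python
from collections import Counter


def match_counts(exact_ids, semantic_ids):
    """Exact, semantic, and total distinct counts from two lists of document ids."""
    exact, semantic = set(exact_ids), set(semantic_ids)
    return {"exact": len(exact), "semantic": len(semantic),
            "total_distinct": len(exact | semantic)}


def activity_rate(monthly_counts):
    """Average filings per month plus the observed min/max band."""
    return {"average": sum(monthly_counts) / len(monthly_counts),
            "min": min(monthly_counts), "max": max(monthly_counts)}


def slope(monthly_counts):
    """Least-squares slope of monthly counts against the month index."""
    n = len(monthly_counts)
    if n < 2:
        return 0.0
    mean_x = (n - 1) / 2
    mean_y = sum(monthly_counts) / n
    num = sum((x - mean_x) * (y - mean_y) for x, y in enumerate(monthly_counts))
    den = sum((x - mean_x) ** 2 for x in range(n))
    return num / den


def cagr(monthly_counts):
    """Annualized growth from the first to the last month; None if undefined."""
    first, last = monthly_counts[0], monthly_counts[-1]
    years = (len(monthly_counts) - 1) / 12
    if first <= 0 or years <= 0:
        return None  # growth rate is undefined from a zero base or a single month
    return (last / first) ** (1 / years) - 1


def top_cpc_codes(cpc_codes, limit=5):
    """Highest-volume CPC codes among matched filings, as (code, count) pairs."""
    return Counter(cpc_codes).most_common(limit)
```

An endpoint-based CAGR is sensitive to a single unusual first or last month, so reading it alongside the slope gives a steadier picture of momentum.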
Plots monthly publication counts across the selected window. Hover in the UI to inspect the exact month totals. Sharp inflections may indicate changes in momentum.
Ranks CPC codes by patent and publication volume. A shorter bar generally corresponds to a less explored technology area, whereas a longer bar may suggest a more developed or saturated technology area.
Summaries for the last 6, 12, 18, and 24 months. Use these summaries to compare near-term patent and publication velocity against historical averages.
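One way to picture the trailing-window comparison is the following Python sketch. The function name, the output fields, and the ratio against the full-window average are assumptions chosen for illustration.

```python
def trailing_summaries(monthly_counts, horizons=(6, 12, 18, 24)):
    """Summarize the most recent N months and compare them with the overall average."""
    overall_average = sum(monthly_counts) / len(monthly_counts)
    summaries = {}
    for months in horizons:
        window = monthly_counts[-months:]  # the last `months` entries (or fewer if unavailable)
        average = sum(window) / len(window)
        summaries[months] = {
            "total": sum(window),
            "monthly_average": average,
            # A ratio above 1.0 means recent velocity exceeds the historical average.
            "vs_overall": average / overall_average if overall_average else None,
        }
    return summaries


# 18 quiet months followed by 6 busier months.
counts = [1] * 18 + [3] * 6
print(trailing_summaries(counts)[6]["vs_overall"])  # 2.0
```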
The result set table lists up to 1000 patents and publications per target search set and can be sorted by recency, relevance, or assignee name. Click any patent/publication number to open the document in a new tab.
The result set table can be exported as a PDF (up to 1000 patents and publications) for later reference and review. The exported PDF includes the overview and analysis displayed above on the page.
Switching on “Group by Assignee” augments the whitespace analysis with a per-assignee clustering view. More complex, weighted signals are calculated from semantic embeddings, which are used to build a cosine KNN graph and evaluate four signals per grouping:
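As an illustration of the graph-construction step only (not the signals themselves), a cosine KNN graph over embeddings could be built as in the Python sketch below. The function names, the value of `k`, and the toy two-dimensional vectors are all assumptions, and the brute-force comparison stands in for whatever index the product actually uses.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 for a zero vector)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def cosine_knn_graph(embeddings, k=2):
    """Map each document id to its k most similar neighbors as (id, similarity) pairs."""
    graph = {}
    for doc_id, vector in embeddings.items():
        scored = [
            (cosine_similarity(vector, other), other_id)
            for other_id, other in embeddings.items()
            if other_id != doc_id
        ]
        # Highest similarity first; ties broken by id for a stable ordering.
        scored.sort(key=lambda pair: (-pair[0], pair[1]))
        graph[doc_id] = [(other_id, sim) for sim, other_id in scored[:k]]
    return graph


# Toy embeddings: "a" and "b" point in nearly the same direction, "c" does not.
toy = {"a": [1.0, 0.0], "b": [0.9, 0.1], "c": [0.0, 1.0]}
print(cosine_knn_graph(toy, k=1)["a"][0][0])  # b
```

The brute-force loop compares every pair of documents, which is fine for a sketch but would normally be replaced by an approximate nearest-neighbor index at scale.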
Toggle on “Group by Assignee” to generate analysis and insights scoped to particular entities (e.g., competitors or investors in the AI/ML space).
Shift the end date to align with product launches or regulatory moments. Comparing periods highlights whether filings are accelerating into that milestone.
A small gap between exact-match results and semantic-search results indicates the target search set is well aligned with conventional terminology used across the domain. A large gap indicates that the relevant concepts are often expressed in different wording than the target search set. That is, the domain uses diverse terminology or synonyms not captured by the literal query. Expanding the keywords and phrases (e.g., with synonyms or abbreviations) and/or adding CPC filters can help refine the target search set.
When small keyword changes cause noticeable shifts in the CPC distribution, the overall concept likely spans multiple technology areas. Depending on the goal, this may be a signal to explore the concept in more granular clusters.
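For the tip on comparing exact-match and semantic-search results, one rough way to put a number on the gap is sketched below in Python. The function name and the relative-gap formula are assumptions for illustration; the page itself does not define a specific gap metric.

```python
def terminology_gap(exact_count: int, semantic_count: int) -> float:
    """Relative gap between exact and semantic counts, from 0.0 (identical) to 1.0."""
    larger = max(exact_count, semantic_count)
    if larger == 0:
        return 0.0  # no matches either way, so there is no gap to report
    return abs(semantic_count - exact_count) / larger


print(terminology_gap(80, 100))  # 0.2, wording is close to the domain's usual terms
print(terminology_gap(20, 100))  # 0.8, consider adding synonyms or CPC filters
```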
Solution: Verify at least one keyword or CPC is provided. Try expanding the date range or disabling semantic neighbors if the query is very niche.
Solution: Check the timeline sparkline for month-to-month variability. Extending the window or adding semantic neighbors can surface more insight.
Solution: Ensure “Group by Assignee” is toggled on and the latest run has completed. Some narrow scopes may lack a sufficient number of patents and publications per assignee to expose a signal that satisfies a minimum level of confidence.