Problem reproducing graphs/aggregates based on Patentscope results

classic Classic list List threaded Threaded
2 messages Options
Reply | Threaded
Open this post in threaded view
|

Problem reproducing graphs/aggregates based on Patentscope results

andybu
Hi there,

I have been trying to find the underlying patent data that allows me to reproduce the following graph by the World Bank (querying "Patent applications, residents" and "South Africa"):

https://data.worldbank.org/indicator/IP.PAT.RESD?locations=ZA

Since I wasn't able to reproduce anything like this using Lens.org and other patent databases, I reached out to the World Bank for help. They reproduced the same graph via the "WIPO IP Statistics Data Center" (https://www3.wipo.int/ipstats/editIpsSearchForm.htm?tab=patent), see screenshot here:



They also said that the data came from the WIPO and is searchable via Patentscope.

My problem is that I cannot produce the raw data via Patentscope. When I query Patentscope, I  would assume something like ARE:(ZA) AND AD:(1998) return 200 patent applications from South African residents (see 1998 in screenshot above).

However, for 1998, Patentscope only returns 158 applications. From 1980 till today, Patentscope finds fewer than 10k applications for ARE:(ZA), while the World Bank aggregated a total of 58,773 applications over the same period. I tried different filters etc., but just do not get anywhere closer.

My questions are:

Can anyone think of (a) a possible explanation for the differences between aggregates in the graphs and the Patentscope results, and, if the issue lies with Patentscope, (b) alternative databases that would allow me to download patent data that, when aggregated, more closely resemble the numbers in the graphs?

Every hint is appreciated. Thank you very much.

Best wishes,
Andy
Reply | Threaded
Open this post in threaded view
|

Re: Problem reproducing graphs/aggregates based on Patentscope results

Iustin
Administrator
1. The aggregate data in WIPO’s statistics data center is received as such by the office of origin, it is not based on PATENTSCOPE data.
2. The data in WIPO’s statistics data center contains all filings by an office for a given year, regardless of their publication status.
3. The data in PATENTSCOPE is also received by the office of origin and contains only published data.
4. Taking in consideration 2&3 it is clear that the numbers will differ with the statistics number on the higher end
5. However the numbers that PATENTSCOPE returns for the given queries are very low compared to reality even if it is only published data. The reason for this is that the residence information is only available for 20% of the South African data

Regards,
PATENTSCOPE Team