Thursday, October 9, 2014

Differences of inventors within the same docdb family (part I)

Building the full list of inventors who contributed to an innovation is not a trivial task.
This is especially true if by innovation we mean not a single application but a patent family: appending all the person_ids of all applications belonging to the family would surely lead to undetected duplication of names (e.g. due to different spellings or addresses at different application authorities).
One alternative is to take only the inventors of a single application (e.g. the oldest one, or the one where the data are most likely to be complete, for instance the EPO application).
In this case, however, we may get an incomplete recall of inventors if one or more inventors are changed, amended or added across the applications.

One way to validate this idea is to compute, for each family, the difference between the minimum and the maximum number of inventors across its applications. If that difference is zero for most families, it suggests that in most cases the list of inventors remains the same.
The counts are shown below: over 95% of docdb families have the same number of inventors in all their applications.





delta         n families    %
0             36.048.365    95,523%
1                859.567     2,278%
2                413.670     1,096%
3                206.235     0,546%
4                101.529     0,269%
5                 48.545     0,129%
6                 25.400     0,067%
7                 13.372     0,035%
8                  7.775     0,021%
9                  4.432     0,012%
10                 2.972     0,008%
11                 1.697     0,004%
12                 1.122     0,003%
13                   836     0,002%
14                   580     0,002%
15 or more         1.661     0,004%
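
The delta counted in the table can be sketched as follows. This is only a minimal illustration on toy data, not the query actually used for the figures above. The table names appln and pers_appln are simplified, hypothetical stand-ins for the corresponding PATSTAT tables, and the rule that a person counts as an inventor when invt_seq_nr > 0 is an assumption of this sketch.

```python
import sqlite3
from collections import Counter

# In-memory toy database; table and column names are hypothetical
# simplifications of the PATSTAT application and person-application tables.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE appln (appln_id INTEGER PRIMARY KEY, docdb_family_id INTEGER);
CREATE TABLE pers_appln (person_id INTEGER, appln_id INTEGER, invt_seq_nr INTEGER);
""")

conn.executemany("INSERT INTO appln VALUES (?, ?)", [
    (1, 100), (2, 100),  # family 100: two applications
    (3, 200), (4, 200),  # family 200: two applications
    (5, 300),            # family 300: a single application
])
conn.executemany("INSERT INTO pers_appln VALUES (?, ?, ?)", [
    (11, 1, 1), (12, 1, 2),              # application 1: 2 inventors
    (21, 2, 1), (22, 2, 2),              # application 2: 2 inventors
    (31, 3, 1), (32, 3, 2), (33, 3, 3),  # application 3: 3 inventors
    (34, 4, 0),                          # application 4: applicant only, 0 inventors
    (41, 5, 1),                          # application 5: 1 inventor
])

# Step 1: count inventors per application. The LEFT JOIN keeps applications
# without any inventor, so they contribute a count of 0.
# Step 2: per family, take max minus min of those counts.
QUERY = """
WITH per_appln AS (
    SELECT a.docdb_family_id, a.appln_id, COUNT(p.person_id) AS n_inventors
    FROM appln a
    LEFT JOIN pers_appln p
           ON p.appln_id = a.appln_id AND p.invt_seq_nr > 0
    GROUP BY a.docdb_family_id, a.appln_id
)
SELECT docdb_family_id, MAX(n_inventors) - MIN(n_inventors) AS delta
FROM per_appln
GROUP BY docdb_family_id
ORDER BY docdb_family_id
"""

family_delta = dict(conn.execute(QUERY).fetchall())

# Histogram of deltas: how many families show each difference.
delta_histogram = Counter(family_delta.values())

for delta, n_families in sorted(delta_histogram.items()):
    share = 100 * n_families / len(family_delta)
    print(delta, n_families, f"{share:.3f}%")
```

Note that an application with no recorded inventors counts as 0 here, which is what produces very large deltas such as the one in the extreme case reported below. Whether such applications should be excluded before computing the minimum is a design choice this sketch leaves open.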


The largest difference within a family (98 inventors) is found in family_id 39324928, which contains 74 distinct patent applications: WO2008051495 lists 98 inventors, while JP2010520959 lists 0 inventors.
