Table 1 summarizes the datasets for PCR and CRISPR patents and publications in both fields. Patents related to PCR technology from 1993 to 2023 were included, while patents related to CRISPR were considered from 2014 to 2023. The journal articles were divided into subgroups according to citations per article. All published patents of the respective technologies were included.
Table 1: Journal Publications and Patents on PCR (1993-2023) and CRISPR (2014-2023).
The table presents data collected for two technologies, including journal publications with over 10 and over 40 citations, as well as the entire data set of granted patents (Mdn: Mean, SD: Standard deviation).
Table 2 presents the statistical analysis, which shows, that the results deviate from a normal distribution. Conversely, the correlation analysis shows a moderate positive correlation between the results. For PCR moderate correlations of 0.52 and 0.57 indicate a medium to strong positive relationship between publications and patents, suggesting that they tend to move in similar directions but are not perfectly synchronized. In contrast CRISPR exhibits only a weak relationship with positive correlations of 0.18 and 0.23, respectively. The mean value comparison clearly shows significant differences between the categories. The significant p-value indicates that the difference between the patents and publications groups in the areas of PCR and CRISPR is not due to a random effect.
3.1 Results for Polymerase Chain Reaction
The dataset contains 4498 publications with more than 10 citations, with a maximum of 326 articles published in one year. However, there is a high fluctuation with a standard deviation of 116.5, the median value was 112 publications. 41% of the articles had a higher citation rate of more than 40. The maximum number of publications fell by more than half to 128 publications, as well as the median value and the standard deviation (Table 1). From Table 3 it can be deduced, that the median impact factor is 5.32 and fluctuates with a standard deviation of 11.4.
The statistics for the 434 patents are less volatile, with a maximum of 31 patents in 2021, a median of 14 and a standard deviation of 8.3.
The cross-correlation shows how patents and publications change over time. The maximum correlation is a lag of 0, which indicates that there is no time lag between the publication of a scientific article and a patent. The same applies to the median and the mean of 0. However, the high standard deviation of 18.33 indicates that the time lag between patents and publications fluctuates greatly (Figure 3).
Table 4 reveals that China has the most journal publications, whereas the number of patents in China ranks behind the USA and India. Compared to other countries, the USA has the most patents and the second-highest number of publications.
3.2 Results for CRISPR/Cas9
The second part of summarizes the data collected for CRISPR Technology. The number of publications with more than 10 citations differs from the PCR technology in some important respects. The period considered is significantly shorter, at 9 years only approximately 30% of the time span of PCR technology. A total of 651 publications were included, with a maximum of 150 publications per year. The standard deviation was 89.2 and the median was 119. At a higher citation limit of 40 citations, the standard deviation decreases by more than 75% compared to 61% for PCR.
A total of 246 fulfilled the criterion, the maximum number of publications per year amounted to 50 publications. The standard deviation and the median are 20.8 and 24 respectively, which also shows a distinct contrast to the PCR technology. In the category of published patents, 56 patents with CRISPR technology were identified. In addition to the temporal distortion, this also reflects the fact that there are 87.15 fewer patents in the CRISPR field. The maximum number of patents per year was 12, the standard deviation was 3.8 and the median was 4.5.
In the computation of cross-correlation the maximum of the coefficient is -1 years, which indicates, that the publications are 1 years prior to the patent publications.
The mean time lag is 0, proving a heterogeneous distribution, that both patents run ahead of publications and vice versa. The standard deviation of 5,63 years demonstrates the large scatter. The median time lag supports the findings of the mean, which is also 0 and proves, that the mean delay is neither strongly influenced nor affected (Figure 4).
The analysis of publications and patents by country activity demonstrates that China and the USA are also comparably active in CRISPR technology. The ratio between the number of publications and patents has changed considerably for the technologies. In the case of PCR, only 17 patents were registered in China out of a total of 435 publications, while the ratio for CRISPR rose from 4% to 22%, i.e. 24 patents were registered for 109 publications. In contrast, in the USA, 206 patents were granted in the field of PCR, and scientific activity accounted for 165 publications. The ratio of patents to publications is 10% for CRISPR, but 126% for PCR (Table 4).
Figure A shows the chronological development of patent publications and scientific articles in the field of polymerase chain reaction (PCR) from 1990 to 2023, with only granted patents shown in green and scientific articles in blue. Figure B shows the chronological development of patent publications and scientific articles in the field of breast cancer for the same period. In this case, patents granted are also shown in green and scientific articles in blue. The data has been normalized for better comparability.
3.3 Nonpatent references
Figure 4 and Table 5 summarize the results of 32 active US patents with PCR technology.
If the chronological horizon of the cited publications (publication date of the article) is analyzed, a time span of an average of 23.4 (Md. = 22) years and an average of 21.7 (Md. = 19) years before the publication of the patent and 2.1 years (Md. = 2) after the publication of the patent can be observed. Figure 4 illustrates this relationship.
The patents under consideration reference an average of 66.1 (Md. = 31) publications.
Comparing the publications cited by the patents with those from the data set of publications, it is noticeable, that only one publication is also cited in the patents. If the publications cited by the patents are compared with each other, it becomes apparent, that the majority of the articles are only cited once and only 22 scientific articles are cited more than three times by the patents analyzed (Table 7).
The cited publications within the patents exhibit a discernible concentration on specific journals as evidenced by the dataset. In terms of journals, the data reveal an average impact factor of 26.3 (Md. = 25.1), with citations per publication averaging 1485.5 (Md. = 876.7). A total of 1525 publications from the 32 active patents were cited from 463 journals.