AI for Patents: A Novel Yet Effective and Efficient Framework for Patent Analysis

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

AI for Patents : A Novel Yet Effective and Efficient Framework for Patent Analysis. / Son, Junyoung; Moon, Hyeonseok; Lee, Jeongwoo; Lee, Seolhwa; Park, Chanjun; Jung, Wonkyung; Lim, Heuiseok.

In: IEEE Access, Vol. 10, 2022, p. 59205-59218.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Son, J, Moon, H, Lee, J, Lee, S, Park, C, Jung, W & Lim, H 2022, 'AI for Patents: A Novel Yet Effective and Efficient Framework for Patent Analysis', IEEE Access, vol. 10, pp. 59205-59218. https://doi.org/10.1109/ACCESS.2022.3176877

APA

Son, J., Moon, H., Lee, J., Lee, S., Park, C., Jung, W., & Lim, H. (2022). AI for Patents: A Novel Yet Effective and Efficient Framework for Patent Analysis. IEEE Access, 10, 59205-59218. https://doi.org/10.1109/ACCESS.2022.3176877

Vancouver

Son J, Moon H, Lee J, Lee S, Park C, Jung W et al. AI for Patents: A Novel Yet Effective and Efficient Framework for Patent Analysis. IEEE Access. 2022;10:59205-59218. https://doi.org/10.1109/ACCESS.2022.3176877

Author

Son, Junyoung ; Moon, Hyeonseok ; Lee, Jeongwoo ; Lee, Seolhwa ; Park, Chanjun ; Jung, Wonkyung ; Lim, Heuiseok. / AI for Patents : A Novel Yet Effective and Efficient Framework for Patent Analysis. In: IEEE Access. 2022 ; Vol. 10. pp. 59205-59218.

Bibtex

@article{a97be2227aca469f94d187a377758562,
title = "AI for Patents: A Novel Yet Effective and Efficient Framework for Patent Analysis",
abstract = "Patents provide inventors exclusive rights to their inventions by protecting their intellectual property rights. However, analyzing patent documents generally requires knowledge of various fields, considerable human labor, and expertise. Recent studies to alleviate this problem on patent analysis deal only with the analysis of claims and abstract parts, neglecting the descriptions that contain essential technical cores. Moreover, few studies use a deep learning approach to handle the entire patent analysis process, including preprocessing, summarization, and key-phrase generation. Therefore, we propose a novel multi-stage framework that can aid in analyzing patent documents by using the description part of the patent rather than abstracts or claims with deep learning. The framework comprises two stages: key-sentence extraction and key-phrase generation tasks. These stages are based on the T5 model structure, transformer-based architecture that uses a text-to-text approach. To further improve the framework's performance, we employed two key factors: i) post-training the model with a patent-related raw corpus for encouraging the model's comprehension of the patent domain, and ii) utilizing a text rank algorithm for efficient training based on the priority score of each sentence. We verified that our key-phrase generation method of the framework shows higher performance in both superficial and semantic evaluation than other extraction methods. In addition, we provided the validity and effectiveness of our methods through quantitative and qualitative analysis, demonstrating the practical functionality of our methods. We also provided a practical contribution to the patent analysis by releasing the framework as a demo system. ",
keywords = "Deep learning, Key-sentence extraction, Keyword extraction, Patent, Patent analysis, Post training",
author = "Junyoung Son and Hyeonseok Moon and Jeongwoo Lee and Seolhwa Lee and Chanjun Park and Wonkyung Jung and Heuiseok Lim",
note = "Publisher Copyright: {\textcopyright} 2013 IEEE.",
year = "2022",
doi = "10.1109/ACCESS.2022.3176877",
language = "English",
volume = "10",
pages = "59205--59218",
journal = "IEEE Access",
issn = "2169-3536",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

RIS

TY - JOUR

T1 - AI for Patents

T2 - A Novel Yet Effective and Efficient Framework for Patent Analysis

AU - Son, Junyoung

AU - Moon, Hyeonseok

AU - Lee, Jeongwoo

AU - Lee, Seolhwa

AU - Park, Chanjun

AU - Jung, Wonkyung

AU - Lim, Heuiseok

N1 - Publisher Copyright: © 2013 IEEE.

PY - 2022

Y1 - 2022

N2 - Patents provide inventors exclusive rights to their inventions by protecting their intellectual property rights. However, analyzing patent documents generally requires knowledge of various fields, considerable human labor, and expertise. Recent studies to alleviate this problem on patent analysis deal only with the analysis of claims and abstract parts, neglecting the descriptions that contain essential technical cores. Moreover, few studies use a deep learning approach to handle the entire patent analysis process, including preprocessing, summarization, and key-phrase generation. Therefore, we propose a novel multi-stage framework that can aid in analyzing patent documents by using the description part of the patent rather than abstracts or claims with deep learning. The framework comprises two stages: key-sentence extraction and key-phrase generation tasks. These stages are based on the T5 model structure, transformer-based architecture that uses a text-to-text approach. To further improve the framework's performance, we employed two key factors: i) post-training the model with a patent-related raw corpus for encouraging the model's comprehension of the patent domain, and ii) utilizing a text rank algorithm for efficient training based on the priority score of each sentence. We verified that our key-phrase generation method of the framework shows higher performance in both superficial and semantic evaluation than other extraction methods. In addition, we provided the validity and effectiveness of our methods through quantitative and qualitative analysis, demonstrating the practical functionality of our methods. We also provided a practical contribution to the patent analysis by releasing the framework as a demo system.

AB - Patents provide inventors exclusive rights to their inventions by protecting their intellectual property rights. However, analyzing patent documents generally requires knowledge of various fields, considerable human labor, and expertise. Recent studies to alleviate this problem on patent analysis deal only with the analysis of claims and abstract parts, neglecting the descriptions that contain essential technical cores. Moreover, few studies use a deep learning approach to handle the entire patent analysis process, including preprocessing, summarization, and key-phrase generation. Therefore, we propose a novel multi-stage framework that can aid in analyzing patent documents by using the description part of the patent rather than abstracts or claims with deep learning. The framework comprises two stages: key-sentence extraction and key-phrase generation tasks. These stages are based on the T5 model structure, transformer-based architecture that uses a text-to-text approach. To further improve the framework's performance, we employed two key factors: i) post-training the model with a patent-related raw corpus for encouraging the model's comprehension of the patent domain, and ii) utilizing a text rank algorithm for efficient training based on the priority score of each sentence. We verified that our key-phrase generation method of the framework shows higher performance in both superficial and semantic evaluation than other extraction methods. In addition, we provided the validity and effectiveness of our methods through quantitative and qualitative analysis, demonstrating the practical functionality of our methods. We also provided a practical contribution to the patent analysis by releasing the framework as a demo system.

KW - Deep learning

KW - Key-sentence extraction

KW - Keyword extraction

KW - Patent

KW - Patent analysis

KW - Post training

UR - http://www.scopus.com/inward/record.url?scp=85130817030&partnerID=8YFLogxK

U2 - 10.1109/ACCESS.2022.3176877

DO - 10.1109/ACCESS.2022.3176877

M3 - Journal article

AN - SCOPUS:85130817030

VL - 10

SP - 59205

EP - 59218

JO - IEEE Access

JF - IEEE Access

SN - 2169-3536

ER -

ID: 314304435