키워드 추출

키워드 추출은 문서의 주제를 가장 잘 설명하는 용어의 자동 식별으로 작업한다.^[1]^[2]

주요 문구, 주요 용어, 주요 세그먼트 또는 단지 키워드는 문서에 포함된 가장 관련성이 높은 정보를 나타내는 용어를 정의하는 데 사용되는 용어다.용어는 다르지만 기능은 동일하다: 문서에서 논의되는 주제의 특성화.키워드 추출 과제는 텍스트 마이닝, 정보 추출, 정보 검색 및 자연어 처리(NLP)에서 중요한 문제다.^[3]

키워드 할당 대 추출

키워드 할당 방법은 대략 다음과 같이 나눌 수 있다.

키워드 할당(제어된 어휘 또는 분류법에서 선택됨) 및
키워드 추출(원문 텍스트에 명시적으로 언급된 단어 중에서 선택됨).

자동 키워드 추출 방법은 감독, 반 감독 또는 감독 없이 할 수 있다.^[4]^[5]감독되지 않은 방법은 이러한 방법의 일부 또는 대부분을 결합한 단순한 통계, 언어학 또는 그래프 기반 또는 앙상블 방식으로 더욱 나눌 수 있다.^[6]

참조

^ Beliga, Slobodan; Ana, Meštrović; Martinčić-Ipšić, Sanda. (2015). "An Overview of Graph-Based Keyword Extraction Methods and Approaches". Journal of Information and Organizational Sciences. 39 (1): 1–20.{{cite journal}}: CS1 maint: 작성자 매개변수 사용(링크)
^ Rada Mihalcea and Paul Tarau (July 2004). TextRank: Bringing Order into Texts (PDF). Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2004). Barcelona, Spain.{{cite conference}}: CS1 maint: 작성자 매개변수 사용(링크)
^ Beliga, Slobodan; Meštrović, Ana; Martinčić- Ipšić, Sanda. (2014). Toward Selectivity-Based Keyword Extraction for Croatian News (PDF). Surfacing the Deep and the Social Web (SDSW 2014). Vol. 1310. Italy: CEUR Proc. pp. 1–14.{{cite conference}}: CS1 maint: 작성자 매개변수 사용(링크)
^ Alrehamy, H.; Walker, C. (2017). SemCluster: Unsupervised Automatic Keyphrase Extraction Using Affinity Propagation. 17th UK Workshop on Computational Intelligence.{{cite conference}}: CS1 maint: 작성자 매개변수 사용(링크)
^ "Keyword Extraction: from TF-IDF to BERT".
^ Tayfun Pay; Stephen Lucci (2017). Automatic Keyword Extraction: An Ensemble Method. 2017 IEEE International Conference on Big Data (Big Data). doi:10.1109/BigData.2017.8258552.{{cite conference}}: CS1 maint: 작성자 매개변수 사용(링크)

추가 읽기

스콜리아는 키워드 추출에 대한 주제 프로파일을 가지고 있다.

Nazanin Firoozeh; Adeline Nazarenko; Fabrice Alizon; Béatrice Daille (11 November 2019). "Keyword extraction: Issues and methods". Natural Language Engineering. 26 (3): 259–291. doi:10.1017/S1351324919000457. ISSN 1351-3249. Wikidata Q109971296.

이 컴퓨터 언어학 관련 기사는 단조롭다.위키피디아를 확장하여 도울 수 있다.

[1] Beliga, Slobodan; Ana, Meštrović; Martinčić-Ipšić, Sanda. (2015). "An Overview of Graph-Based Keyword Extraction Methods and Approaches". Journal of Information and Organizational Sciences. 39 (1): 1–20.{{cite journal}}: CS1 maint: 작성자 매개변수 사용(링크)

[2] Rada Mihalcea and Paul Tarau (July 2004). TextRank: Bringing Order into Texts (PDF). Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2004). Barcelona, Spain.{{cite conference}}: CS1 maint: 작성자 매개변수 사용(링크)

[3] Beliga, Slobodan; Meštrović, Ana; Martinčić- Ipšić, Sanda. (2014). Toward Selectivity-Based Keyword Extraction for Croatian News (PDF). Surfacing the Deep and the Social Web (SDSW 2014). Vol. 1310. Italy: CEUR Proc. pp. 1–14.{{cite conference}}: CS1 maint: 작성자 매개변수 사용(링크)

[4] Alrehamy, H.; Walker, C. (2017). SemCluster: Unsupervised Automatic Keyphrase Extraction Using Affinity Propagation. 17th UK Workshop on Computational Intelligence.{{cite conference}}: CS1 maint: 작성자 매개변수 사용(링크)

[5] "Keyword Extraction: from TF-IDF to BERT".

[6] Tayfun Pay; Stephen Lucci (2017). Automatic Keyword Extraction: An Ensemble Method. 2017 IEEE International Conference on Big Data (Big Data). doi:10.1109/BigData.2017.8258552.{{cite conference}}: CS1 maint: 작성자 매개변수 사용(링크)

[1]

[2]

[3]

[4]

[5]

[6]

Search

키워드 추출

네임스페이스

더

키워드 할당 대 추출

참조

추가 읽기