Noindex

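The noindex value of an HTML robots meta tag requests that automated Internet bots avoid indexing a web page.^[1]^[2] Reasons for using this meta tag include advising robots not to index a very large database, transitory web pages, web pages under development, web pages that one wishes to keep slightly more private, or the printer-friendly and mobile-friendly versions of pages. Because the burden of honoring a website's noindex tag lies with the author of the search robot, these tags are sometimes ignored. The interpretation of the noindex tag also differs slightly from one search engine company to the next.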

Robots noindex

Noindexing entire pages

<html> <head> <meta name="robots" content="noindex"> <title>Don't index this page</title> </head>

Possible values for the meta tag content are "none", "all", "index", "noindex", "nofollow", and "follow". A combination of values is also possible, for example:^[1]

<meta name="robots" content="noindex, follow">
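How a crawler might interpret a combined content value can be sketched in Python. The function name `parse_robots_content` is hypothetical, and the sketch assumes the common reading in which "none" is shorthand for "noindex, nofollow" and "all" for "index, follow"; individual search engines may interpret the values differently.

```python
def parse_robots_content(content: str) -> dict:
    """Interpret a robots meta tag content string (illustrative sketch)."""
    # Assumed defaults when nothing is specified: indexing and following allowed.
    result = {"index": True, "follow": True}
    for token in content.lower().split(","):
        token = token.strip()
        if token == "noindex":
            result["index"] = False
        elif token == "index":
            result["index"] = True
        elif token == "nofollow":
            result["follow"] = False
        elif token == "follow":
            result["follow"] = True
        elif token == "none":
            result = {"index": False, "follow": False}
        elif token == "all":
            result = {"index": True, "follow": True}
        # Unknown tokens are ignored in this sketch.
    return result


print(parse_robots_content("noindex, follow"))
```

Later tokens override earlier ones here, which is a design choice of the sketch rather than documented behavior of any particular crawler.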

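Bot-specific directives

The noindex directive can be restricted to a specific bot by specifying a different "name" value in the meta tag. For example, to block Google's bot specifically,^[3] specify:

<meta name="googlebot" content="noindex">

Or, to block Bing's bot, specify:

<meta name="bingbot" content="noindex">

Or, to block Baidu's bot, specify:

<meta name="baiduspider" content="noindex">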
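From the crawler's side, honoring bot-specific tags could look roughly like the following sketch, which uses Python's standard `html.parser` module. The class `RobotsMetaCollector` and the function `may_index` are hypothetical names, and the rule that a page is skipped when either the generic "robots" tag or the bot's own tag asks for noindex is an assumption of this sketch, not a documented policy of any search engine.

```python
from html.parser import HTMLParser


class RobotsMetaCollector(HTMLParser):
    """Collect content values of <meta> tags, keyed by lowercase name."""

    def __init__(self):
        super().__init__()
        self.directives = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        name = (attr.get("name") or "").lower()
        content = (attr.get("content") or "").lower()
        if name:
            self.directives.setdefault(name, []).append(content)


def may_index(html: str, bot_name: str) -> bool:
    """Return False if a generic or bot-specific tag asks for noindex."""
    collector = RobotsMetaCollector()
    collector.feed(html)
    for name in ("robots", bot_name.lower()):
        for content in collector.directives.get(name, []):
            tokens = {t.strip() for t in content.split(",")}
            if "noindex" in tokens or "none" in tokens:
                return False
    return True


page = '<html><head><meta name="googlebot" content="noindex"></head></html>'
print(may_index(page, "googlebot"))
print(may_index(page, "bingbot"))
```

With this sample page, only the bot named "googlebot" is asked to skip indexing; a bot that identifies as "bingbot" finds no applicable directive.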

robots.txt file

A robots.txt file can be used to block crawling.
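A well-behaved crawler can check such rules with Python's standard `urllib.robotparser` module. In the sketch below, the rule set, the `/private/` path, the `ExampleBot` user agent, and the `example.com` URLs are all made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; the /private/ path is an assumption.
rules = """\
User-agent: *
Disallow: /private/
"""

parser = RobotFileParser()
parser.parse(rules.splitlines())

# Ask whether a (hypothetical) bot may fetch each URL under these rules.
private_ok = parser.can_fetch("ExampleBot", "https://example.com/private/page.html")
public_ok = parser.can_fetch("ExampleBot", "https://example.com/public/page.html")
print(private_ok, public_ok)
```

The check covers crawl permission only; it says nothing about how a search engine treats a URL it has learned about by other means.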

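Noindexing part of a page

It is also possible to exclude part of a web page, for example navigation text, from being indexed rather than the whole page. There are various techniques for doing this, and several can be used in combination. Google's main indexing spider, Googlebot, is not known to recognize any of these techniques.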

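<noindex> tag

The Russian search engine Yandex introduced a new <noindex> tag that prevents indexing of the content between the tags. To allow the source code to validate, the comment form <!--noindex--> may be used instead.^[4]

<p>Do index this text. <noindex>Don't index this text.</noindex> <!--noindex-->Don't index this text.<!--/noindex--> </p>

Other indexing spiders also recognize the <noindex> tag, including Atomz.^[5]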
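An indexer that honors both the <noindex> element and its comment form could be sketched as follows with Python's standard `html.parser` module. The names `NoindexTextExtractor` and `strip_noindex_regions` are hypothetical, and the sketch makes no claim about how Yandex or Atomz actually implement the feature.

```python
from html.parser import HTMLParser


class NoindexTextExtractor(HTMLParser):
    """Keep only text that lies outside noindex regions (sketch)."""

    def __init__(self):
        super().__init__()
        self.depth = 0    # greater than 0 while inside a noindex region
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == "noindex":
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == "noindex" and self.depth > 0:
            self.depth -= 1

    def handle_comment(self, data):
        # <!--noindex--> opens a region, <!--/noindex--> closes it.
        marker = data.strip().lower()
        if marker == "noindex":
            self.depth += 1
        elif marker == "/noindex" and self.depth > 0:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth == 0:
            self.chunks.append(data)


def strip_noindex_regions(html: str) -> str:
    extractor = NoindexTextExtractor()
    extractor.feed(html)
    extractor.close()
    # Collapse runs of whitespace left behind by removed regions.
    return " ".join("".join(extractor.chunks).split())


sample = ("<p>Do index this text. <noindex>Don't index this text.</noindex> "
          "<!--noindex-->Don't index this text.<!--/noindex--> </p>")
print(strip_noindex_regions(sample))
```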

Microformat

There is a 2005 draft microformat specification with the same functionality. The Robot Exclusion Profile looks for the attribute and value class="robots-noindex" in HTML tags.

<p>Do index this text.</p> <div class="robots-noindex">Don't index this text.</div> <span class="robots-noindex">Don't index this text.</span> <p class="robots-noindex">Don't index this text.</p>

A combination of values is also possible, for example:^[6]

<div class="robots-noindex robots-follow">Text.</div>
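Class-based exclusion of this kind can be sketched in Python with the standard `html.parser` module. The names `ClassExclusionExtractor` and `strip_excluded_classes` are hypothetical, the sketch assumes well-formed markup in which every non-void element has a matching end tag, and it is not taken from the draft specification.

```python
from html.parser import HTMLParser

# Void elements never get an end tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class ClassExclusionExtractor(HTMLParser):
    """Drop text inside any element carrying the excluded class (sketch)."""

    def __init__(self, excluded_class="robots-noindex"):
        super().__init__()
        self.excluded_class = excluded_class
        self.skip_depth = 0   # nesting depth inside an excluded element
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        classes = (dict(attrs).get("class") or "").split()
        if self.skip_depth > 0:
            self.skip_depth += 1          # nested element inside excluded one
        elif self.excluded_class in classes:
            self.skip_depth = 1           # entering an excluded element

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        if self.skip_depth == 0:
            self.chunks.append(data)


def strip_excluded_classes(html: str, excluded_class: str = "robots-noindex") -> str:
    extractor = ClassExclusionExtractor(excluded_class)
    extractor.feed(html)
    extractor.close()
    return " ".join("".join(extractor.chunks).split())


sample = ('<p>Do index this text.</p> '
          '<div class="robots-noindex robots-follow">Don\'t index this text.</div>')
print(strip_excluded_classes(sample))
```

Because the class attribute is split on whitespace, an element that combines several values, such as "robots-noindex robots-follow", is still recognized. The class name is a parameter, so the same sketch can be pointed at a different class value.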

Yahoo!

In 2007 Yahoo! introduced functionality similar to the microformat into its spider. However, Yahoo!'s spider is incompatible in that it looks for the value class="robots-nocontent" and only this value.

<p>Do index this text.</p> <div class="robots-nocontent">Don't index this text.</div> <span class="robots-nocontent">Don't index this text.</span> <p class="robots-nocontent">Don't index this text.</p>

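SharePoint

SharePoint 2010's iFilter excludes content inside a <div> tag with the attribute and value class="noindex". Inner <div>s were initially not excluded, but this may have changed. It is also unknown whether the attribute can be applied to tags other than <div>.^[8]

<p>Do index this text.</p> <div class="noindex">Don't index this text.</div>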

Structured comments

Google Search Appliance

The Google Search Appliance uses structured comments.^[9]

<p>Do index this text. <!--googleoff: all-->Don't index this text.<!--googleon: all--> </p>
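Removing text between a pair of structured comments can be sketched with Python's standard `re` module. The function name `strip_googleoff_regions` is hypothetical, only the "all" form shown in the example is handled, and the crude tag-stripping step is an assumption of this sketch rather than a description of how the Google Search Appliance processes pages.

```python
import re

# Matches everything from <!--googleoff: all--> to the next <!--googleon: all-->.
GOOGLEOFF_ALL = re.compile(
    r"<!--\s*googleoff:\s*all\s*-->.*?<!--\s*googleon:\s*all\s*-->",
    re.DOTALL | re.IGNORECASE,
)
# Crude pattern for any remaining tag or comment; adequate for simple markup only.
TAG = re.compile(r"<[^>]+>")


def strip_googleoff_regions(html: str) -> str:
    without_excluded = GOOGLEOFF_ALL.sub(" ", html)
    text = TAG.sub(" ", without_excluded)
    return " ".join(text.split())


sample = ("<p>Do index this text. <!--googleoff: all-->Don't index this text."
          "<!--googleon: all--> </p>")
print(strip_googleoff_regions(sample))
```

The non-greedy `.*?` stops at the first closing comment, so two separate excluded regions on one page do not swallow the indexable text between them.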

Other indexing spiders also use their own structured comments.

References

[1] "Robots and the META element", official W3 specification.
[2] "About the Robots <META> tag".
[3] "Using meta tags to block access to your site". Google Webmaster Tools Help.
[4] "Using HTML tags". webmaster → help. Yandex. Section: <noindex> tag. Retrieved March 25, 2013.
[5] "General Search FAQ". Help. Atomz. 2013. Section: How do I exclude parts of my site from being searched?. Archived from the original on December 8, 2021. Retrieved March 23, 2013. Need to prevent parts of individual pages from being searched? If you want to exclude portions of a page from indexing, surround the text with <noindex> and </noindex> tags. This is useful, for example, if you want to exclude navigation text from searches. (registration required)
[6] Janes, Peter (June 18, 2005). "Robot Exclusion Profile". Microformats. Retrieved March 24, 2013.
[7] Garg, Priyank (May 2, 2007). "Introducing Robots-Nocontent for Page Sections". Yahoo! Search Blog. Yahoo!. Archived from the original on August 20, 2014. Retrieved March 23, 2013.
[8] "Control Search Indexing (Crawling) Within a Page with Noindex". Microsoft Developer. Microsoft. June 7, 2010. Archived from the original on November 4, 2017. Retrieved November 4, 2017.
[9] "Administering Crawl: Preparing for a Crawl". Google Search Appliance. Google Inc. August 23, 2012. Section: Excluding Unwanted Text from the Index. Archived from the original on November 23, 2012. Retrieved March 23, 2013.
