Robots.txt is a plain text file. If you don't have this file in the root of your directory, creating one is easy:

- Open any text editor you like (Notepad, TextEdit, etc.)
- Create a file named robots.txt

[Figure: Robots.txt file]

- Upload it to the root of your site. That's it.

By default, WordPress serves the following robots.txt file at the root of the domain:
    User-agent: *
    Disallow: /wp-admin/
    Allow: /wp-admin/admin-ajax.php

You can check your WordPress robots.txt file by simply typing yourwebsitename.com/robots.txt in a new tab of your browser.

[Figure: Search in browser]

This is what a robots.txt file looks like.
Basic Robots.txt File Syntax

Robots.txt syntax is very simple; you don't need to learn a new programming language to create a robots.txt file.

The available directives are few. In fact, knowing just two of them is enough for most purposes.
The two core directives are:

- User-agent – defines the search engine crawler, such as Google, Yandex, or Bing.
- Disallow – tells the crawler to stay away from the specified directory, page, file, or image.

An asterisk (*) can be used to apply a directive to all search engines.
For example, to block every crawler from your entire website, you would configure the robots.txt file in the following way:

    User-agent: *
    Disallow: /

Here, the slash (/) means "don't crawl anything under the root," i.e. the whole site.
Let's work through this example of the robots.txt file before moving on.

Say I want to tell search engines not to index my website. I simply write the following command in a .txt file and upload it to the root of my directory:
    Disallow: /

But this command is incomplete on its own: I also have to specify which search engine it applies to, using the User-agent directive.
    User-agent: *
    Disallow: /

Here the asterisk (*) stands for all search engines. So, according to this command, no search engine will index my website.
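You can check how a crawler would interpret rules like these with Python's standard-library urllib.robotparser. This is just a local sketch: the rule lines are the two-line example above, and the URLs are placeholders.

```python
from urllib import robotparser

# The two-line rule set from above: block every crawler from the whole site.
rules = [
    "User-agent: *",
    "Disallow: /",
]

rp = robotparser.RobotFileParser()
rp.parse(rules)

# Under this rule set, no agent may fetch any path.
print(rp.can_fetch("Googlebot", "https://example.com/"))          # False
print(rp.can_fetch("SomeOtherBot", "https://example.com/blog/"))  # False
```

Swapping in your own robots.txt lines lets you test a rule before uploading it.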
But if you only want to block Google, you have to configure the robots.txt file in the following way:
    User-agent: Googlebot
    Disallow: /

Note: this command only blocks Google's bot from crawling your website.

If, instead, you want to allow only Googlebot and block all other search engines, write the following command in your WordPress robots.txt file:
    User-agent: Googlebot
    Disallow:

    User-agent: *
    Disallow: /

This code in your robots.txt gives only Google full access to your website while keeping everyone else out.
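Again, urllib.robotparser can confirm how these grouped rules are applied per crawler (the page URL below is a placeholder):

```python
from urllib import robotparser

# Allow only Googlebot; block every other crawler.
rules = [
    "User-agent: Googlebot",
    "Disallow:",             # an empty Disallow means "allow everything"
    "",
    "User-agent: *",
    "Disallow: /",
]

rp = robotparser.RobotFileParser()
rp.parse(rules)

print(rp.can_fetch("Googlebot", "https://example.com/page.html"))  # True
print(rp.can_fetch("Bingbot", "https://example.com/page.html"))    # False
```

Each crawler follows the most specific User-agent group that matches it, falling back to the * group otherwise.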
Note: commands are processed in sequence, so it is important to allow a specific crawler first and then disallow the rest.

Some Other Robots.txt Syntax

- Allow – permits crawling of a specific path.
- Sitemap – tells crawlers where your sitemap file is located.
- Host – tells crawlers your primary domain.

Allow directive:

A common misconception about the Allow directive is that it is used to invite search engines to check out your site. In fact, Allow is used to grant access to a specific subfolder or file inside a disallowed directory.
For example:

    User-agent: *
    Allow: /content/my-file.php
    Disallow: /content/

Search engines would stay away from the content folder in general, but could still access my-file.php.

Note: you need to place the Allow directive first in order for this to work.

Sitemap directive:

This can be used to tell search engines and other robots where your sitemap is located. The relevant part of the robots.txt could look like this:
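A quick check with urllib.robotparser shows the Allow exception in action, using the hypothetical paths from the example above:

```python
from urllib import robotparser

# Block the content folder, but carve out one file with Allow.
rules = [
    "User-agent: *",
    "Allow: /content/my-file.php",
    "Disallow: /content/",
]

rp = robotparser.RobotFileParser()
rp.parse(rules)

# The allowed file is reachable; everything else in /content/ is not.
print(rp.can_fetch("AnyBot", "https://example.com/content/my-file.php"))  # True
print(rp.can_fetch("AnyBot", "https://example.com/content/other.html"))   # False
```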
For example:

    Sitemap: https://www.hitechwork.com/post-sitemap.xml
    Sitemap: https://www.hitechwork.com/page-sitemap.xml
    Sitemap: https://www.hitechwork.com/category-sitemap.xml

While Disallow rules in a WordPress robots.txt file block particular directories, the Sitemap directive gives robots a list of pages that are available for indexing.
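On Python 3.8 and later, urllib.robotparser also picks up Sitemap lines and exposes them via site_maps(), which is a convenient way to sanity-check the file:

```python
from urllib import robotparser

# Sitemap lines are independent of any User-agent group.
rules = [
    "Sitemap: https://www.hitechwork.com/post-sitemap.xml",
    "Sitemap: https://www.hitechwork.com/page-sitemap.xml",
    "Sitemap: https://www.hitechwork.com/category-sitemap.xml",
]

rp = robotparser.RobotFileParser()
rp.parse(rules)

# site_maps() returns the listed sitemap URLs (or None if there are none).
print(rp.site_maps())
```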
As I already mentioned, giving search engines a sitemap can increase the number of pages they index. The sitemap can also tell robots when a page was last modified, its priority, and how often it is likely to be updated.
Host directive:

The Host directive is only supported by Yandex. It lets you decide whether you want www.example.com or example.com to appear in search results.

[Figure: Host syntax]

For example:

    Host: www.hitechwork.com

I don't recommend relying on this, because only Yandex supports it. But if you want to, you can learn more about the Host directive here.
It is better to use only settings that all search engines honor. Google, for example, uses 301 redirects to handle this situation.
For example:
If your domain starts with www, people who visit your website without www (hitechwork.com) will automatically be redirected to www.hitechwork.com.
Advanced Robots.txt Syntax

The robots.txt file is not only used to prevent search engines from crawling your site.
It can also provide useful information to search engines and block unnecessary files to keep your site's index clean.
For example: on your website, you might have a folder for test content, affiliate links, unnecessary images, and many other things.
If you want to keep this folder out of the search engine index, write the following command in the robots file:
    User-agent: *
    Disallow: /testfolder/

All the content in testfolder is now blocked.
But if you want to block access to all folders whose names begin with wp (or anything else), you could do it like this:

    User-agent: *
    Disallow: /wp-*/

And if you want to exclude all PDF files in the media folder from showing up in search results, you would again write the following command in the robots.txt file:
    User-agent: *
    Disallow: /wp-content/uploads/*/*/*.pdf

Note: when you upload a file, it goes into the Uploads folder.

See the screenshot below.

[Figure: URL of an uploaded image]

Here I replaced the month and day directories that WordPress automatically creates with wildcard asterisks (*).
According to this command, no matter when they were uploaded, all files in the uploads folder ending with .pdf are blocked. For example:

    www.hitechwork.com/wp-content/uploads/2017/04/SEO.pdf
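Python's built-in robotparser does not understand the * wildcard, but the Google-style matching these rules rely on can be sketched by translating a pattern into a regular expression. This is a rough illustration of the matching idea, not a complete robots.txt implementation:

```python
import re

def robots_pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a Google-style robots.txt path pattern into a regex.

    '*' matches any run of characters, a trailing '$' anchors the
    pattern to the end of the path, and everything else is a plain
    prefix match, just like an ordinary Disallow path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape regex metacharacters, then restore each '*' as '.*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))

def is_blocked(disallow_pattern: str, path: str) -> bool:
    return robots_pattern_to_regex(disallow_pattern).match(path) is not None

# The PDF rule from above blocks uploads from any year/month directory...
print(is_blocked("/wp-content/uploads/*/*/*.pdf",
                 "/wp-content/uploads/2017/04/SEO.pdf"))    # True
# ...but leaves non-PDF uploads alone.
print(is_blocked("/wp-content/uploads/*/*/*.pdf",
                 "/wp-content/uploads/2017/04/SEO.png"))    # False
# The /wp-*/ rule matches any top-level folder beginning with "wp-".
print(is_blocked("/wp-*/", "/wp-admin/admin.php"))          # True
```

This mirrors why the wildcards in the rule work regardless of the upload date: each * simply stands in for the year and month directory names.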