Our search service always recieves the follwoing error when indexing SharePoint
content:
Error in the Site Data Web Service. (*** Client found response content type of
'text/html', but expected 'text/xml'. The request failed with the error message:
-- <HTML> <HEAD><TITLE>500 Server Error</TITLE></HEAD> <BODY> <H1>Server
Error</H1> <H4> The following error occurred:<P> [code=CANT_CONNECT] Could not
connect because of networking problems. Contact your system administrator. </H4>
<HR> Please contact the administrator. </BODY> </HTML> --.)
We are quite certain that this is a message from our proxy server.
According to this TechNet article
(http://technet2.microsoft.com/Office/en-us/library/87e0397a-ddab-442f-91cd-8acf" target="_blank" rel="nofollow">technet2.microsoft.com/.../87e0397a-ddab-442f-91cd-8acf\
0c0c712b1033.mspx?mfr=true
<https://intranet.tria.de/exchweb/bin/redir.asp?URL=http://technet2.microsoft.co\
m/Office/en-us/library/87e0397a-ddab-442f-91cd-8acf0c0c712b1033.mspx?mfr=true> )
, when creating your search service, the hosts entry is changed to point to the
WFE so that the DNS server does not need to be contacted. We can confirm that
the hosts entry has been changed correctly and that the WFE is accessible.
Currently our DNS is configured to publish SharePoint via an external ISA
server, however the ISA is not running currently.
We have an identical farm, which does not use ISA publishing and therefore has
DNS configured as expected and the search crawls with no problems..
Changing the DNS is not an option and it seems the hosts file has no effect on
the crawler.
Do you know someone who can help? Share a link to this thread on twitter, or facebook.