Gravatar for

Question by searchdeveloper, Feb 16, 2017 10:42 PM

Incorrect excerpt with HtmlContentInBodyWithRequestsProcessor on secured pages

We are using Coveo for Sitecore version: 3.0.1084.0 and Sitecore version: 8 update 4. HtmlContentInBodyWithRequestsProcessor is enabled and works for unsecured pages (valid excerpt, full text search). On secured pages, the following invalid excerpt is generated:

This web browser does not support JavaScript or JavaScript in this web browser is not enabled. … To find out if your web browser supports JavaScript or to enable JavaScript, see web browser help. … Sign in with your organizational account … Keep me signed in

Full text search on the rendered page content also does not work for the secured pages. We do not see anything that seems relevant in the Coveo logs.

How can we correct this?

1 Reply
Gravatar for

Answer by Jean-François L'Heureux, Feb 17, 2017 7:48 AM

The Coveo processor that gets the HTML of the page is simply doing an HTTP request to the published web page to get the content. If the page is secured and your business logic is redirecting the user to a login page, the login page will be indexed as the body of the secured pages.

That is unless you provide the processor with valid credentials to use on your login page. You can follow this guide on how to do it:

Gravatar for

Comment by searchdeveloper, Feb 23, 2017 7:24 AM

Thank you, configuring credentials corrected the issue. Do you know if the HTML processing has been improved in version 4.0 to not use simple web requests per admin response ?

Gravatar for

Comment by Jean-François L'Heureux, Feb 23, 2017 7:33 AM

This improvement hasn't been done yet. We tried but we discovered other problems in this method. In the end, we kept the actual method with an HTTP request. The improvement request is still opened on our side as we might have other ideas to improve the process.

Ask a question