Gravatar for anya.zajaczkowska@alpa.org

Question by searchdeveloper2, Mar 29, 2017 8:42 PM

Custom open converter not being called for web pages

We are using Coveo for Sitecore version: 3.0.1084.0 and Sitecore version: 8 update 4.

A custom converter was added to Open Converters to exclude certain HTML from getting indexed according to https://developers.coveo.com/display/public/Converter/Modifying+an+HTML+Document

The converter was then associated with the Web Pages document type in the document set. When the source is re-indexed, the converter is not being executed for Sitecore pages (no traces in log). If we associate the converter with another document type, PDF for example, it is executed.

How can we troubleshoot this?

2 Replies
Gravatar for jflheureux@coveo.com

Answer by Jean-François L'Heureux, Mar 30, 2017 1:20 PM

Have you modified the right "Document Type Set"? Sitecore sources do not use the default document type set. They are using one created by Coveo for Sitecore which also indexes images. You can check which document type set is used by the source and modify this one.

Gravatar for anya.zajaczkowska@alpa.org

Comment by searchdeveloper2, Mar 30, 2017 8:01 PM

Yes, we have modified the correct document type set that is used by the Sitecore source (Sitecore Search Provider Document Types Set). If we associate the converter with another document type, it does get called.

Gravatar for jflheureux@coveo.com

Answer by Jean-François L'Heureux, Mar 30, 2017 8:11 PM

The approach you found is outdated.

Coveo for Sitecore 3.0 documentation outlines a cleaner way to clean the HTML to be indexed. This solution uses a Sitecore processor to clean the HTML instead of a custom converter.

I suggest you to try it.

Jeff

Ask a question