Question by jpdery, May 23, 2014 1:55 PM

Salesforce indexing: clean html markup from field

In a Coveo cloud org, I index a Salesforce Knowledge field, "Article Solution", that contains HTML markup. Can the markup be cleaned so that free-text search doesn't hit on words like "style"?

Answer by Martin Laporte, May 24, 2014 4:06 PM

Hmm, there isn't anything to do this out-of-the-box right now, but that's a very valid use case. A post-conversion script on the index server could do the trick, but that's the kind of thing we should be able to do out-of-the-box (I mean, HTML fields are pretty frequent in Salesforce).

I'll forward this to the appropriate person (cough, Jodi, cough) to see if we might add something in the mapping file that would allow cleaning HTML from fields.

Comment by jgiordano, May 26, 2014 9:47 AM

As discussed with Martin, this is indeed a valid use case and I will look into supporting it in the future. For now, a post-conversion script would do the trick.
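For anyone wondering what the cleaning step itself would involve, here is a rough sketch of the logic in Python, using only the standard library. It is not a Coveo post-conversion script and does not use any Coveo API; the names `strip_html` and `_TextExtractor` are made up for illustration. The idea is to keep only the visible text, so that tag names, attributes such as `style="..."`, and the bodies of `<style>` and `<script>` blocks never reach the free-text index.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Hypothetical helper: collects visible text and ignores markup."""

    # Elements whose contents are not visible text and should be dropped.
    _SKIPPED = {"style", "script"}

    def __init__(self):
        # convert_charrefs=True decodes entities such as &amp; for us.
        super().__init__(convert_charrefs=True)
        self._chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        # Attributes (including style="...") are simply never recorded.
        if tag in self._SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self._SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self._chunks.append(data)

    def text(self):
        # Join chunks with a space, then collapse runs of whitespace.
        return " ".join(" ".join(self._chunks).split())


def strip_html(value):
    """Return the visible text of an HTML fragment (illustrative only)."""
    parser = _TextExtractor()
    parser.feed(value)
    parser.close()
    return parser.text()
```

Joining the text chunks with a space keeps words from adjacent block elements from running together, at the cost of an occasional extra space around inline tags. For search indexing that trade-off is usually harmless.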

Answer by maveilleux, Dec 23, 2014 11:07 AM

As of the October 2014 CES release, the connector generates an HTML-stripped version of some fields. To use it, simply append __html_stripped to the field's metadata name.

So if you have a field named yourfield__c, the HTML-stripped version would be named yourfield__c__html_stripped.
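If you need to build these names for several fields, for example when generating a mapping or a query, the convention is easy to capture in a tiny helper. This is only a sketch of the naming rule described above; the function name `html_stripped_field` is hypothetical and not part of any Coveo tooling.

```python
# Suffix described in this answer for the HTML-stripped variant of a field.
HTML_STRIPPED_SUFFIX = "__html_stripped"


def html_stripped_field(field_name):
    """Return the metadata name of the HTML-stripped variant of a field."""
    # Avoid appending the suffix twice if the name is already the stripped one.
    if field_name.endswith(HTML_STRIPPED_SUFFIX):
        return field_name
    return field_name + HTML_STRIPPED_SUFFIX
```

Calling `html_stripped_field("yourfield__c")` gives the name used in the example above.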

