Salesforce indexing: clean HTML markup from a field
In a Coveo Cloud org, I index a Salesforce Knowledge field, "Article Solution", that contains HTML markup. Can it be cleaned so that free-text search doesn't hit on markup terms such as "style"?
Hmm, there isn't anything that does this out of the box right now, but it's a very valid use case. A post-conversion script on the index server could do the trick, but this is the kind of thing we should support natively (HTML fields are pretty frequent in Salesforce, after all).
I'll forward this to the appropriate person (cough, Jodi, cough) to see if we might add something in the mapping file that would allow cleaning HTML from fields.
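To give an idea of what the stripping step involves, here is a minimal sketch in Python using only the standard library. It is not a Coveo post-conversion script and does not use any Coveo API: `strip_html`, `BLOCK_TAGS`, and `SKIPPED_TAGS` are hypothetical names chosen for illustration, and wiring the logic into the index server is not shown.

```python
from html.parser import HTMLParser

# Tags whose boundaries should become whitespace so adjacent words don't fuse.
# (Illustrative list, not exhaustive.)
BLOCK_TAGS = {"p", "div", "br", "li", "ul", "ol", "tr", "td", "th", "table",
              "h1", "h2", "h3", "h4", "h5", "h6"}
# Tags whose text content is not meant to be searchable.
SKIPPED_TAGS = {"script", "style"}


class _TextExtractor(HTMLParser):
    """Collects visible text and ignores tags, attributes, and style/script bodies."""

    def __init__(self):
        super().__init__(convert_charrefs=True)  # decodes entities such as &amp;
        self._chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in SKIPPED_TAGS:
            self._skip_depth += 1
        elif tag in BLOCK_TAGS:
            self._chunks.append(" ")

    def handle_endtag(self, tag):
        if tag in SKIPPED_TAGS:
            if self._skip_depth:
                self._skip_depth -= 1
        elif tag in BLOCK_TAGS:
            self._chunks.append(" ")

    def handle_data(self, data):
        if not self._skip_depth:
            self._chunks.append(data)

    def text(self):
        # Collapse runs of whitespace left over from the removed markup.
        return " ".join("".join(self._chunks).split())


def strip_html(value):
    """Return the plain text of an HTML fragment (hypothetical helper)."""
    parser = _TextExtractor()
    parser.feed(value)
    parser.close()
    return parser.text()
```

With something like this, a value such as `<p style="color:red">Restart the <b>service</b>.</p>` is reduced to its visible text, so attribute names like "style" never reach the free-text index.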
As of the October 2014 CES release, the connector generates an HTML-stripped version of some fields. Simply append __html_stripped to the field metadata name.
So if you have a field named yourfield__c, the HTML-stripped version would be named: