Indexing Media Library with Sitecore Legacy connector
We have a customer using the Sitecore legacy connector (6.5 x86 Build 4898) and they would like to begin indexing PDFs that they have contained in the media library. I'm trying to determine if there is a way for us to index this content and filter out all other content types such as word docs using document type sets. I have tried to do this by applying a custom document type set and changing the action to reject. However, I still see this content show up in the Index Browser. One thing to note is that the items in Sitecore don't necessarily have the proper file extension. Is there some way to handle this via the Content Type value.
That was actually really fun to test since I have not been playing with CES 6.5 in a while.
Using Document type set would be the best alternative I believe. Now Sitecore might indeed give a special extension to some documents but you can add new extension to an existing document type set using the Add button.
var fileExtension = DocumentInfo.Extension;
You can then add some logic to compare and exclude, example:
DocumentInfo.IsValid = fileExtension !== "the extension that I don't want"
Now remember that you can log in the CES Console using
And finally, once your script is ready, you can reference it in the Adminstration Tools:
Should do the trick.