Question by rloeb, Apr 14, 2017 5:20 PM

Crawling only a specific SharePoint list?

I am trying to get the crawler to crawl only a specific list in SharePoint, but it seems to work only when I point it at SharePoint's root. I can add exclusion filters, but that doesn't seem to speed up crawling, since the excluded branches are still examined even though they are not indexed. Is there some kind of artifact or setting I need to add to SharePoint to make a specific list independently crawlable?

2 Replies

Answer by paul_sheridan, Apr 19, 2017 2:29 PM

Hi! Sorry we didn't get to this earlier. You should use the list URL as the start URL for the source, e.g. https://site:8080/sites/support/lists/contacts/allItems.aspx, rather than crawling the whole site and using exclusion rules to exclude everything except the list you want, I believe. It's important that the start URL points at the list you want to crawl.

The documentation includes an example of a start URL for indexing just a specific list. Please see https://onlinehelp.coveo.com/en/cloud/add_edit_sharepoint_online_source.htm (for Coveo Cloud) or https://onlinehelp.coveo.com/en/ces/7.0/administrator/configuring_and_indexing_a_microsoft_sharepoint_source.htm (for Coveo on-premises). For the on-premises legacy connector, https://onlinehelp.coveo.com/en/ces/7.0/administrator/configuringandindexingamicrosoftsharepointsourcewiththelegacyconnector.htm also includes an example of crawling a specific list.

Comment by rloeb, Apr 19, 2017 3:01 PM

allItems.aspx: pardon my SharePoint ignorance, but is this some kind of standard SharePoint artifact, or something custom that Coveo needs? If it's missing, or if there is a permission problem, would that cause the crawl to fail?

Also, is there a problem with specifying

https://site:8080/sites/support/lists/contacts/

versus

https://site:8080/sites/support/lists/contacts/allItems.aspx

Or will the former automatically cause the crawler to look for allItems.aspx?

Comment by paul_sheridan, Apr 19, 2017 4:20 PM

I just tested this (in Coveo Cloud), and you do not need to specify allItems.aspx.
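
To make the two URL forms concrete, here is a rough Python sketch of how they relate. It is not Coveo's connector logic: `list_root` and `in_scope` are hypothetical helper names, and the prefix check is only an assumption about what "scoped to one list" means. It just shows that both start URLs from the question reduce to the same list folder, and that a sibling list falls outside it.

```python
from urllib.parse import urlsplit, urlunsplit


def list_root(start_url: str) -> str:
    """Return the list folder URL, dropping a trailing view page such as allItems.aspx.

    Hypothetical helper for illustration only; not part of any Coveo API.
    """
    parts = urlsplit(start_url)
    path = parts.path
    last_segment = path.rsplit("/", 1)[-1]
    # A final ".aspx" segment is treated as a view page, not part of the list folder.
    if last_segment.lower().endswith(".aspx"):
        path = path[: len(path) - len(last_segment)]
    if not path.endswith("/"):
        path += "/"
    return urlunsplit((parts.scheme, parts.netloc, path, "", ""))


def in_scope(start_url: str, candidate: str) -> bool:
    """Assumed scoping rule: same scheme and host, and the path sits under the list folder."""
    root = urlsplit(list_root(start_url))
    cand = urlsplit(candidate)
    same_host = (cand.scheme, cand.netloc.lower()) == (root.scheme, root.netloc.lower())
    return same_host and cand.path.lower().startswith(root.path.lower())


with_view = "https://site:8080/sites/support/lists/contacts/allItems.aspx"
without_view = "https://site:8080/sites/support/lists/contacts/"

# Both forms point at the same list folder.
print(list_root(with_view) == list_root(without_view))

# An item in the contacts list is in scope; a different list is not.
print(in_scope(without_view, "https://site:8080/sites/support/lists/contacts/DispForm.aspx?ID=3"))
print(in_scope(without_view, "https://site:8080/sites/support/lists/tasks/allItems.aspx"))
```

The point of scoping by start URL is that nothing outside the list folder is ever visited, whereas exclusion rules applied to a root crawl still have to look at each branch before discarding it.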

Answer by paul_sheridan, Apr 19, 2017 6:41 PM

Oh, I think I may have just realized something: were you trying to use the Web Sites connector to index a SharePoint list? I suspect you were, because you talked about exclusion rules. You should probably be using the SharePoint connector to do this.
