Index pipe delimited files in Coveo
Hi, Is there some documentation out there about using Coveo to index pipe or comma delimited files? I'm trying to figure out the best approach for mapping the values in the input file to fields in the index? Will I need to define a custom document type to do that? Any other steps to take into account?
Any guidance appreciated!
Do your file contains more than one row of data (multiple indexed documents per file) or one file should map to only one indexed document?
If it contains only a single document, I would create and use a post-converison script to extract the values and assign them to CES fields.
If it contains data that should be indexed in multiple indexed documents, I think you would have to code your own connector for your file format.
As Jeff mentioned in his last comment, the Database connector is probably the easiest way to go. All you need is an ODBC or OLE DB driver of the type you need.
Microsoft already installs some of ODBC Drivers by default. I will suggest you to use Microsoft Text Driver. You will need to have a connection string and a configuration file to use in the CES Database source.
For example, to crawl a CSV file with the Database connector using Microsoft Text ODBC Driver to have each row as a different document:
- I would use a connection string as shown here https://www.connectionstrings.com/microsoft-text-odbc-driver/
- My configuration file will have a query similar to this one: "SELECT * FROM [MyDatabaseFile.csv]"