10-02-2013 07:44 AM
Hi,
Here's a use case for the search feature we're having troubles dealing with:
We have various files, say to store numerous norm documents. Their titles have the following format: "NF Z65-130.pdf".
Searching through titles using the FULLTEXT operator gives unsatisfying results since dashes (-) are stripped down from the query (see NXQLQueryBuilder.sanitizeFulltextInput) (1) ; so searching "Z65-130" (even with the double dashes) will eventually be translated as query documents that contain either "Z65" or "130", which will return a lot of results.
The more appropriate operator seems to be LIKE, except it requires the use of the % operator as the wildcard, which is not user-friendly (depending on the field, users would have to use either * or %).
Is there any approach that would allow queries to match user expectations when they simply type "Z60-130" or "*Z60-130*"?
Thanks for your help
(1) This seems to also occur when dc:title is removed from default-repository-config.xml, i.e. when NXQL queries fall back from FULLTEXT 'query' to LIKE '%query%'.
10-02-2013 08:20 AM
You're working with the assumption that fulltext search can search something like "Z65-130". You'll find out that this is not always the case, it heavily depends on the database, the database fulltext configuration, etc.
If what you're searching for is not a regular word in a known language, then fulltext search is not a good match for you unless you spend a lot of time tweaking the database fulltext parser and making sure all the Nuxeo layers know about this specific parsing, which would require code changes.
Fulltext is primarily designed for human languages.
10-02-2013 09:05 AM
Hi Florent,
Find what you came for
We want to make your experience in Hyland Connect as valuable as possible, so we put together some helpful links.