Solr to 'crawl' QuickStart sites

Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
11-26-2012 01:01 PM
Hi,
I would like to index a QuickStart site using the Solr instance installed with Alfresco.
I know that Solr is already used by Alfresco to index Quickstart articles and documents but in our site some pages are created from an external source (a big xml file).
Parts of this xml file are integrated into pages based on a complex logic and so I think that the only solution is to index the full page.
Having a search engine like Solr already runnig, it could be useful to use it to index the site (of course with the support of a web crawler).
Do you see any problem with this approach?
In general, can the Alfresco Solr instance be used by external applications?
Thank you.
Kind Regards,
Marco
I would like to index a QuickStart site using the Solr instance installed with Alfresco.
I know that Solr is already used by Alfresco to index Quickstart articles and documents but in our site some pages are created from an external source (a big xml file).
Parts of this xml file are integrated into pages based on a complex logic and so I think that the only solution is to index the full page.
Having a search engine like Solr already runnig, it could be useful to use it to index the site (of course with the support of a web crawler).
Do you see any problem with this approach?
In general, can the Alfresco Solr instance be used by external applications?
Thank you.
Kind Regards,
Marco
Labels:
- Labels:
-
Archive
1 REPLY 1

Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
12-05-2012 01:08 PM
Hi,
I'm sorry for resend more or less the same question, but for a different project I have a similar problem to solve.
We need to index contents that is not stored inside Alfresco: external document stores, intranets, internet sites, corporate directories.
I know that this is possible with Solr, but I would like to be sure that the Solr instance used by Alfresco doesn't have limitations.
It would be important also to be able to execute a single search for contents in alfresco and external contents indexed in the same Solr.
Thank you for your help,
Marco
I'm sorry for resend more or less the same question, but for a different project I have a similar problem to solve.
We need to index contents that is not stored inside Alfresco: external document stores, intranets, internet sites, corporate directories.
I know that this is possible with Solr, but I would like to be sure that the Solr instance used by Alfresco doesn't have limitations.
It would be important also to be able to execute a single search for contents in alfresco and external contents indexed in the same Solr.
Thank you for your help,
Marco
