
AAAR Extract data CMIS problem

iblanco
Confirmed Champ
Hi everyone,

I have installed and configured AAAR following Francesco's instructions, but I have a problem when extracting data.
I execute AAAR_Extract.sh and everything seems to go right: it connects to the audit REST API and starts processing, and after some minutes it starts getting documents and folders via CMIS.

But when it starts to get some CMIS folder data I get this message time after time:


2014/06/19 20:17:26 - Cmis Input modified folder.0 - Cmis Input - Retrieved n.0 results from item n.1000 on a total of n.1000 results.
2014/06/19 20:17:26 - Cmis Input modified folder.0 - Cmis Input - Retrieved n.0 results from item n.1000 on a total of n.1000 results.
2014/06/19 20:17:27 - Cmis Input modified folder.0 - Cmis Input - Retrieved n.0 results from item n.1000 on a total of n.1000 results.
2014/06/19 20:17:27 - Cmis Input modified folder.0 - Cmis Input - Retrieved n.0 results from item n.1000 on a total of n.1000 results.
2014/06/19 20:17:28 - Cmis Input modified folder.0 - Cmis Input - Retrieved n.0 results from item n.1000 on a total of n.1000 results.
2014/06/19 20:17:28 - Cmis Input modified folder.0 - Cmis Input - Retrieved n.0 results from item n.1000 on a total of n.1000 results.
2014/06/19 20:17:28 - Cmis Input modified folder.0 - Cmis Input - Retrieved n.0 results from item n.1000 on a total of n.1000 results.
2014/06/19 20:17:29 - Cmis Input modified folder.0 - Cmis Input - Retrieved n.0 results from item n.1000 on a total of n.1000 results.


The fact that it says "Retrieved n.0 results" and that the same message is repeated over and over again makes me suspect that this is not normal and that it will never end. Am I wrong, and is this a normal progress message? It has already been running for 2 hours and nothing seems to have changed.
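
For anyone seeing the same symptom, a small script can help tell a slow extraction from a stalled one. The following is only a minimal sketch: it assumes the extraction output has been saved to a text file and that progress lines look exactly like the ones quoted above. The function name `find_stalled_offsets` and the `min_repeats` threshold are hypothetical, not part of AAAR.

```python
import re
from collections import Counter

# Matches the progress lines quoted in this thread (assumed format).
PROGRESS = re.compile(
    r"Retrieved n\.(\d+) results from item n\.(\d+) on a total of n\.(\d+) results"
)


def find_stalled_offsets(lines, min_repeats=5):
    """Return {(offset, total): count} for pages that repeatedly came back empty.

    min_repeats is an arbitrary threshold chosen for illustration.
    """
    empty_pages = Counter()
    for line in lines:
        match = PROGRESS.search(line)
        if not match:
            continue  # not a CMIS progress line
        retrieved, offset, total = map(int, match.groups())
        if retrieved == 0:
            empty_pages[(offset, total)] += 1
    return {key: count for key, count in empty_pages.items() if count >= min_repeats}
```

Feeding it the lines of a saved log (for example `find_stalled_offsets(open(path))`, where `path` is wherever you redirected the output) returns the offsets that keep coming back empty. An empty result means no page was requested repeatedly without returning data.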

Any clue what could be wrong? I rebuilt the Lucene index before loading data just in case, but this didn't change anything.

I think I have successfully followed all of Francesco's instructions, and no error is present in catalina.out. The only "strange" thing I saw was that when I went to "Tools" => "Refresh" => "CDA Cache" in the Pentaho admin console, only a new empty white tab was opened, with no message or text. I don't know if it matters at all.
7 REPLIES

fcorti
Elite Collaborator
Hi Igor,

Don't worry about the waiting time as long as you don't get any sort of error: the extraction is working for you. 😉

The duration of the extraction depends on the number of documents and folders (and on the network too) and can be considerable in some cases.
All of this work organizes the data so that the analysis is fast, which is why the extraction is usually scheduled during the night.

By the way, thanks to contributions from the community, I have discovered that the CMIS Input plugin could be faster.
In the upcoming release (v2.1) this is improved, together with some interesting new features. 😉
If you want, you can test v2.1 using the installation described in the attached image, but be careful… it's a beta version that I still need to test for some more time.

Let us know how it goes…

iblanco
Confirmed Champ
Hi Francesco,

I have fewer than 5000 non-deleted document nodes and fewer than 3000 folders.

I'm afraid there must be an issue or a bug with my Alfresco or AAAR installation, because the initial extraction has already been running for over 16 hours and it keeps repeating:

2014/06/20 10:58:51 - Cmis Input modified folder.0 - Cmis Input - Retrieved n.0 results from item n.1000 on a total of n.1000 results.
2014/06/20 10:58:52 - Cmis Input modified folder.0 - Cmis Input - Retrieved n.0 results from item n.1000 on a total of n.1000 results.
2014/06/20 10:58:52 - Cmis Input modified folder.0 - Cmis Input - Retrieved n.0 results from item n.1000 on a total of n.1000 results.
2014/06/20 10:58:52 - Cmis Input modified folder.0 - Cmis Input - Retrieved n.0 results from item n.1000 on a total of n.1000 results.
2014/06/20 10:58:52 - Cmis Input modified folder.0 - Cmis Input - Retrieved n.0 results from item n.1000 on a total of n.1000 results.
2014/06/20 10:58:53 - Cmis Input modified folder.0 - Cmis Input - Retrieved n.0 results from item n.1000 on a total of n.1000 results.
2014/06/20 10:58:53 - Cmis Input modified folder.0 - Cmis Input - Retrieved n.0 results from item n.1000 on a total of n.1000 results.


But I have checked the AAAR version I'm running and it is already 2.1 Beta; this is the one that the marketplace installed by default. Could it be a bug in 2.1?
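
The repeated line (0 results, item 1000, total 1000) looks like a paging loop that keeps asking for a page past the end of the result set. The actual CMIS Input plugin code is not shown in this thread, so the following is purely an illustration of how a paged reader can guard against that. `fetch_page` is a hypothetical callable standing in for a paged CMIS query, and `fetch_all` is a made-up name.

```python
def fetch_all(fetch_page, page_size=1000):
    """Collect every item from a paged source, stopping on an empty page.

    fetch_page(offset, limit) is a hypothetical callable that returns
    (items, total). It is an assumption made for this sketch only.
    """
    items = []
    offset = 0
    while True:
        page, total = fetch_page(offset, page_size)
        items.extend(page)
        offset += len(page)
        # Stop when the page is empty or everything has been read.
        # Without the empty-page check the loop depends entirely on
        # `total` being accurate and on the comparison being exact;
        # an off-by-one there would request the same empty page forever.
        if not page or offset >= total:
            break
    return items
```

With a guard like this, a source that reports 1000 results and returns nothing from item 1000 onward makes the loop finish instead of logging the same empty page indefinitely.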

fcorti
Elite Collaborator
Ok, 16 hours is definitely too much!
My suggestion is to start with AAAR v2.0.

iblanco
Confirmed Champ
There seems to be a bug with the marketplace or the package itself: no matter how I try to install 2.0, it always downloads 2.1
from http://sourceforge.net/projects/aaar/files/v2.1/AAAR_v2.1.zip/download

The first time I run Pentaho BI it says something about a version checker. Maybe I should disable this version checker so that it allows the installation of older versions.

Or maybe I should just try the manual installation procedure.

fcorti
Elite Collaborator
Mmm… so strange!
I have checked the marketplace config and everything is OK.
To install AAAR manually, you simply have to download the AAAR_v2.0.zip package from SourceForge and unzip it into the system folder of Pentaho (you will easily find an AAAR folder to replace).
For more support, write me privately. 😉
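
The manual installation described above can be sketched as a short script. This is a minimal sketch under assumptions: it assumes AAAR_v2.0.zip has already been downloaded, that the archive contains a top-level AAAR folder (as the mention of "a AAAR folder to replace" suggests), and that Pentaho is stopped while the folder is swapped. The function name `install_aaar` and the `backup_dir` parameter are hypothetical.

```python
import shutil
import zipfile
from pathlib import Path


def install_aaar(zip_path, pentaho_system_dir, backup_dir, plugin_folder="AAAR"):
    """Replace the AAAR folder in Pentaho's system directory with the zip's contents.

    Assumes the archive holds a top-level folder named `plugin_folder`.
    """
    system_dir = Path(pentaho_system_dir)
    target = system_dir / plugin_folder
    if target.exists():
        # Move the current folder out of the system directory rather than
        # deleting it, so the previous version can be restored by hand.
        backup = Path(backup_dir) / plugin_folder
        if backup.exists():
            raise FileExistsError(f"backup already exists: {backup}")
        Path(backup_dir).mkdir(parents=True, exist_ok=True)
        shutil.move(str(target), str(backup))
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(system_dir)
    return target
```

The backup is kept outside the system folder on purpose, so that the old copy is not left sitting next to the new one.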

iblanco
Confirmed Champ
I installed 2.0 by hand, and now the extraction ended after around 24 minutes and everything was OK. It seems there was some kind of problem with the 2.1 beta version.

fcorti
Elite Collaborator
Great!
Now we will see what is wrong with the beta version. 😉
Thank you for the feedback.