Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

arabic results are coming back VERY slowly #19

Open
reynoldsnlp opened this issue Apr 25, 2019 · 4 comments
Open

arabic results are coming back VERY slowly #19

reynoldsnlp opened this issue Apr 25, 2019 · 4 comments
Assignees

Comments

@reynoldsnlp
Copy link
Owner

Searched for الحور الرجراج for 40 results and they were being processed very slowly. Starting about 25/04/2019 16:19:41 in catalina.out.

@reynoldsnlp
Copy link
Owner Author

mvn clean install and restarting tomcat fixed it. Still don't know the source of the problem.

We still had lots of free memory.

@mjbriggs
Copy link
Collaborator

What I suspect to be the problem is websites that are pdf's. The tika processor doesn't work with them at the moment so a bunch of valid search results are tossed because we can't process them. This causes a significant slowdown since we load and try to process the webpage.

@mjbriggs
Copy link
Collaborator

We may have introduced some memory leaks as well

@mjbriggs
Copy link
Collaborator

I have removed pdfs from being acceptable search results. I have not seen this issue pop up in a while so I believe that solved it, but I have not closed this issue since I did not know what very slowly meant. Additionally, it is difficult to tell from the front end whether the server is taking a long time or if the server ran out of heap space.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants