Description of problem:
When hovering on the results list, stderr is filled with text handling error messages. They seem to be harmless and not to impact functionality.

Version-Release number of selected component (if applicable):
1.23.7

How reproducible:
always

Steps to Reproduce:
1. Type recoll in a terminal.
2. Enable "show results in a spreadsheet-like table" with the toolbar button at the right.
3. Type something in the search box and press enter, so that a few results are displayed.
4. Hover with the mouse on the results.

Actual results:
whenever the mouse enters a cell, a bunch of lines like this are printed on stderr:
:2:common/textsplit.cpp:533::Textsplit: error occured while scanning UTF-8 string

Expected results:
no error messages.

Additional info:
Hi, and thanks for reporting this. This is probably caused by some non-UTF-8 data which somehow got into the index. It seems to depend on the data, and I can't reproduce it. Would it be possible for you to share one of the documents which is causing this? If not, please at least indicate the type of document. What is your locale value (output of the 'locale' command)?
Created attachment 1483778 [details] reproducer
Hi, thanks for the input.

Sorry, it is not always reproducible. It happens only if the search result path has an accented vowel from ISO Latin-1. In that case:
- stderr has logs,
- the URL in the results table is truncated at the special character,
- the URL in the HTML preview is full.

The locale is it_IT.UTF-8; it also happens with en_US.UTF-8. The files were copied from Windows XP (Italian).

To rule out the content indexing: it does not happen if those documents are renamed to UTF-8 (keeping their Latin-1 contents).

The context menu items are functional: preview/open, find similar docs, etc.

Attached example:
- The archive contains a file named Facolt, followed by 0xE0 (Latin-1 for à), followed by .txt.
- Please unpack, add the directory to the index configuration (stemming language: got the same result with english and italian), and update the index.
- Search for: biblioteca
- The URL column ends with: Facolt
- The URL in the HTML details ends with: Facolt%E0.txt
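For readers following along, the symptoms above can be illustrated with a minimal Python sketch. This is not Recoll's code; the helper names `is_valid_utf8` and `valid_utf8_prefix` are hypothetical and exist only for this illustration. It shows that the file name from the reproducer is not valid UTF-8, that cutting it at the first invalid byte leaves "Facolt", and that percent-encoding the raw bytes gives the form seen in the HTML details.

```python
from urllib.parse import quote

# File name from the reproducer: "Facolt" + 0xE0 (Latin-1 'à') + ".txt"
name = b"Facolt\xe0.txt"


def is_valid_utf8(raw: bytes) -> bool:
    """Return True if raw decodes as strict UTF-8 (hypothetical helper)."""
    try:
        raw.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


def valid_utf8_prefix(raw: bytes) -> bytes:
    """Return the longest leading part of raw that is valid UTF-8
    (hypothetical helper, mimics a display that stops at the bad byte)."""
    try:
        raw.decode("utf-8")
        return raw
    except UnicodeDecodeError as err:
        # err.start is the index of the first undecodable byte
        return raw[:err.start]


print(is_valid_utf8(name))                           # the Latin-1 name
print(valid_utf8_prefix(name))                       # what survives truncation
print(quote(name))                                   # percent-encoded raw bytes
print(is_valid_utf8("Facoltà.txt".encode("utf-8")))  # the renamed file
```

In UTF-8, the byte 0xE0 announces a three-byte sequence, so the following "." cannot be a continuation byte and decoding fails at that position. This is consistent with the observation that renaming the file to UTF-8 makes the messages disappear.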
Thanks a lot for taking the time to qualify the problem and for making it easy for me to fix it. This will be corrected in the next release. Meanwhile, the attached patch can be applied to the Recoll 1.23.x or 1.24.x source for a fix. Note that there will still be some error messages, but at level 4 (debug), so they will not appear in the default configuration.
Created attachment 1483911 [details] Patch to fix the error messages about non UTF-8 URLs
recoll-1.23.7-8.fc29 has been submitted as an update to Fedora 29. https://bodhi.fedoraproject.org/updates/FEDORA-2018-dc579bde5d
recoll-1.23.7-8.fc28 has been submitted as an update to Fedora 28. https://bodhi.fedoraproject.org/updates/FEDORA-2018-597675803c
recoll-1.23.7-8.fc27 has been submitted as an update to Fedora 27. https://bodhi.fedoraproject.org/updates/FEDORA-2018-6baaf41137
recoll-1.23.7-8.fc27 has been pushed to the Fedora 27 testing repository. If problems still persist, please make note of it in this bug report. See https://fedoraproject.org/wiki/QA:Updates_Testing for instructions on how to install test updates. You can provide feedback for this update here: https://bodhi.fedoraproject.org/updates/FEDORA-2018-6baaf41137
recoll-1.23.7-8.fc28 has been pushed to the Fedora 28 testing repository. If problems still persist, please make note of it in this bug report. See https://fedoraproject.org/wiki/QA:Updates_Testing for instructions on how to install test updates. You can provide feedback for this update here: https://bodhi.fedoraproject.org/updates/FEDORA-2018-597675803c
recoll-1.23.7-8.fc29 has been pushed to the Fedora 29 testing repository. If problems still persist, please make note of it in this bug report. See https://fedoraproject.org/wiki/QA:Updates_Testing for instructions on how to install test updates. You can provide feedback for this update here: https://bodhi.fedoraproject.org/updates/FEDORA-2018-dc579bde5d
recoll-1.23.7-8.fc29 has been pushed to the Fedora 29 stable repository. If problems still persist, please make note of it in this bug report.
recoll-1.23.7-8.fc27 has been pushed to the Fedora 27 stable repository. If problems still persist, please make note of it in this bug report.
recoll-1.23.7-8.fc28 has been pushed to the Fedora 28 stable repository. If problems still persist, please make note of it in this bug report.