I am using OJS 3.1.2.1 . All articles and issues incl. the PDFs in this journal have been generated by a custom XML-generator and imported.
In the config.inc.php
, the line
index[application/pdf] = "/usr/bin/pdftotext -enc UTF-8 -nopgbrk %s - | /usr/bin/tr '[:cntrl:]' ' '"
is uncommented and I cleared the cache/_db/
multiple times.
When I run php tools/rebuildSearchIndex.php
, the scripts runs without errors, but words from the PDF files are not included (I checked the database table submission_search_keyword_list
) and tested it on the website.
Then, I checked the submission files and all files that where originally .pdf
files are now .txt
files in the submission folder. Changing the index line in the config.inc.php
to
index[plain/text] = "/usr/bin/pdftotext -enc UTF-8 -nopgbrk %s - | /usr/bin/tr '[:cntrl:]' ' '"
had no effect.
Subsequently, I checked the file_type
column in the database table submission_files
. This says for all submissions application/pdf
(which is correct).
Now, I am running out of ideas. Where is the type defined to process the submission files? Or what can I do?
Any help is appreciated!
Cheers,
Adrian