We run a site that posts collections of existing articles rather than new, original publications. Many sites that do this include only reference information and access (URL) links where available (arXiv, INSPIRE, …), but we want users to be able to access and work with the actual content of the papers easily.
However, we neither have nor want to obtain permission to repost the original source content of every article, as that would be difficult and expensive. Instead, like those other sites, we post the metadata for each article as a galley, and users can select it to get a URL for the original content.
This makes access almost as convenient as if we were hosting the content ourselves, while staying within copyright limits.
However, it also means that searching on the site covers only what is actually online, namely the title, authors, summary/abstract, and keywords (are they searched?), and not the article text itself, which is a major restriction.
I am wondering how this is handled in OJS. When a galley of the original content is included, is all such content kept in a repository separate from the article descriptions and then retrieved from the article page on request? If so, indexing and searching would presumably run over that single central repository, whose location is defined somewhere in the code or settings.
In that case, would it be possible for us to retarget the search and indexing code to use our own local repository instead? It is not published, due to the above restrictions, but it can contain local copies of the materials.
If it were possible to coerce the OJS search mechanism into using our local repository, that would achieve the main goal of full-content searching. It would also be easier than building our own local (Lucene-ish) index/search system over the repository, for which we would have to add a new button on the main page (or subsume the current search button).
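To make the fallback option concrete, here is a rough sketch of what I mean by our own index over the local repository. It is only a toy inverted index in Python, not anything from OJS: the function names (`tokenize`, `build_index`, `search`), the article IDs, and the sample texts are all made up for illustration, and a real setup would more likely use Lucene or similar. The point is just that full text from the unpublished local copies is indexed under the same article ID the site already uses, so a hit can be mapped back to the public article record.

```python
import re
from collections import defaultdict

TOKEN_RE = re.compile(r"[a-z0-9]+")


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return TOKEN_RE.findall(text.lower())


def build_index(documents):
    """Build an inverted index: token -> set of article IDs.

    `documents` maps a (hypothetical) article ID to the full text
    extracted from our unpublished local copy of that article.
    """
    index = defaultdict(set)
    for article_id, text in documents.items():
        for token in tokenize(text):
            index[token].add(article_id)
    return index


def search(index, query):
    """Return the sorted IDs of articles containing every query token."""
    tokens = tokenize(query)
    if not tokens:
        return []
    # Start from the first token's postings and intersect with the rest.
    result = set(index.get(tokens[0], set()))
    for token in tokens[1:]:
        result &= index.get(token, set())
    return sorted(result)


# Made-up sample data standing in for text extracted from local copies.
local_texts = {
    101: "Lattice QCD results for the proton mass",
    102: "Neutrino oscillation measurements and mass ordering",
}
index = build_index(local_texts)
hits = search(index, "mass ordering")
```

The search page would then only need to turn each returned ID into a link to the existing article record, so none of the copyrighted text is ever shown.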
Any/all inputs and comments welcome.
TIA