Greetings to all OJS users.
Lens Viewer is a great application for displaying scientific articles in JATS XML.
For our journal I have developed a DOCX to JATS converter, which makes the transformation process more convenient. The project is on GitHub: https://github.com/Vitaliy-1/DOCX2JATS
Because the DOCX (OOXML) format is loosely structured, unlike JATS XML, input articles must be strictly formatted. Also, because the OOXML file does not contain the article metadata, the metadata must be entered manually.
The project uses the TEIC stylesheets for the heavy lifting and Java for finer parsing: the reference list (for now only in AMA and Vancouver citation styles), in-text references, and table and figure labels, titles and captions. Examples of how articles must be formatted in DOCX are in the root directory of the project (article1.docx, article2.docx). For good results, articles MUST follow the same format:
- first-level and second-level headings for sections and subsections;
- a separate reference section for the reference list, with references in AMA or Vancouver style (journal articles, books, book chapters and conference papers are supported);
- lists must be real lists in DOCX (ordered and unordered lists are supported);
- bold and italic text is supported;
- in-text references must use square brackets;
- in-text references to tables and figures are parsed if they are written as "tabl 1.", "table 1", "fig. 1", "figure" or their Cyrillic analogs;
- the table label and title must be placed above tables and figures, in the form:
  Table 1. Boring table title.
- table and figure descriptions must be placed under the table and start with symbol
- UTF-8 encoding is supported.
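To make the in-text rules above more concrete, here is a rough Java sketch of how bracketed citations and table/figure mentions could be recognized with regular expressions. It is only an illustration and is not taken from the DOCX2JATS source: the class name `InTextMentions`, its methods and the exact patterns are assumptions made for this example, and the Cyrillic variants are left out.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; not part of DOCX2JATS.
final class InTextMentions {

    // Bracketed numeric citations such as [1], [2, 3] or [4-6].
    private static final Pattern CITATION =
            Pattern.compile("\\[(\\d+(?:\\s*[,\\-]\\s*\\d+)*)\\]");

    // Mentions such as "tabl 1", "table 1", "fig. 1" or "figure 1" (Latin script only).
    private static final Pattern FLOAT_MENTION =
            Pattern.compile("\\b(tabl(?:e)?|fig(?:ure)?)\\.?\\s*(\\d+)",
                    Pattern.CASE_INSENSITIVE);

    // Returns every cited reference number, expanding ranges like 3-5.
    static List<Integer> citedNumbers(String text) {
        List<Integer> numbers = new ArrayList<>();
        Matcher m = CITATION.matcher(text);
        while (m.find()) {
            for (String part : m.group(1).split(",")) {
                String[] bounds = part.trim().split("\\s*-\\s*");
                int first = Integer.parseInt(bounds[0]);
                int last = Integer.parseInt(bounds[bounds.length - 1]);
                for (int n = first; n <= last; n++) {
                    numbers.add(n);
                }
            }
        }
        return numbers;
    }

    // Returns normalized mentions such as "table 1" or "figure 2".
    static List<String> floatMentions(String text) {
        List<String> mentions = new ArrayList<>();
        Matcher m = FLOAT_MENTION.matcher(text);
        while (m.find()) {
            String kind = m.group(1).toLowerCase(Locale.ROOT).startsWith("tabl")
                    ? "table" : "figure";
            mentions.add(kind + " " + m.group(2));
        }
        return mentions;
    }

    public static void main(String[] args) {
        String sample = "As shown in fig. 2 and Table 1 [1, 3-5], results differ [7].";
        System.out.println(citedNumbers(sample));  // prints [1, 3, 4, 5, 7]
        System.out.println(floatMentions(sample)); // prints [figure 2, table 1]
    }
}
```

This also shows why consistent formatting in the DOCX matters: anything that does not fit such patterns (for example a citation in round brackets) would simply be left as plain text.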
To run the program, Java 8 must be installed. Link to the archive: https://github.com/Vitaliy-1/DOCX2JATS/releases
The archive contains the 1.jar file and the stylesheets folder, which must be unzipped into the same directory. Because I am not an experienced programmer, the article in DOCX format currently has to be placed in this same folder before running the transformation. Suppose the archive is unzipped into the jats folder on drive C, and the input article article1.docx is also placed there. From the Windows command prompt (cmd), go to this folder and enter:
java -jar 1.jar C:\jats\article1.docx article1.xml
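If several articles need to be converted, the same command line can be assembled in code. The following is a minimal sketch under the assumption that the output file is simply named after the input file; `ConverterCommand` and `forInput` are hypothetical names for this example and are not part of DOCX2JATS. It is written for Java 8.

```java
import java.io.File;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

// Hypothetical wrapper for illustration only; not part of DOCX2JATS.
final class ConverterCommand {

    // Builds the argument list for: java -jar 1.jar <input.docx> <output.xml>
    // Assumption: the output name is the input file name with .xml instead of .docx.
    static List<String> forInput(String docxPath) {
        int slash = Math.max(docxPath.lastIndexOf('/'), docxPath.lastIndexOf('\\'));
        String fileName = docxPath.substring(slash + 1);
        String baseName = fileName.toLowerCase(Locale.ROOT).endsWith(".docx")
                ? fileName.substring(0, fileName.length() - ".docx".length())
                : fileName;
        return Arrays.asList("java", "-jar", "1.jar", docxPath, baseName + ".xml");
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 2) {
            System.err.println("usage: ConverterCommand <folder with 1.jar> <input.docx>");
            return;
        }
        // Run from the folder that holds 1.jar and the stylesheets folder.
        Process process = new ProcessBuilder(forInput(args[1]))
                .directory(new File(args[0]))
                .inheritIO()
                .start();
        System.exit(process.waitFor());
    }
}
```

For the example above, `forInput("C:\\jats\\article1.docx")` produces the same five arguments as the command typed by hand, and the working directory would be C:\jats.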
The converter does not parse metadata or formulas, and tables may need some correction. If the DOCX article is formatted as described, the whole manual correction takes about 30 minutes (in our case). Maybe someone else will find this converter useful.
It should be noted that we use the latest version of Lens Viewer, and the converter produces JATS XML according to what that version supports.