Hi. I built an xml file to import html files into ojs 3.1.2.1. It works fine for 99% of the html files (all from the same website) but there are a couple that when pulled get corrupted.
The file contents looks like this: ‹ í½a`I–
I checked the physical file as well not just the file via the web browser and it’s corrupted.
Everything else about the record looks fine (abstract, author data etc). Any Idea what this could be?
Here is the xml:
<issues xmlns="http://pkp.sfu.ca" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://pkp.sfu.ca native.xsd">
<issue published="1">
<issue_identification>
<number>89</number><year>2018</year><title>Number 89: Spring 2018</title></issue_identification>
<date_published>2020-02-18</date_published>
<sections> <section ref="ART"> <abbrev>ART</abbrev> <policy/> <title>Articles</title> </section>
</sections>
<articles>
<article section_ref="ART" stage="production" date_published="2020-02-18" seq="1" language="">
<id type="doi" advice="update">10.5062/F4668BF1</id>
<title locale="en_US">Citation Analysis of Ph.D. Theses at Faculty of Science, University of Ibadan, Nigeria</title>
<abstract>The authors analyzed 21,005 ..... </abstract>
<subjects><subject>Citation analysis</subject></subjects>
<authors>
<author user_group_ref="Author" include_in_browse="true" primary_contact="true">
<givenname>Malik </givenname><familyname></familyname><email>removedForPrivacy@gmail.com</email>
</author>
<author user_group_ref="Author" include_in_browse="true">
<givenname>Wole </givenname><familyname></familyname><email>removedForPrivacy@yahoo.co.uk</email>
</author>
</authors>
<submission_file id="1" stage="proof">
<revision genre="Article Text" number="1" filetype="text/html" filename="refereed3.html">
<name>refereed3.html</name>
<href src="http://www.istl.org/18-spring/refereed3.html" mime_type="text/html"/>
</revision>
</submission_file>
<article_galley>
<name>HTML</name>
<seq>1</seq>
<submission_file_ref id="1" revision="1"/>
</article_galley>
<pages></pages>
</article>
</articles>
</issue>
</issues>