Oxford University Press needs a data engineer. The job description states:
“To design and implement XML data structures, devise XML strategies, and through expert content analysis, facilitate the electronic and print production, publication, and licensing of OUP’s books, journals and dictionaries content.
• Analyze OUP and licensed texts, and the requirements for their production, online functionality and presentation, and general electronic use, to:
• Determine appropriate data structures to support those requirements (content and metadata),
• Devise quality control rules and processes,
• Design tools and workflows required by staff in Editorial and Production to produce, capture, and enhance the data.
• Write transformation scripts to convert source data into OUP standard data structures that meets the agreed data quality standards for publishing and licensing.”
Requirements for the position include:
“Educated to degree level, or possess equivalent relevant experience
• Content analysis experience (i.e. ability to analyze diverse content and its underlying structure)
• Understanding of the structure of academic and professional content and ability to translate this into XML data models
• Experience of XML and XML tools including ability to devise and enhance data models (DTD and/or XML Schema)
• Excellent communication skills, with the ability to communicate equally with technical and non-technical people
• Experience of working with, writing instructions for, and quality assuring the deliverables from, data suppliers
• Practical knowledge of XSLT, XPath, Unicode, and related technologies (in particular XQuery, Schematron, RDF, DITA)
• Some programming experience (e.g. Perl), and an understanding of the software lifecycle”.