Difference between revisions of "User:Dmsmith/KJV 2.6"
David Haslam (talk | contribs) (→See also: == OSIS rdg element ==) |
David Haslam (talk | contribs) (→OSIS rdg element: Many of the study notes record more literal renderings of the Hebrew or Greek text. We might wish to wrap all such readings within the OSIS '''rdg''' element. e.g. <note type=") |
Revision as of 16:03, 14 January 2016
This page is for recommended changes to the KJV module version 2.6 (or later).
Contents
- 1 Punctuation
- 2 Words of Christ
- 3 Added words
- 4 Tagging the Divine Name
- 5 Cross references?
- 6 Reference text policy
- 7 Hebrew words
- 8 Notes appertaining to Psalm titles
- 9 Psalm 119 Acrostic Stanza Titles
- 10 Multiple whitespace
- 11 Missing punctuation in notes
- 12 Add language identifier for foreign element
- 13 Hyphens
- 14 Selah
- 15 Punctuation and Strongs
- 16 OSIS catchWord element
- 17 OSIS rdg element
- 18 See also
Punctuation
1 Cor 15:27 The comma at "him, it is" should not be italicized. There should be a comma between "excepted which". --Dmsmith 11:19, 16 February 2014 (MST)
- Done --Dmsmith 17:54, 18 February 2014 (MST)
Words of Christ
The Old Scofield highlights Words of Christ (WoC) only as they come directly from his mouth: not what others say he said, and not translations of what he said, such as translations from Aramaic. In 2.6, there are 3 known errors in markup (there may be more). Red is what should be WoC; black is currently red, but shouldn't be:
Mat 8:26 And he saith unto them, Why are ye fearful, O ye of little faith? Then he arose, and rebuked the winds and the sea; and there was a great calm.
Mat 19:18 He saith unto him, Which? Jesus said, Thou shalt do no murder, Thou shalt not commit adultery, Thou shalt not steal, Thou shalt not bear false witness,
Act 1:4 And, being assembled together with them, commanded them that they should not depart from Jerusalem, but wait for the promise of the Father, which, saith he, ye have heard of me.
- Done--Dmsmith 06:56, 19 February 2014 (MST)
Added words
Added words & punctuation
Split all transChange elements that contain punctuation marks, so that the punctuation and following space is normal text. 1 Cor 15:27 is one example of many such occurrences.
- Found and fixed:
- 67 ,
- 10 ;
- 6 :
- 1 ?
--Dmsmith 18:38, 18 February 2014 (MST)
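The split described above could be scripted along these lines. This is only a sketch, not the script that was actually used for the fix: the function name split_added is hypothetical, and it assumes the transChange element holds plain text (no nested elements) and always carries type="added".

```python
import re

# Matches an added-words transChange whose content is plain text only.
TC = re.compile(r'<transChange type="added">([^<]*)</transChange>')
# The four punctuation marks reported above, plus any following whitespace.
PUNCT = re.compile(r'([,;:?]\s*)')

def split_added(xml):
    """Move punctuation (and the space after it) out of transChange elements."""
    def repl(match):
        # re.split with a capturing group keeps the separators at odd indexes.
        parts = PUNCT.split(match.group(1))
        out = []
        for i, part in enumerate(parts):
            if not part:
                continue
            if i % 2 == 1:
                out.append(part)  # punctuation becomes normal text
            else:
                out.append('<transChange type="added">' + part + '</transChange>')
        return ''.join(out)
    return TC.sub(repl, xml)
```

A transChange that contains no punctuation is left unchanged, so the function can be run over the whole source more than once.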
Added words & Strong's
Review and correct each instance in which a w element for Strong's & Morph is found within a transChange element. The markup probably belongs to the preceding word. The following are the instances, each shown with any preceding word(s) that are not contained by a <w> element.
Gen 14.10 <transChange type="added"><w lemma="strong:H0875">was full of</w></transChange>
Exod 15.12 <transChange type="added"><w lemma="strong:H02098">which</w></transChange>
Exod 15.16 <transChange type="added"><w lemma="strong:H02098">which</w></transChange>
Exod 34.19 <transChange type="added"><w lemma="strong:H02142" morph="strongMorph:TH8735">that is male</w></transChange>
Num 1.16 These <transChange type="added"><w lemma="strong:H07148">were</w></transChange>
Num 3.19 These <transChange type="added"><w lemma="strong:H01992">are</w></transChange>
Num 10.28 Thus <transChange type="added"><w lemma="strong:H0428">were</w></transChange>
Num 13.3 <transChange type="added"><w lemma="strong:H01992">were</w></transChange>
Num 14.28 unto them, <transChange type="added"><w lemma="strong:H03808">As truly as</w></transChange>
Num 20.13 This <transChange type="added"><w lemma="strong:H01992">is</w></transChange>
1Sam 30.27 To <transChange type="added"><w lemma="strong:H0834">them</w></transChange>
2Kgs 19.31 <transChange type="added"><w lemma="strong:H06635" morph="strongMorph:TH8675">of hosts</w></transChange>
2Chr.10.16 <transChange type="added"><w lemma="strong:H07200" morph="strongMorph:TH8804">saw</w></transChange>
Ezra 2.65 and <transChange type="added"><w lemma="strong:H0428">there were</w></transChange>
Ps.17.6 unto me <transChange type="added"><w lemma="strong:H08085" morph="strongMorph:TH8798">and hear</w></transChange>
Ps 39.3 <transChange type="added"><w lemma="strong:H0227">then</w></transChange>
Jer 6.14 <transChange type="added"><w lemma="strong:H01323" morph="strongMorph:TH8676">of the daughter</w></transChange>
Jer 28.9 <transChange type="added"><w lemma="strong:H0227">then</w></transChange>
Jer.51.53 <transChange type="added"><w lemma="strong:H0227">yet</w></transChange>
- I'll need some help determining how these should be changed, if at all. It may be that the KJV uses italics for a purpose other than "added" words. --Dmsmith 19:33, 18 February 2014 (MST)
- Italics is presentational formatting. The transChange element is semantic. David Haslam 05:19, 19 February 2014 (MST)
- The 1611 and 1769 editions of the KJV didn't have an eText with semantic markup. Any semantic markup we have deduces the intention of the authors/printers from the orthographic representation. --Dmsmith 07:00, 19 February 2014 (MST)
- We submitted this list to David Instone-Brewer and received a detailed reply. David Haslam 06:37, 17 September 2015 (MDT)
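For anyone wanting to regenerate the list of instances, a search of roughly this shape may be enough. It is a sketch under stated assumptions: the name find_nested_w is hypothetical, and the pattern only catches a w element that is the sole child of a transChange with type="added".

```python
import re

# A <w> element that is the only content of an added-words transChange.
NESTED_W = re.compile(
    r'<transChange type="added">\s*(<w\b[^>]*>[^<]*</w>)\s*</transChange>'
)

def find_nested_w(xml):
    """Return every <w> element found directly inside an added transChange."""
    return NESTED_W.findall(xml)
```

The preceding untagged words shown in the list above would still need to be collected by hand or with a wider context window.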
Added words & the Divine Name
We found one instance of the Divine Name element within a transChange element. This is probably inappropriate.
- The one example is in 2 Chronicles 17:4. It is rendered in italics and small caps. So accordingly it is an added word representing the tetragrammaton. It is represented properly in OSIS. --Dmsmith 17:49, 18 February 2014 (MST)
Tagging the Divine Name
This is much more complicated than we thought. Observations:
- The divine name is also tagged within some study notes (even twice within the same note a few times).
- More than one Strong's number is involved.
- Five instances are in the NT where the Greek word κυριος is tagged.
- There is one instance where two Strong's numbers are joined to the divine name.
- In many places, there are some English words between the Strong's tag and the divine name tag.
- There are places where the divine name is tagged, even though it is within a transChange element (see previous subsection).
- The three hyphenated forms of the divine name (Jehovah–jireh, Jehovah–nissi, Jehovah–shalom) are not tagged in the main text, only in the study notes.
- The other two hyphenated forms of the divine name (Jehovah–shammah, Jehovah–tsidkenu) occur only in the study notes, where the English word (Lord) is tagged.
- Divine Name tagging in the KJV follows the "small caps" orthographic representation of Lord, God, Yah. As such, it is found in added words and notes not being associated with Strong's Numbers.
- The Strong's Numbers tagged are H3068, H3069, H3072 and H3050. The first is the tetragrammaton. The second and third are variations of it. The last is Yah.
- In the NT, the orthographic representation of Lord as the divine name is backed by Greek, not Hebrew.
- The instance of two Strong's Numbers being associated with divine name needs to be reviewed. The leading word is "face", often translated "before" or "presence".
- Jer.26.19 <w morph="strongMorph:TH8762" lemma="strong:H02470">and besought</w> <w lemma="strong:H06440 strong:H03068">the <divineName>Lord</divineName></w>
- Here the leading word is left untranslated.
Cross references?
Sadly lacking from our KJV module are any scripture cross-references. Many printed editions of the AV contain such references. We should explore how the module might be enhanced by obtaining the data from a suitable electronic source.
[1] is of interest in this context, but see the foot of [2] which describes the sources of the data.
The 1769 edition of the KJV included Benjamin Blayney's cross-references. Many of the OT references therein were to the Deuterocanonical books. See also [3].
My late mother's 1936 Collins edition of the KJV has centre margins with notes and cross-references. Of particular interest is that the cross-reference tags are italicised lowercase superscript letters and the note tags are superscripted integers. These are positioned at the start of each word being referenced. This practice differs from how many of our modules are marked up, where the tag is often placed at the end of the word being referenced. David Haslam
One cross-reference already
The note within II Samuel 23:8 already contains a cross-reference! It should be converted to a proper OSIS xref, and the conf file should be updated to include GlobalOptionFilter=OSISScripref.
<note type="study">1ch 11:11 he lift…: from whom he…: Heb. slain</note>
David Haslam 08:42, 14 January 2016 (MST)
Reference text policy
It may be sensible to review whether we chose the most suitable published text as our reference standard. The most widely accepted one is the Cambridge University Press - Concord Reference Bible.
- Need two things: an e-text and permission for the text. (I think the "crown" claims copyright.) --Dmsmith 19:36, 18 February 2014 (MST)
- Crown Copyright applies to the Authorised Version per se, not just to those printed by CUP, who are merely one of the licensed printers for all the works that come under Royal Letters Patent. Refer to our Copyright page David Haslam 05:08, 19 February 2014 (MST)
Hebrew words
Following the addition of Greek words in the NT from the TR in version 2.6, is it planned to do likewise for Hebrew (& Aramaic) words in the OT from the MT?
- It would be wonderful. However, the tagging of Strong's numbers provided a map to the TR in the src="x y" attribute, where that gave the position of the word in the TR. So, the addition of the Greek was trivial. We have nothing like that for the MT. It won't be trivial. Also, the Strong's tagging in the OT is not comprehensive. In any given verse only some of the words from the MT are tagged. In the NT all the TR words were present in the tagging, even if empty (i.e. untranslated.)
- We are more likely to update the morphology of the OT first for those that have some kind of morphology today.
- --Dmsmith 09:31, 25 February 2014 (MST)
Notes appertaining to Psalm titles
Study notes appertaining to text within a Psalm title are currently placed at the end of verse 1, just like any other note. To prevent these notes being orphaned when headings are hidden, it is proposed to move these notes to within the title element. As some Psalms also have one or more notes appertaining to text within verse 1, this change will require careful manual editing, rather than automating by a script.
Psalm 119 Acrostic Stanza Titles
The 22 Hebrew letter acrostic titles in Psalm 119 should be displayed before the first verse of each eight-verse stanza. Currently, the next verse tag is displayed before each stanza title. This is incorrect when compared to the KJV printed edition. The mod2imp output for the first such title is:
$$$Psalms 119:1 <title canonical="true" type="acrostic"><foreign n="א">ALEPH.</foreign></title> <w lemma="strong:H0835">Blessed</w> <transChange type="added">are</transChange> <w lemma="strong:H08549">the undefiled</w> <w lemma="strong:H01870">in the way</w>, <w lemma="strong:H01980" morph="strongMorph:TH8802">who walk</w> <w lemma="strong:H08451">in the law</w> <w lemma="strong:H03068">of the <divineName>Lord</divineName></w>. <note type="study">undefiled: or, perfect, or, sincere</note>
Do we need to wrap each stanza title between suitably constructed milestone preverse div elements?
- The preverse div should never be constructed in xml. It is created by osis2mod.
- Done in version 2.8----Dmsmith 05:33, 20 December 2015 (MST)
- Did you just move the titles to before the stanza? David Haslam 05:43, 20 December 2015 (MST)
- So far, yes. I'm testing it. I may have to put it in a section div or change osis2mod. If a div, that would be the first instance of a section div, which may add vertical whitespace that is not present elsewhere in the KJV.--Dmsmith 05:46, 20 December 2015 (MST)
Multiple whitespace
Within the text source for 2.7 (kjvfull.xml) there are 39 instances of double spaces (outside the header):
- 36 are immediately after the "w" in a w element
- 2 are after a closing " but within a w element
- 1 is between two w elements; in Phil.4.2, and is displayed by SWORD as
that they be of the same mind
The last of these should be corrected.
- Done in version 2.8----Dmsmith 05:31, 20 December 2015 (MST)
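A check of this kind is easy to repeat on later revisions of the source. The following is a minimal sketch; the function names are hypothetical, and the collapse step deliberately touches only double spaces between two w elements (the Phil.4.2 case), leaving whitespace inside tags alone.

```python
import re

DOUBLE_SPACE = re.compile(r' {2,}')
BETWEEN_W = re.compile(r'(</w>) {2,}(<w\b)')

def find_double_spaces(text, context=15):
    """Return (offset, snippet) for every run of two or more spaces."""
    hits = []
    for m in DOUBLE_SPACE.finditer(text):
        start = max(0, m.start() - context)
        hits.append((m.start(), text[start:m.end() + context]))
    return hits

def collapse_between_w(xml):
    """Reduce multiple spaces between adjacent w elements to a single space."""
    return BETWEEN_W.sub(r'\1 \2', xml)
```

Excluding the header from the search, as was done for the count above, would need an extra step that is not shown here.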
Missing punctuation in notes
- There are 3 study notes that contain the abbreviation "Heb" with no full-stop after the abbreviation. The locations are 2Chr.2.16, Isa.9.20, Jer.13.21
- Done in version 2.8----Dmsmith 05:36, 20 December 2015 (MST)
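To guard against the same slip reappearing, the study notes could be scanned with something like the sketch below. The name notes_missing_stop is hypothetical, and the pattern assumes the notes are written as <note type="study">…</note> exactly as in the examples on this page.

```python
import re

NOTE = re.compile(r'<note type="study">(.*?)</note>', re.S)
# "Heb" as a whole word that is not followed by a full-stop.
HEB_NO_STOP = re.compile(r'\bHeb\b(?!\.)')

def notes_missing_stop(xml):
    """Return the text of each study note with an unpunctuated "Heb"."""
    return [n for n in NOTE.findall(xml) if HEB_NO_STOP.search(n)]
```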
Add language identifier for foreign element
Suggest add the following attribute to the foreign element in each acrostic title:
xml:lang="hbo"
Refer to OSIS Reference Manual.
- Done in version 2.8--Dmsmith 06:18, 20 December 2015 (MST)
Should also identify and add other foreign elements.--Dmsmith 06:18, 20 December 2015 (MST)
- MENE, MENE, TEKEL, UPHARSIN (what language code?) David Haslam 10:00, 20 December 2015 (MST)
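For the acrostic titles, the attribute could be added mechanically. The sketch below is one possible way, not the method used for 2.8: add_hebrew_lang is a hypothetical name, and it assumes each acrostic title opens with a foreign element, as in the mod2imp output quoted earlier on this page.

```python
import re

# The <foreign> start tag that opens an acrostic title,
# skipped if it already carries an xml:lang attribute.
FOREIGN_IN_ACROSTIC = re.compile(
    r'(<title\b[^>]*type="acrostic"[^>]*>\s*<foreign\b)(?![^>]*xml:lang)([^>]*>)'
)

def add_hebrew_lang(xml):
    """Add xml:lang="hbo" to the foreign element in each acrostic title."""
    return FOREIGN_IN_ACROSTIC.sub(r'\1 xml:lang="hbo"\2', xml)
```

Running it a second time changes nothing, because tags that already have xml:lang are skipped. Other foreign elements, such as the MENE, MENE, TEKEL, UPHARSIN case, are outside its scope.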
Hyphens
As discussed before under Hyphenation, only five words in the NT use a hyphen/minus, eleven occurrences in total for the whole Bible. Seeing as the text of the KJV module already requires a font that includes the en dash (U+2013), and thus is not restricted to ASCII, I see no reason why we shouldn't replace these hyphen/minus by the proper Unicode character for hyphen, U+2010. The five words are:
3 God-ward
1 joint-heirs
1 thee-ward
3 us-ward
3 you-ward
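Because only five distinct words are involved, the replacement can be restricted to those words so that hyphen/minus characters in attribute values (such as x-extra-p) are left alone. A minimal sketch, assuming each word appears unbroken in the source text (that is, not split across w elements); the function name is hypothetical.

```python
# The five hyphenated words listed above.
HYPHENATED = ["God-ward", "joint-heirs", "thee-ward", "us-ward", "you-ward"]

def use_unicode_hyphen(text):
    """Replace hyphen/minus (U+002D) with HYPHEN (U+2010) in the five words only."""
    for word in HYPHENATED:
        text = text.replace(word, word.replace("-", "\u2010"))
    return text
```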
Selah
There are 75 instances of the whole word "Selah" in the KJV. The first is in II Kings 14:7. The rest are found in Psalms (71) and Habakkuk (3). Of those in Psalms, these 13 locations are peculiar in that the Strong's markup includes other words besides Selah.
<w lemma="strong:H05542">thereof. Selah</w>.
<w lemma="strong:H05542">me. Selah</w>.
<w lemma="strong:H05542">himself. Selah</w>.
<w lemma="strong:H05542">before them. Selah</w>.
<w lemma="strong:H05542">for us. Selah</w>.
<w lemma="strong:H05542">themselves. Selah</w>.
<w lemma="strong:H05542">upon us; Selah</w>.
<w lemma="strong:H05542">of it. Selah</w>.
<w lemma="strong:H05542">thee. Selah</w>.
<w lemma="strong:H05542">there. Selah</w>.
<w lemma="strong:H05542">thee? Selah</w>.
<w lemma="strong:H05542">for me. Selah</w>.
<w lemma="strong:H05542">themselves. Selah</w>.
The full XML context for these is:
<verse osisID="Ps.46.3" sID="Ps.46.3"/><transChange type="added">Though</transChange> <w lemma="strong:H04325">the waters</w> <w morph="strongMorph:TH8799" lemma="strong:H01993">thereof roar</w> <transChange type="added">and</transChange> <w morph="strongMorph:TH8799" lemma="strong:H02560">be troubled</w>, <transChange type="added">though</transChange> <w lemma="strong:H02022">the mountains</w> <w morph="strongMorph:TH8799" lemma="strong:H07493">shake</w> <w lemma="strong:H01346">with the swelling</w> <w lemma="strong:H05542">thereof. Selah</w>.<verse eID="Ps.46.3"/>
<verse osisID="Ps.49.15" sID="Ps.49.15"/><milestone type="x-extra-p"/><w lemma="strong:H0430">But God</w> <w morph="strongMorph:TH8799" lemma="strong:H06299">will redeem</w> <w lemma="strong:H05315">my soul</w> <w lemma="strong:H03027">from the power</w> <w lemma="strong:H07585">of the grave</w>: <w morph="strongMorph:TH8799" lemma="strong:H03947">for he shall receive</w> <w lemma="strong:H05542">me. Selah</w>.<note type="study">power: Heb. hand</note><note type="study">the grave: or, hell</note><verse eID="Ps.49.15"/>
<verse osisID="Ps.50.6" sID="Ps.50.6"/><w lemma="strong:H08064">And the heavens</w> <w morph="strongMorph:TH8686" lemma="strong:H05046">shall declare</w> <w lemma="strong:H06664">his righteousness</w>: <w lemma="strong:H0430">for God</w> <transChange type="added">is</transChange> <w morph="strongMorph:TH8802" lemma="strong:H08199">judge</w> <w lemma="strong:H05542">himself. Selah</w>.<verse eID="Ps.50.6"/>
<verse osisID="Ps.54.3" sID="Ps.54.3"/><w morph="strongMorph:TH8801" lemma="strong:H02114">For strangers</w> <w morph="strongMorph:TH8804" lemma="strong:H06965">are risen up</w> <w lemma="strong:H06184">against me, and oppressors</w> <w morph="strongMorph:TH8765" lemma="strong:H01245">seek</w> <w lemma="strong:H05315">after my soul</w>: <w morph="strongMorph:TH8804" lemma="strong:H07760">they have not set</w> <w lemma="strong:H0430">God</w> <w lemma="strong:H05542">before them. Selah</w>.<verse eID="Ps.54.3"/>
<verse osisID="Ps.62.8" sID="Ps.62.8"/><milestone type="x-extra-p"/><w morph="strongMorph:TH8798" lemma="strong:H0982">Trust</w> <w lemma="strong:H06256">in him at all times</w>; <transChange type="added">ye</transChange> <w lemma="strong:H05971">people</w>, <w morph="strongMorph:TH8798" lemma="strong:H08210">pour out</w> <w lemma="strong:H03824">your heart</w> <w lemma="strong:H06440">before</w> <w lemma="strong:H0430">him: God</w> <transChange type="added">is</transChange> <w lemma="strong:H04268">a refuge</w> <w lemma="strong:H05542">for us. Selah</w>.<verse eID="Ps.62.8"/>
<verse osisID="Ps.66.7" sID="Ps.66.7"/><w morph="strongMorph:TH8802" lemma="strong:H04910">He ruleth</w> <w lemma="strong:H01369">by his power</w> <w lemma="strong:H05769">for ever</w>; <w lemma="strong:H05869">his eyes</w> <w morph="strongMorph:TH8799" lemma="strong:H06822">behold</w> <w lemma="strong:H01471">the nations</w>: <w morph="strongMorph:TH8802" lemma="strong:H05637">let not the rebellious</w> <w morph="strongMorph:TH8686 strongMorph:TH8675 strongMorph:TH8799" lemma="strong:H07311 strong:H07311">exalt</w> <w lemma="strong:H05542">themselves. Selah</w>.<verse eID="Ps.66.7"/>
<title type="psalm" canonical="true"><w morph="strongMorph:TH8764" lemma="strong:H05329">To the chief Musician</w> <w lemma="strong:H05058">on Neginoth</w>, <w lemma="strong:H04210">A Psalm</w> <transChange type="added">or</transChange> <w lemma="strong:H07892">Song</w>.</title><verse osisID="Ps.67.1" sID="Ps.67.1"/><w lemma="strong:H0430">God</w> <w morph="strongMorph:TH8799" lemma="strong:H02603">be merciful</w> <w morph="strongMorph:TH8762" lemma="strong:H01288">unto us, and bless</w> us; <transChange type="added">and</transChange> <w lemma="strong:H06440">cause his face</w> <w morph="strongMorph:TH8686" lemma="strong:H0215">to shine</w> <w lemma="strong:H05542">upon us; Selah</w>.<note type="study">chief…: or, overseer</note><note type="study">upon: Heb. with</note><verse eID="Ps.67.1"/>
<verse osisID="Ps.75.3" sID="Ps.75.3"/><w lemma="strong:H0776">The earth</w> <w morph="strongMorph:TH8802" lemma="strong:H03427">and all the inhabitants</w> <w morph="strongMorph:TH8737" lemma="strong:H04127">thereof are dissolved</w>: <w morph="strongMorph:TH8765" lemma="strong:H08505">I bear up</w> <w lemma="strong:H05982">the pillars</w> <w lemma="strong:H05542">of it. Selah</w>.<verse eID="Ps.75.3"/>
<verse osisID="Ps.84.4" sID="Ps.84.4"/><w lemma="strong:H0835">Blessed</w> <transChange type="added">are</transChange> <w morph="strongMorph:TH8802" lemma="strong:H03427">they that dwell</w> <w lemma="strong:H01004">in thy house</w>: <w morph="strongMorph:TH8762" lemma="strong:H01984">they will be still praising</w> <w lemma="strong:H05542">thee. Selah</w>.<verse eID="Ps.84.4"/>
<verse osisID="Ps.87.6" sID="Ps.87.6"/><w lemma="strong:H03068">The <divineName>Lord</divineName></w> <w morph="strongMorph:TH8799" lemma="strong:H05608">shall count</w>, <w morph="strongMorph:TH8800" lemma="strong:H03789">when he writeth up</w> <w lemma="strong:H05971">the people</w>, <transChange type="added">that</transChange> this <transChange type="added">man</transChange> <w morph="strongMorph:TH8795" lemma="strong:H03205">was born</w> <w lemma="strong:H05542">there. Selah</w>.<verse eID="Ps.87.6"/>
<verse osisID="Ps.88.10" sID="Ps.88.10"/><milestone type="x-extra-p"/><w morph="strongMorph:TH8799" lemma="strong:H06213">Wilt thou shew</w> <w lemma="strong:H06382">wonders</w> <w morph="strongMorph:TH8801" lemma="strong:H04191">to the dead</w>? <w lemma="strong:H07496">shall the dead</w> <w morph="strongMorph:TH8799" lemma="strong:H06965">arise</w> <transChange type="added">and</transChange> <w morph="strongMorph:TH8686" lemma="strong:H03034">praise</w> <w lemma="strong:H05542">thee? Selah</w>.<verse eID="Ps.88.10"/>
<verse osisID="Ps.140.5" sID="Ps.140.5"/><w lemma="strong:H01343">The proud</w> <w morph="strongMorph:TH8804" lemma="strong:H02934">have hid</w> <w lemma="strong:H06341">a snare</w> <w lemma="strong:H02256">for me, and cords</w>; <w morph="strongMorph:TH8804" lemma="strong:H06566">they have spread</w> <w lemma="strong:H07568">a net</w> <w lemma="strong:H04570 strong:H03027">by the wayside</w>; <w morph="strongMorph:TH8804" lemma="strong:H07896">they have set</w> <w lemma="strong:H04170">gins</w> <w lemma="strong:H05542">for me. Selah</w>.<verse eID="Ps.140.5"/>
<verse osisID="Ps.140.8" sID="Ps.140.8"/><milestone type="x-extra-p"/><w morph="strongMorph:TH8799" lemma="strong:H05414">Grant</w> <w lemma="strong:H03068">not, O <divineName>Lord</divineName></w>, <w lemma="strong:H03970">the desires</w> <w lemma="strong:H07563">of the wicked</w>: <w morph="strongMorph:TH8686" lemma="strong:H06329">further</w> <w lemma="strong:H02162">not his wicked device</w>; <transChange type="added">lest</transChange> <w morph="strongMorph:TH8799" lemma="strong:H07311">they exalt</w> <w lemma="strong:H05542">themselves. Selah</w>.<note type="study">lest…: or, let them not be exalted</note><verse eID="Ps.140.8"/>
These words & punctuation do not belong properly to the "Selah" but are part of the preceding sentence or phrase. It may therefore be sensible to convert them like this:
thereof. <w lemma="strong:H05542">Selah</w>.
me. <w lemma="strong:H05542">Selah</w>.
himself. <w lemma="strong:H05542">Selah</w>.
before them. <w lemma="strong:H05542">Selah</w>.
for us. <w lemma="strong:H05542">Selah</w>.
themselves. <w lemma="strong:H05542">Selah</w>.
upon us; <w lemma="strong:H05542">Selah</w>.
of it. <w lemma="strong:H05542">Selah</w>.
thee. <w lemma="strong:H05542">Selah</w>.
there. <w lemma="strong:H05542">Selah</w>.
thee? <w lemma="strong:H05542">Selah</w>.
for me. <w lemma="strong:H05542">Selah</w>.
themselves. <w lemma="strong:H05542">Selah</w>.
This issue was already communicated by email on 2015-09-09. David Haslam
- Although OSIS defines an attribute value type="selah", this only applies to the poetry line element l, none of which are used in the KJV.
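The proposed conversion of the Selah tags is regular enough to script. The following is a sketch only: narrow_selah is a hypothetical name, and the pattern assumes the w element carries just the lemma attribute strong:H05542 and plain text, as in all 13 instances listed above.

```python
import re

# A Selah w element that also spans one or more preceding words.
SELAH_SPAN = re.compile(r'<w lemma="strong:H05542">([^<]+?) Selah</w>')

def narrow_selah(xml):
    """Move the words before "Selah" out of the Strong's H05542 w element."""
    return SELAH_SPAN.sub(r'\1 <w lemma="strong:H05542">Selah</w>', xml)
```

The displaced words end up untagged, which matches the conversion shown above; whether they should instead be merged into the preceding w element is a separate editorial decision.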
Punctuation and Strongs
A much more general issue was also reported. Namely, tagged w elements that span beyond the end of a sentence or phrase. Many of these can be identified by the fact that the spanned text includes at least one terminating punctuation mark [.,;:!?)]. Some of these even contain two or more such punctuation marks, so devising a regexp is a bit fraught. Moreover, for some of those that have a comma, it may be perfectly valid to include the preceding word[s]. Less likely for the other punctuation marks.
Searching for different regexps such as [>.+\?.+</w>] I counted the following:
Count Punctuation mark
219 Full-stop
7646 Comma (of which 444 have two or more commas)
1215 Colon
1064 Semicolon
22 Exclamation mark
254 Question mark
11 Right parenthesis
13 Left parenthesis (all these also contain another pm)
It's often the case that the English word that matches the Strong's tag is the last word before the </w>. Even so, I have not proven that this applies to 100% of the above patterns.
This issue was also reported by email on 2015-09-10. David Haslam
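A per-element count is less fragile than a single regexp over the whole file. The sketch below shows one way to tally w elements by the punctuation marks they contain. It is not the search that produced the figures above and may not reproduce them: count_spanned_punctuation is a hypothetical name, and w elements with child elements (such as divineName) are skipped by the [^<]* pattern.

```python
import re
from collections import Counter

# Text content of w elements that have no child elements.
W_TEXT = re.compile(r'<w\b[^>]*>([^<]*)</w>')
MARKS = ".,;:!?()"

def count_spanned_punctuation(xml):
    """Count, per punctuation mark, the w elements whose text contains it."""
    counts = Counter()
    for text in W_TEXT.findall(xml):
        for mark in MARKS:
            if mark in text:
                counts[mark] += 1
    return counts
```

Each w element is counted at most once per mark, so an element with two commas adds one to the comma total.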
OSIS catchWord element
Most of the study notes in the KJV source text have recognisable catch words. These should be marked up using the OSIS catchWord element. e.g.
<note type="study">the light from…: Heb. between the light and between the darkness</note>
should become
<note type="study"><catchWord>the light from…:</catchWord> Heb. between the light and between the darkness</note>
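A first pass at this markup could be automated. The sketch below wraps the text up to and including the first colon of each study note; mark_catch_word is a hypothetical name, and the colon rule is an assumption drawn from the example above. Notes with two catch words, or with a leading reference such as the one in II Samuel 23:8, would be handled wrongly and need manual review.

```python
import re

# Start of a study note, followed by plain text up to the first colon.
CATCH = re.compile(r'(<note type="study">)([^<:]+:)')

def mark_catch_word(xml):
    """Wrap the leading catch word of each study note in a catchWord element."""
    return CATCH.sub(r'\1<catchWord>\2</catchWord>', xml)
```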
OSIS rdg element
Many of the study notes record more literal renderings of the Hebrew or Greek text. We might wish to wrap all such readings within the OSIS rdg element. e.g.
<note type="study">And the evening…: Heb. And the evening was, and the morning was etc.</note>
would become
<note type="study"><catchWord>And the evening…:</catchWord> Heb. <rdg>And the evening was, and the morning was</rdg> etc.</note>
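The simplest case, where a "Heb." reading runs to the end of the note (optionally followed by "etc."), might be wrapped as sketched below. The name mark_reading is hypothetical; Greek readings, "or," alternatives and notes with several readings are not covered and are left untouched.

```python
import re

# "Heb. " followed by plain text up to the end of the note,
# with a trailing " etc." kept outside the reading.
HEB_READING = re.compile(r'(Heb\. )([^<]+?)( etc\.)?(</note>)')

def mark_reading(xml):
    """Wrap a Hebrew reading that ends a study note in an rdg element."""
    return HEB_READING.sub(r'\1<rdg>\2</rdg>\3\4', xml)
```

It works whether or not the catch word has already been wrapped in a catchWord element, since it only looks at the text from "Heb." onwards.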