Difference between revisions of "DevTools:conf Files"
David Haslam (talk | contribs) m (→Required Elements: italics) |
(→Copyright & Licensing related elements) |
||
(122 intermediate revisions by 5 users not shown) | |||
Line 1: | Line 1: | ||
− | + | This page describes important information about module configuration (.conf) files. All SWORD modules require one. | |
− | |||
− | |||
− | + | Please look also at our detailed and commented [[Tutorial:Writing Conf files|tutorial]]. | |
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
== Key elements of a SWORD module.conf == | == Key elements of a SWORD module.conf == | ||
− | |||
− | Some keys can be repeated. | + | Some keys can be repeated. Many not. |
Some can have values that span more than 1 line with '\' at the end of a line indicating that the text on the next line continues the value. Don't use continuation unless allowed. It will produce different results in different front ends. | Some can have values that span more than 1 line with '\' at the end of a line indicating that the text on the next line continues the value. Don't use continuation unless allowed. It will produce different results in different front ends. | ||
− | RTF is allowed in some values | + | RTF is allowed in some values. |
Some allow HTML <a href="xxx">label</a>hypertext links. HTML is not allowed otherwise. | Some allow HTML <a href="xxx">label</a>hypertext links. HTML is not allowed otherwise. | ||
Line 122: | Line 18: | ||
The order of elements specified in a conf file is immaterial, except where specified otherwise. | The order of elements specified in a conf file is immaterial, except where specified otherwise. | ||
− | |||
− | |||
===Required Elements=== | ===Required Elements=== | ||
Line 134: | Line 28: | ||
|- | |- | ||
| [ModName] | | [ModName] | ||
− | | Each conf file begins | + | | Each conf file begins with a unique identifier for a module placed within brackets, e.g., [KJV1611]. This must be first in the file. Valid characters for this abbreviation are limited to [https://en.wikipedia.org/wiki/Perl_Compatible_Regular_Expressions PCRE] class <tt>[A-Za-z0-9_]</tt>.<ref>That excludes the space and hyphen characters!</ref><br/> |
The Abbreviation element is meant to allow for localization of this field.<br/> | The Abbreviation element is meant to allow for localization of this field.<br/> | ||
− | The .conf file should be named the lowercase of this abbreviation followed by .conf. For example, | + | The .conf file should be named the lowercase of this abbreviation followed by .conf. For example, kjv1611.conf. |
| | | | ||
| | | | ||
|- | |- | ||
− | | Abbreviation<ref>We | + | | Abbreviation<ref>We advise against explicitly declaring a redundant Abbreviation identical to the ModName. Abbreviation values should try to be unique to avoid user confusion.</ref> |
| | | | ||
'''<string>'''<br/> | '''<string>'''<br/> | ||
− | This field allows for | + | The abbreviation displayed to the user. If not supplied, the unique ModName will be used. This field allows for localization-- showing a different abbreviation depending on the user's locale. |
− | |||
| [ModName] | | [ModName] | ||
| [[DevTools:conf Files#Localization|Localization]] | | [[DevTools:conf Files#Localization|Localization]] | ||
Line 160: | Line 53: | ||
DataPath is the path to the module data files relative to the SWORD module library root directory. This path should start with "./modules". If the DataPath indicates a directory it should end with a '/'. Otherwise the module name is both the directory and the prefix for each file in that directory. Although DataPath can point to any folder or files under the root of the SWORD module library, the following conventions are recommended and must be used for modules wishing to be included in a CrossWire repository: | DataPath is the path to the module data files relative to the SWORD module library root directory. This path should start with "./modules". If the DataPath indicates a directory it should end with a '/'. Otherwise the module name is both the directory and the prefix for each file in that directory. Although DataPath can point to any folder or files under the root of the SWORD module library, the following conventions are recommended and must be used for modules wishing to be included in a CrossWire repository: | ||
− | Paths for a module named [MyModule], depending on the type of module (Bible text, commentary, | + | Paths used for a module named [MyModule], depending on<BR>(a) the type of module (Bible text, commentary, lexicon or dictionary<ref>Daily devotionals & glossaries go in subdirectories under lexdict. A glossary is between two languages.</ref>, general book) and<BR>(b) the data driver (ModDrv parameter) are: |
:./modules/texts/rawtext/mymodule/ | :./modules/texts/rawtext/mymodule/ | ||
+ | :./modules/texts/rawtext4/mymodule/ | ||
:./modules/texts/ztext/mymodule/ | :./modules/texts/ztext/mymodule/ | ||
+ | :./modules/texts/ztext4/mymodule/ | ||
:./modules/comments/zcom/mymodule/ | :./modules/comments/zcom/mymodule/ | ||
+ | :./modules/comments/zcom4/mymodule/ | ||
:./modules/comments/hrefcom/mymodule/ | :./modules/comments/hrefcom/mymodule/ | ||
:./modules/comments/rawcom/mymodule/ | :./modules/comments/rawcom/mymodule/ | ||
Line 182: | Line 78: | ||
| | | | ||
'''RawText''' (for uncompressed Bibles)<br/> | '''RawText''' (for uncompressed Bibles)<br/> | ||
+ | '''RawText4''' (for uncompressed Bibles having entries greater than 64K bytes)<ref name="text4">e.g. If the Bible contains large introduction sections</ref><br/> | ||
'''zText''' (for compressed Bibles)<br/> | '''zText''' (for compressed Bibles)<br/> | ||
+ | '''zText4''' (for compressed Bibles having entries greater than 64K bytes)<ref name="text4">e.g. If the Bible contains large introduction sections</ref><ref name="sv">zText4 & zCom4 modules require MinimumVersion=1.8 or later.</ref><br/> | ||
'''RawCom''' (for uncompressed Commentaries)<br/> | '''RawCom''' (for uncompressed Commentaries)<br/> | ||
− | '''RawCom4''' (for uncompressed Commentaries having entries greater than | + | '''RawCom4''' (for uncompressed Commentaries having entries greater than 64K bytes)<br/> |
'''zCom''' (for compressed Commentaries)<br/> | '''zCom''' (for compressed Commentaries)<br/> | ||
+ | '''zCom4''' (for compressed Commentaries having entries greater than 64K bytes)<ref name="sv">zText4 & zCom4 modules require SwordVersion=1.8 or later.</ref><br/> | ||
'''HREFCom''' (each module entry must be only a URL to the body for the entry; experimental)<br/> | '''HREFCom''' (each module entry must be only a URL to the body for the entry; experimental)<br/> | ||
'''RawFiles''' (stores each entry in a simple text file in the datapath; recommended for Personal Commentary)<br/> | '''RawFiles''' (stores each entry in a simple text file in the datapath; recommended for Personal Commentary)<br/> | ||
'''RawLD''' (for uncompressed Dictionaries)<br/> | '''RawLD''' (for uncompressed Dictionaries)<br/> | ||
− | '''RawLD4''' (for uncompressed Dictionaries having entries greater than | + | '''RawLD4''' (for uncompressed Dictionaries having entries greater than 64K bytes)<br/> |
'''zLD''' (for compressed Dictionaries)<br/> | '''zLD''' (for compressed Dictionaries)<br/> | ||
'''RawGenBook''' (for uncompressed tree keyed modules) | '''RawGenBook''' (for uncompressed tree keyed modules) | ||
Line 206: | Line 105: | ||
! width="10%"|Allows | ! width="10%"|Allows | ||
|- | |- | ||
− | | SourceType | + | | SourceType<ref>Omitting this for a non-plaintext module has unpredictable effects.</ref> |
| | | | ||
'''Plaintext'''<br/> | '''Plaintext'''<br/> | ||
− | |||
− | |||
'''OSIS''' ([http://www.bibletechnologies.net Open Scriptural Information Standard])<br/> | '''OSIS''' ([http://www.bibletechnologies.net Open Scriptural Information Standard])<br/> | ||
'''TEI''' ([http://www.tei-c.org Text Encoding Initiative])<br/> | '''TEI''' ([http://www.tei-c.org Text Encoding Initiative])<br/> | ||
Specifies the '''markup''' used in the module. The preferred markup is OSIS. TEI is preferred for dictionaries until OSIS supports dictionaries. While SourceType has a default, it is a best practice to specify it.<br/> | Specifies the '''markup''' used in the module. The preferred markup is OSIS. TEI is preferred for dictionaries until OSIS supports dictionaries. While SourceType has a default, it is a best practice to specify it.<br/> | ||
+ | |||
+ | Legacy modules may have a key here stating: | ||
+ | |||
+ | '''GBF''' ([http://www.ebible.org/bible/gbf.htm General Bible Format])<br/> | ||
+ | '''ThML''' ([http://www.ccel.org/ThML Theological Markup Language])<br/> | ||
+ | |||
In SWORD, for modules encoded with ThML, OSIS or TEI, each verse, dictionary entry, and book division needs to be well-formed XML or it will result in display problems in some front-ends. | In SWORD, for modules encoded with ThML, OSIS or TEI, each verse, dictionary entry, and book division needs to be well-formed XML or it will result in display problems in some front-ends. | ||
| Plaintext | | Plaintext | ||
Line 220: | Line 123: | ||
| Encoding | | Encoding | ||
| | | | ||
− | |||
'''UTF-8'''<br/> | '''UTF-8'''<br/> | ||
'''UTF-16'''<br/> | '''UTF-16'''<br/> | ||
Line 226: | Line 128: | ||
Indicates how the text in the conf and in the module are encoded. | Indicates how the text in the conf and in the module are encoded. | ||
− | The preferred encoding of texts is UTF-8. Other than Hebrew, UTF-8 modules must be encoded with [http://unicode.org/reports/tr15/ Normalization Form C (NFC)]. Hebrew requires special handling. | + | The preferred encoding of texts is UTF-8. Other than Hebrew, UTF-8 modules must be encoded with [http://unicode.org/reports/tr15/ Normalization Form C (NFC)]. Biblical Hebrew requires special handling.<ref>Unicode normalization can easily break Biblical Hebrew text. See on page 9 in the [http://www.sbl-site.org/Fonts/SBLHebrewUserManual1.5x.pdf SBL Hebrew Font User Manual].</ref> A few other languages may require special handling.<ref>e.g. If they are mentioned in Table 10 in the [http://unicode.org/reports/tr15/#Corrigendum_5_Sequences Corrigendum 5 Sequences].</ref><ref>The improper normalization of exceptional codepoints can be prevented by inserting a [https://en.wikipedia.org/wiki/Combining_Grapheme_Joiner Combining Grapheme Joiner].</ref> |
− | To date no | + | To date, no modules use UTF-16 or SCSU. Legacy modules may hold a key here '''Latin-1''' referring solely to Windows Codepage 1252, a superset of ISO-8859-1. Front-end implementors wishing to use such modules should use "cp1252" or "windows1252" explicitly, not "Latin-1" provided by some programming language libraries.'' |
− | |||
− | |||
| Latin-1 | | Latin-1 | ||
| | | | ||
Line 264: | Line 164: | ||
| Versification | | Versification | ||
| | | | ||
+ | '''Calvin'''<br/> | ||
'''Catholic'''<br/> | '''Catholic'''<br/> | ||
'''Catholic2'''<br/> | '''Catholic2'''<br/> | ||
+ | '''Darby_Fr'''<br/> | ||
'''German'''<br/> | '''German'''<br/> | ||
'''KJV'''<br/> | '''KJV'''<br/> | ||
Line 276: | Line 178: | ||
'''NRSVA'''<br/> | '''NRSVA'''<br/> | ||
'''Orthodox'''<br/> | '''Orthodox'''<br/> | ||
+ | '''Segond'''<br/> | ||
'''Synodal'''<br/> | '''Synodal'''<br/> | ||
'''SynodalProt'''<br/> | '''SynodalProt'''<br/> | ||
'''Vulg'''<br/> | '''Vulg'''<br/> | ||
− | Used to specify the versification employed by a Bible module. Refer to [[Alternate Versification]]. | + | Used to specify the versification employed by a Bible module<ref>i.e. Specify this ''only'' for a module that uses '''VerseKey'''.</ref>. Refer to [[Alternate Versification]]. |
| KJV | | KJV | ||
| | | | ||
Line 298: | Line 201: | ||
Used for RawGenBook to indicate whether the module contains a book or a Bible. At this time VerseKey is not yet supported and is being developed as a solution for Bibles which do not conform to any supported versification system in SWORD. It is best practice to take the default and not specify it. | Used for RawGenBook to indicate whether the module contains a book or a Bible. At this time VerseKey is not yet supported and is being developed as a solution for Bibles which do not conform to any supported versification system in SWORD. It is best practice to take the default and not specify it. | ||
| TreeKey | | TreeKey | ||
+ | | | ||
+ | |- | ||
+ | | CaseSensitiveKeys | ||
+ | | | ||
+ | Used for Dictionaries whose keys are case sensitive. This key is used to suppress normalization to UPPER CASE before comparison.<br/> | ||
+ | Only allowable value: '''true''' | ||
+ | | false | ||
| | | | ||
|} | |} | ||
+ | |||
+ | <references /> | ||
=== Elements required for proper rendering === | === Elements required for proper rendering === | ||
Line 311: | Line 223: | ||
| GlobalOptionFilter | | GlobalOptionFilter | ||
| | | | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
'''UTF8Cantillation''' (For Hebrew texts having cantillation marks)<ref>See https://en.wikipedia.org/wiki/Cantillation</ref><br/> | '''UTF8Cantillation''' (For Hebrew texts having cantillation marks)<ref>See https://en.wikipedia.org/wiki/Cantillation</ref><br/> | ||
− | '''UTF8GreekAccents''' (For Greek texts having accents)<br/> | + | '''UTF8GreekAccents''' (For Greek texts having accents)<ref>For detailed background, see https://en.wikipedia.org/wiki/Greek_diacritics</ref><ref>This filter can have undesirable side-effects when applied to non-Greek text!</ref><br/> |
− | '''UTF8HebrewPoints''' (For Hebrew texts having vowel points)<br/> | + | '''UTF8HebrewPoints''' (For Hebrew texts having vowel points)<ref>See https://en.wikipedia.org/wiki/Niqqud</ref><br/> |
+ | '''UTF8ArabicPoints''' (For Arabic texts having vowel points)<ref>See https://en.wikipedia.org/wiki/Arabic_diacritics</ref><br/> | ||
'''OSISLemma''' (For OSIS texts having lemmas)<ref>Must precede OSISStrongs.</ref><br/> | '''OSISLemma''' (For OSIS texts having lemmas)<ref>Must precede OSISStrongs.</ref><br/> | ||
− | '''OSISMorphSegmentation''' (For OSIS texts having morphological segmentation elements)<br/> | + | '''OSISMorphSegmentation''' (For OSIS texts having morphological segmentation elements)<ref>Currently, only some JSword based front-ends seem to support this feature. The SWORD engine has the switch available, but no change in output is effected.</ref><br/> |
− | '''OSISStrongs''' (For OSIS texts having Strong's Numbers)<br/> | + | '''OSISStrongs''' (For OSIS texts having Strong's Numbers)<ref name="strongs">See https://en.wikipedia.org/wiki/Strong%27s_Concordance#Strong.27s_numbers</ref><br/> |
'''OSISFootnotes''' (For OSIS texts having informational notes)<br/> | '''OSISFootnotes''' (For OSIS texts having informational notes)<br/> | ||
'''OSISScripref''' (For OSIS texts having [[OSIS Bibles#Marking_cross-references_note|cross reference]] type notes)<br/> | '''OSISScripref''' (For OSIS texts having [[OSIS Bibles#Marking_cross-references_note|cross reference]] type notes)<br/> | ||
Line 334: | Line 235: | ||
'''OSISHeadings''' (For OSIS texts having non-canonical headings)<br/> | '''OSISHeadings''' (For OSIS texts having non-canonical headings)<br/> | ||
'''OSISVariants''' (For OSIS texts having variant readings)<br/> | '''OSISVariants''' (For OSIS texts having variant readings)<br/> | ||
− | '''OSISRedLetterWords''' (For OSIS texts marking the Words of Christ)<br/> | + | '''OSISRedLetterWords''' (For OSIS texts marking the Words of Christ)<ref name="red">See https://en.wikipedia.org/wiki/Red_letter_edition</ref><br/> |
− | '''OSISGlosses''' (For OSIS texts with glosses)<ref>Minimum SWORD version of 1.7.0 in the module .conf is required for OSISGlosses | + | '''OSISGlosses''' (For OSIS texts with glosses)<ref>Minimum SWORD version of 1.7.0 in the module .conf is required for OSISGlosses.</ref><br/> |
'''OSISXlit''' (For OSIS texts that include transliterated forms)<ref>The Samaritan Pentateuch module SP is an example of using xlit.</ref><br/> | '''OSISXlit''' (For OSIS texts that include transliterated forms)<ref>The Samaritan Pentateuch module SP is an example of using xlit.</ref><br/> | ||
'''OSISEnum''' (For OSIS texts with enumerated words)<ref>The Samaritan Pentateuch module SP is an example of using enum.</ref><br/> | '''OSISEnum''' (For OSIS texts with enumerated words)<ref>The Samaritan Pentateuch module SP is an example of using enum.</ref><br/> | ||
Line 350: | Line 251: | ||
# "On" = Default filter toggle value ("On" or "Off") | # "On" = Default filter toggle value ("On" or "Off") | ||
</ref><ref>It is allowed to have multiple OSISReferenceLinks entries in a single conf file.</ref><br/> | </ref><ref>It is allowed to have multiple OSISReferenceLinks entries in a single conf file.</ref><br/> | ||
− | Each of these filters removes/hides the text's feature, when activated by the application. | + | Each of these filters removes/hides the text's feature, when activated by the application.<ref>It's not implied that every front-end supports all of the listed option filters.</ref> |
− | These filters are applied in the order that they are listed in the conf. Some filters are dependent on each other for certain features - e.g. | + | |
+ | These filters are applied in the order that they are listed in the conf. Some filters are dependent on each other for certain features - e.g. cross-references in notes require both the '''OSISFootnotes''' and the '''OSISScriprefs''' filters enabled. | ||
+ | |||
+ | Legacy modules may also have following keys: | ||
+ | |||
+ | '''OSISRuby'''<ref>See [https://en.wikipedia.org/wiki/Ruby_character Ruby character] and [https://en.wikipedia.org/wiki/Furigana Furigana]</ref> (For OSIS texts with glosses)<ref>Deprecated in 1.7.0. Use OSISGlosses instead.</ref><br/> | ||
+ | '''GBFStrongs''' (For GBF texts having Strong's Numbers)<ref name="strongs">See https://en.wikipedia.org/wiki/Strong%27s_Concordance#Strong.27s_numbers</ref><br/> | ||
+ | '''GBFFootnotes''' (For GBF texts having footnotes)<br/> | ||
+ | '''GBFMorph''' (For GBF texts having morphology information)<br/> | ||
+ | '''GBFHeadings''' (For GBF texts having headings)<br/> | ||
+ | '''GBFRedLetterWords''' (For GBF texts marking the Words of Christ)<ref name="red">See https://en.wikipedia.org/wiki/Red_letter_edition</ref><br/> | ||
+ | '''ThMLStrongs''' (For THML texts having Strong's Numbers)<ref name="strongs">See https://en.wikipedia.org/wiki/Strong%27s_Concordance#Strong.27s_numbers</ref><br/> | ||
+ | '''ThMLFootnotes''' (For THML texts having footnotes)<br/> | ||
+ | '''ThMLScripref''' (For THML texts having cross references)<br/> | ||
+ | '''ThMLMorph''' (For THML texts having morphology information)<br/> | ||
+ | '''ThMLHeadings''' (For THML texts having headings)<br/> | ||
+ | '''ThMLVariants''' (For THML texts having variant readings)<br/> | ||
+ | '''ThMLLemma''' (For THML texts having lemmas)<br/> | ||
+ | |||
| | | | ||
| Repeats | | Repeats | ||
Line 372: | Line 291: | ||
| Font | | Font | ||
| '''<string>'''<br/> | | '''<string>'''<br/> | ||
− | Specify the font to be used for display of the module if it is available. Omit this line to use the default font. Do not make use of font-specific encodings in your documents, but use Unicode instead and the Private Use Area if necessary for codepoints that are not handled by Unicode. | + | Specify the [[Fonts|font]] to be used for display of the module if it is available.<ref>Specifying a font may not be sufficient for some modules. The required font features may depend on a particular smart font engine, which may not be compiled into the front-end application.</ref> Omit this line to use the default font. Do not make use of font-specific encodings in your documents, but use Unicode instead and the Private Use Area if necessary for codepoints that are not handled by Unicode. |
| | | | ||
| | | | ||
Line 386: | Line 305: | ||
| | | | ||
'''StrongsNumbers''' (for modules that include Strong's numbers)<br/> | '''StrongsNumbers''' (for modules that include Strong's numbers)<br/> | ||
− | '''GreekDef''' (for modules with Strong's number encoded Greek definitions)<br/> | + | '''GreekDef''' (for dictionary modules with Strong's number encoded Greek definitions)<br/> |
− | '''HebrewDef''' (for modules with Strong's number encoded Hebrew definitions)<br/> | + | '''HebrewDef''' (for dictionary modules with Strong's number encoded Hebrew definitions)<br/> |
− | '''GreekParse''' (for modules with Greek morphology expansions)<br/> | + | '''GreekParse''' (for dictionary modules with Greek morphology expansions)<br/> |
− | '''HebrewParse''' (for modules with Hebrew morphology expansions)<br/> | + | '''HebrewParse''' (for dictionary modules with Hebrew morphology expansions)<br/> |
'''DailyDevotion''' (for daily devotionals using one of the LD drivers and keyed with MM.DD)<br/> | '''DailyDevotion''' (for daily devotionals using one of the LD drivers and keyed with MM.DD)<br/> | ||
'''Glossary''' (for collections of glosses using one of the LD drivers)<br/> | '''Glossary''' (for collections of glosses using one of the LD drivers)<br/> | ||
'''Images''' (for modules that contain images of any type)<br/> | '''Images''' (for modules that contain images of any type)<br/> | ||
− | '''NoParagraphs''' (for modules without any paragraphing information, which are typically typeset with a verse per line<ref>This feature is intended to be informational to front-end developers. Ideally, front-ends will render these modules with a verse per line rather than as a single big chapter-length paragraph block.</ref>) | + | '''NoParagraphs''' (for modules without any paragraphing information, which are typically typeset with a verse per line<ref>This feature is intended to be informational to front-end developers. Ideally, front-ends will render these modules with a verse per line rather than as a single big chapter-length paragraph block.</ref><ref>The occurrence of a small number of paragraph elements in a module (such as might be used merely to format the colophon at the end of each Pauline epistle) does not rule out the use of this feature.</ref>) |
| | | | ||
| Repeats | | Repeats | ||
Line 421: | Line 340: | ||
=== Optional elements to support particular features === | === Optional elements to support particular features === | ||
+ | ==== CaseInsensitiveKeys ==== | ||
+ | Intended for use with Lexicon/Dictionary & Glossary modules. This field will make the order of the keys based upon the mixed case keys, but the index is still sorted by byte order of those keys. There are some scripts that don’t have upper/lower case (e.g. Arabic) and some languages where a naïve toUpper() will result in the wrong character (e.g. Turkish/Azeri lowercase dotted i and capital dotted İ). | ||
+ | |||
+ | CaseInsensitiveKeys=true|false | ||
+ | |||
+ | It is fine to use toUpper() for internal normalization, but having keys in all caps when showing to a user is annoying. The problem is that the display order needs to follow something that makes sense to a user when the dictionary is presented as a list. | ||
+ | |||
+ | [https://github.com/JohnAustinDev/xulsword xulsword] has a different solution involving a configuration item not yet used by SWORD master. | ||
+ | |||
+ | LangSortOrder=AaBbCcDdEe... | ||
+ | |||
+ | This is used by xulsword to sort the keys of a dictionary/glossary in original alphabetical order. Here's an actual example for module TKLDICT which has Lang=tk-Latn (i.e. Türkmençe): | ||
+ | |||
+ | LangSortOrder=AaBbCcÇçDdEeÄ䯿FfGgHhIiJjKkLlMmNnŇňOoÖöPpQqRrSsŞşTtUuÜüVvWwXxYyÝýZzŽž | ||
+ | |||
+ | This method would need to be modified in order to support alphabets (such as [http://en.wikipedia.org/wiki/Welsh_orthography Welsh]) that include any [http://en.wikipedia.org/wiki/Digraph_%28orthography%29 digraphs]. | ||
+ | |||
==== StrongsPadding ==== | ==== StrongsPadding ==== | ||
At the heart of our lexicon/dictionary drivers, we have some old logic which tries to detect if a key value is a Strong's number, and if so, then pad it with leading zeros accordingly. To support this logic, the recognition has recently been added for an optional new .conf entry for lexicon/dictionary modules: | At the heart of our lexicon/dictionary drivers, we have some old logic which tries to detect if a key value is a Strong's number, and if so, then pad it with leading zeros accordingly. To support this logic, the recognition has recently been added for an optional new .conf entry for lexicon/dictionary modules: | ||
Line 451: | Line 387: | ||
They're pretty concise and don't involve much knowledge from the rest of the engine, making them easy to write if we need a new one. | They're pretty concise and don't involve much knowledge from the rest of the engine, making them easy to write if we need a new one. | ||
− | This processing can replace or be complimentary to any processing done by clucene. | + | This processing can replace or be complimentary to any processing done by clucene. Here's an example of what's used with the [http://papyri.info/docs/ddbdp Duke Databank of Papyri] with specialist software that's based on SWORD. |
+ | |||
+ | LocalStripFilter=PapyriPlain | ||
Since we need to strip markup, and other things clucene will likely never support (see '''PapyriPlain''' – annotations like [,],?{,}, underdot) we need this preprocessing mechanism to prepare the text before searching. We also maintain searching functionality apart from "fast indexed searching".<ref>Currently supplied by clucene, but could be implemented by any other fast search framework that we might want to integrate in future.</ref> | Since we need to strip markup, and other things clucene will likely never support (see '''PapyriPlain''' – annotations like [,],?{,}, underdot) we need this preprocessing mechanism to prepare the text before searching. We also maintain searching functionality apart from "fast indexed searching".<ref>Currently supplied by clucene, but could be implemented by any other fast search framework that we might want to integrate in future.</ref> | ||
Line 490: | Line 428: | ||
|- | |- | ||
| History_x.x | | History_x.x | ||
− | | '''<string>'''<br/> | + | | '''<string>'''<ref>Maximum line length is 1024 characters!</ref><br/> |
x.x is taken from the Version value. | x.x is taken from the Version value. | ||
Line 499: | Line 437: | ||
| Repeats<br/>[[DevTools:conf Files#Localization|Localization]] | | Repeats<br/>[[DevTools:conf Files#Localization|Localization]] | ||
|- | |- | ||
− | | MinimumVersion | + | | MinimumVersion<ref>See http://tracker.crosswire.org/browse/API-201</ref> |
| | | | ||
'''<version string>'''<br/> | '''<version string>'''<br/> | ||
− | Identifies the minimum version of the SWORD library required for this module. | + | Identifies the [[Sword library versions | minimum version]] of the SWORD library required for this module.<ref>Required to support a Bible/Commentary module that has an [[Alternate Versification]].</ref> |
| 1.5.1a | | 1.5.1a | ||
| | | | ||
Line 508: | Line 446: | ||
| Category | | Category | ||
|valign="top"| | |valign="top"| | ||
− | + | This is used by installers to further categorize the modules beyond what can be figured out by the ModDrv and Feature.<br/> | |
− | |||
− | |||
− | |||
− | |||
− | |||
'''Biblical Texts''' (for Bibles)<br/> | '''Biblical Texts''' (for Bibles)<br/> | ||
'''Commentaries'''<br/> | '''Commentaries'''<br/> | ||
'''Lexicons / Dictionaries'''<br/> | '''Lexicons / Dictionaries'''<br/> | ||
+ | '''Glossaries''' (for modules with Feature=Glossary)<br/> | ||
+ | '''Daily Devotional''' (for modules with Feature=DailyDevotion)<br/> | ||
'''Generic Books''' (for anything else....)<br/> | '''Generic Books''' (for anything else....)<br/> | ||
− | + | '''Maps''' (for modules that primarily consist of maps)<br/> | |
+ | '''Images''' (for modules that primarily consist of images)<br/> | ||
+ | '''Cults / Unorthodox / Questionable Material'''<br/> | ||
+ | '''Essays''' (for essays)<ref>'''Essays''' is handled as a subset of '''Generic Books'''.</ref> | ||
| | | | ||
Biblical Texts<br/> | Biblical Texts<br/> | ||
Line 536: | Line 474: | ||
''ModDrv=RawLD4'' or<br/> | ''ModDrv=RawLD4'' or<br/> | ||
''ModDrv=zLD''<br/> | ''ModDrv=zLD''<br/> | ||
− | + | Glossaries<br/> | |
− | ''Feature= | + | ''Feature=Glossary'' and<br/> |
''ModDrv=RawLD'' or<br/> | ''ModDrv=RawLD'' or<br/> | ||
''ModDrv=RawLD4'' or<br/> | ''ModDrv=RawLD4'' or<br/> | ||
''ModDrv=zLD''<br/> | ''ModDrv=zLD''<br/> | ||
− | + | Daily Devotional<br/> | |
− | ''Feature= | + | ''Feature=DailyDevotion'' and<br/> |
''ModDrv=RawLD'' or<br/> | ''ModDrv=RawLD'' or<br/> | ||
''ModDrv=RawLD4'' or<br/> | ''ModDrv=RawLD4'' or<br/> | ||
Line 605: | Line 543: | ||
| | | | ||
|- | |- | ||
− | | Companion | + | | Companion<ref>Many (xulsword compatible) modules in the [[Official and Affiliated Module Repositories#Institute_for_Bible_Translation|IBT Repository]] make use of this field. See also https://github.com/johnaustindev/osis-converters</ref> |
| | | | ||
'''<ModName[, ModName]*>'''<br/> | '''<ModName[, ModName]*>'''<br/> | ||
Specifies companion module(s) that should be opened together<br/> | Specifies companion module(s) that should be opened together<br/> | ||
− | e.g. When Bible and Commentary modules are distributed together. | + | e.g. When Bible and Commentary and/or Glossary modules are distributed together. |
| | | | ||
| | | | ||
|} | |} | ||
+ | '''Note:''' | ||
+ | <references/> | ||
=== Copyright & Licensing related elements === | === Copyright & Licensing related elements === | ||
Line 671: | Line 611: | ||
| | | | ||
'''<string>'''<br/> | '''<string>'''<br/> | ||
− | Contains the email address of the copyright holder, preferably in the form:<br/> name at xyz dot com | + | Contains the email address of the copyright holder, preferably in the form:<br/> name at xyz dot com |
| | | | ||
| [[DevTools:conf Files#Localization|Localization]] | | [[DevTools:conf Files#Localization|Localization]] | ||
Line 687: | Line 627: | ||
| | | | ||
| [[DevTools:conf Files#Localization|Localization]] | | [[DevTools:conf Files#Localization|Localization]] | ||
− | |- | + | |- id="distlic" |
| DistributionLicense | | DistributionLicense | ||
| | | | ||
'''Public Domain'''<br/> | '''Public Domain'''<br/> | ||
'''Copyrighted'''<br/> | '''Copyrighted'''<br/> | ||
− | '''Copyrighted; Permission to distribute granted to CrossWire'''<br/> | + | '''Copyrighted; Permission to distribute granted to CrossWire'''<ref>Modules in other repositories may have a different organization name instead of CrossWire.</ref><br/> |
+ | '''Copyrighted; Permission granted to distribute non-commercially in SWORD format'''<br/> | ||
'''Copyrighted; Free non-commercial distribution'''<br/> | '''Copyrighted; Free non-commercial distribution'''<br/> | ||
'''Copyrighted; Freely distributable'''<br/> | '''Copyrighted; Freely distributable'''<br/> | ||
− | |||
'''[http://www.gnu.org/copyleft/fdl.html GFDL]'''<br/> | '''[http://www.gnu.org/copyleft/fdl.html GFDL]'''<br/> | ||
'''[http://www.gnu.org/copyleft/gpl.html GPL]'''<br/> | '''[http://www.gnu.org/copyleft/gpl.html GPL]'''<br/> | ||
− | '''Creative Commons: | + | '''Creative Commons: BY-NC-ND 4.0'''<br/> |
− | '''Creative Commons: | + | '''Creative Commons: BY-NC-SA 4.0'''<br/> |
− | '''Creative Commons: | + | '''Creative Commons: BY-NC 4.0'''<br/> |
− | '''Creative Commons: | + | '''Creative Commons: BY-ND 4.0'''<br/> |
− | '''Creative Commons: | + | '''Creative Commons: BY-SA 4.0'''<br/> |
− | '''Creative Commons: | + | '''Creative Commons: BY 4.0'''<br/> |
'''Creative Commons: [http://creativecommons.org/publicdomain/zero/1.0/ CC0]'''<br/> | '''Creative Commons: [http://creativecommons.org/publicdomain/zero/1.0/ CC0]'''<br/> | ||
− | + | Use one of these strings '''verbatim'''. The actual copyright and/or license information is held in other elements.<br/>The last seven are different [https://creativecommons.org/licenses/ Creative Commons licenses]. | |
− | Use one of these strings '''verbatim'''. The actual copyright and/or license information is held in other elements. The last | ||
− | |||
− | |||
| | | | ||
| | | | ||
Line 718: | Line 655: | ||
| | | | ||
| [[DevTools:conf Files#Continuation|Continuation]]<br/>[[DevTools:conf Files#Localization|Localization]] | | [[DevTools:conf Files#Continuation|Continuation]]<br/>[[DevTools:conf Files#Localization|Localization]] | ||
− | |- | + | |- id="txtsrc" |
| TextSource | | TextSource | ||
| | | | ||
Line 726: | Line 663: | ||
| [[DevTools:conf Files#Continuation|Continuation]] | | [[DevTools:conf Files#Continuation|Continuation]] | ||
|- | |- | ||
− | | | + | | UnlockInfo |
| | | | ||
'''<string>'''<br/> | '''<string>'''<br/> | ||
− | Contains the | + | Contains unlock instructions for the user, intended to be displayed upon attempt to install the module. These instructions typically include a URL pointing to an exact location within a publisher's online store where a user can directly purchase an unlock key (CipherKey) for the module. |
| | | | ||
− | | | + | | [[DevTools:conf Files#RTF|RTF]]<br/>[[DevTools:conf Files#Continuation|Continuation]]<br/>[[DevTools:conf Files#Localization|Localization]]<br/>HTML Link |
|} | |} | ||
+ | '''Note:''' | ||
+ | <references/> | ||
+ | |||
+ | === Comments and blank lines === | ||
+ | Early in 2021, CrossWire began providing comment lines and blank lines in released and updated modules. | ||
+ | Here's and example showing that there may sometimes be both ''even before'' the module ID. | ||
+ | <pre> | ||
+ | ## Sword module configuration file | ||
+ | |||
+ | ## Required elements | ||
+ | # Module Unique Identifier. | ||
+ | [GerMenge] | ||
+ | ... | ||
+ | </pre>Some other module providers and repositories are already moving in the same direction. | ||
+ | |||
+ | == Further details == | ||
+ | === Continuation === | ||
+ | A value can span multiple lines by escaping the return with '\'. This is not a mechanism to make long lines more readable in the module.conf file. It is a means to introducing a break in the rendered output of that field when viewed by a front-end or installer. It is akin to a xHTML <br/>. That is, continuation is a formatting feature. | ||
+ | |||
+ | Most elements in a SWORD conf are expected to have short, one-line values. Elements that are expected to have multiple lines are noted. | ||
+ | |||
+ | === RTF === | ||
+ | A module.conf supports a very small, restricted subset of RTF markup. Only the following are allowed: | ||
+ | * \qc - for centering | ||
+ | * \par - for paragraph breaks | ||
+ | * \pard - for resetting paragraph attributes, i.e. turning off centering. | ||
+ | * \u{num}? - for unicode characters, where {num} is a signed, 16-bit representation of the code point and where ? is the ASCII character used in case unicode is not supported. If the {num} is less than 0 then add 65536 to it. This should only be used in modules that have an Encoding=UTF-8, but using the actual UTF-8 character is preferred. | ||
+ | |||
+ | The only uniqueness that RTF provides is centering. If centering is not needed, then use continuation lines instead of RTF. | ||
+ | |||
+ | === Localization === | ||
+ | Those .conf fields that are essentially text intended for presentation to the end-user may be localized by appending _''locale'' to the field name, where ''locale'' is replaced by an appropriate locale code, according to [http://www.rfc-editor.org/rfc/bcp/bcp47.txt BCP 47]. See '''Lang''' key for details. | ||
+ | |||
+ | For example, to give a French description, you can have a key: | ||
+ | :Description_fr=.... | ||
+ | |||
+ | In order to distinguish a regional form from the primary form of a language, e.g. Brazilian Portuguese vs. the Portuguese of Portugal, append the country code as in: | ||
+ | :Description_pt-BR=.... | ||
− | == Uniqueness == | + | Script variants can be given as in: |
+ | :Description_zh-Hans=.... simplified Chinese .... | ||
+ | :Description_zh-Hant=.... traditional Chinese .... | ||
+ | |||
+ | In order for a .conf entry to appear in a localized form, a non-localized form of the same field must also occur within the .conf. For example, in order for a ''Description_en'' field to appear, a file must also possess a ''Description'' field. The locale of .conf entries without the locale modifier is the default and must reflect the locale/language of the module itself (as specified in Lang=) or English (if there are no localized versions of the field). In general, fields should be provided in the language of the module itself with English translations provided in parallel fields localized with _''en''. There is no explicit upper bound on the quantity of localized fields, but all localized and localizable fields should be unique. | ||
+ | |||
+ | # See also [[Localized Language Names]]. | ||
+ | |||
+ | === Uniqueness === | ||
For comparing two versions of a module during module development, the module names and locations must be unique. For JSword based front-ends such as Bible Desktop, there is a further requirement, the Description items must be different. | For comparing two versions of a module during module development, the module names and locations must be unique. For JSword based front-ends such as Bible Desktop, there is a further requirement, the Description items must be different. | ||
+ | === Analysis Tools === | ||
+ | * [[User:Dmsmith|DMSmith]] and [[User:Domcox|Domcox]] have created scripts to analyse conf files and report anomalies. | ||
+ | * [[User:David Haslam|David Haslam]] has created a '''User Defined Language''' called '''CONF''' as a Syntax Highlighter for '''Notepad++''' (Windows). Download from [https://github.com/DavidHaslam/CONF]. | ||
+ | |||
+ | == Automated generation == | ||
+ | * For new module submissions to CrossWire, [[User:DomCox|DomCox]] now maintains a script that includes the ''automated generation'' of module conf files, given the minimum ''non-automatable'' requirements by the module submitter. See [[Module Submission]] for more details. | ||
+ | |||
+ | == See also == | ||
+ | * [[BibTeX entries]] | ||
[[Category:Development tools|Conf files]] | [[Category:Development tools|Conf files]] |
Latest revision as of 06:22, 20 February 2024
This page describes important information about module configuration (.conf) files. All SWORD modules require one.
Please look also at our detailed and commented tutorial.
Contents
- 1 Key elements of a SWORD module.conf
- 2 Further details
- 3 Automated generation
- 4 See also
Key elements of a SWORD module.conf
Some keys can be repeated. Many not.
Some can have values that span more than 1 line with '\' at the end of a line indicating that the text on the next line continues the value. Don't use continuation unless allowed. It will produce different results in different front ends.
RTF is allowed in some values.
Some allow HTML <a href="xxx">label</a>hypertext links. HTML is not allowed otherwise.
Values specifications are shown as <content spec>. The < and > are not to be included.
Enumerated values are shown in bold. These should be used exactly as given and no other values should be used.
The order of elements specified in a conf file is immaterial, except where specified otherwise.
Required Elements
Element | Values (type or enumerated) | Default Value | Allows |
---|---|---|---|
[ModName] | Each conf file begins with a unique identifier for a module placed within brackets, e.g., [KJV1611]. This must be first in the file. Valid characters for this abbreviation are limited to PCRE class [A-Za-z0-9_].[1] The Abbreviation element is meant to allow for localization of this field. |
||
Abbreviation[2] |
<string> |
[ModName] | Localization |
Description |
<string> |
Localization | |
DataPath | <relative system path pointing to the data files>
DataPath is the path to the module data files relative to the SWORD module library root directory. This path should start with "./modules". If the DataPath indicates a directory it should end with a '/'. Otherwise the module name is both the directory and the prefix for each file in that directory. Although DataPath can point to any folder or files under the root of the SWORD module library, the following conventions are recommended and must be used for modules wishing to be included in a CrossWire repository: Paths used for a module named [MyModule], depending on
|
||
ModDrv |
RawText (for uncompressed Bibles) |
- ↑ That excludes the space and hyphen characters!
- ↑ We advise against explicitly declaring a redundant Abbreviation identical to the ModName. Abbreviation values should try to be unique to avoid user confusion.
- ↑ Daily devotionals & glossaries go in subdirectories under lexdict. A glossary is between two languages.
- ↑ 4.0 4.1 e.g. If the Bible contains large introduction sections
- ↑ 5.0 5.1 zText4 & zCom4 modules require MinimumVersion=1.8 or later. Cite error: Invalid
<ref>
tag; name "sv" defined multiple times with different content
Required Elements with defaults
Element | Values (type or enumerated) | Default Value | Allows |
---|---|---|---|
SourceType[1] |
Plaintext Legacy modules may have a key here stating: GBF (General Bible Format) In SWORD, for modules encoded with ThML, OSIS or TEI, each verse, dictionary entry, and book division needs to be well-formed XML or it will result in display problems in some front-ends. |
Plaintext | |
Encoding |
UTF-8 The preferred encoding of texts is UTF-8. Other than Hebrew, UTF-8 modules must be encoded with Normalization Form C (NFC). Biblical Hebrew requires special handling.[2] A few other languages may require special handling.[3][4] To date, no modules use UTF-16 or SCSU. Legacy modules may hold a key here Latin-1 referring solely to Windows Codepage 1252, a superset of ISO-8859-1. Front-end implementors wishing to use such modules should use "cp1252" or "windows1252" explicitly, not "Latin-1" provided by some programming language libraries. |
Latin-1 | |
CompressType |
ZIP |
LZSS | |
BlockType |
BOOK |
CHAPTER | |
BlockCount |
<integer> |
200 | |
Versification |
Calvin |
KJV | |
CipherKey |
<string> A good key is something that is hard to guess. Typically in a format matching the pattern: /[0-9]{4}[A-Za-z]{4}[0-9]{4}[A-Za-z]{4}/. Internally the key can be any byte sequence from 1 to 255 bytes in length. But it needs to be readable, plain text, without leading or trailing spaces. |
||
KeyType |
TreeKey |
TreeKey | |
CaseSensitiveKeys |
Used for Dictionaries whose keys are case sensitive. This key is used to suppress normalization to UPPER CASE before comparison. |
false |
- ↑ Omitting this for a non-plaintext module has unpredictable effects.
- ↑ Unicode normalization can easily break Biblical Hebrew text. See on page 9 in the SBL Hebrew Font User Manual.
- ↑ e.g. If they are mentioned in Table 10 in the Corrigendum 5 Sequences.
- ↑ The improper normalization of exceptional codepoints can be prevented by inserting a Combining Grapheme Joiner.
- ↑ i.e. Specify this only for a module that uses VerseKey.
Elements required for proper rendering
Element | Values (type or enumerated) | Default Value | Allows |
---|---|---|---|
GlobalOptionFilter |
UTF8Cantillation (For Hebrew texts having cantillation marks)[1] These filters are applied in the order that they are listed in the conf. Some filters are dependent on each other for certain features - e.g. cross-references in notes require both the OSISFootnotes and the OSISScriprefs filters enabled. Legacy modules may also have following keys: OSISRuby[16] (For OSIS texts with glosses)[17] |
Repeats | |
Direction |
LtoR (Left to Right) |
LtoR | |
DisplayLevel | <integer> Used for General Book module types (these are keyed with a TreeKey table of contents). Indicates the preferred level from a leaf in the tree to display for context. e.g., 1 will only show the requested entry; 2 will show the entry, surrounded by all siblings, etc. |
1 | |
Font | <string> Specify the font to be used for display of the module if it is available.[19] Omit this line to use the default font. Do not make use of font-specific encodings in your documents, but use Unicode instead and the Private Use Area if necessary for codepoints that are not handled by Unicode. |
||
|
This attribute is deprecated in favor of the marker attribute on the q element. E.g.: <q who="Jesus" marker="">....</q>
|
true | |
Feature |
StrongsNumbers (for modules that include Strong's numbers) |
Repeats | |
GlossaryFrom | <lang identifier> Glossaries map one language to another. This value indicates the language being translated from. See Lang below for a discussion of valid values. |
||
GlossaryTo | <lang identifier> Glossaries map one language to another. This value indicates the language being translated to. See Lang below for a discussion of valid values. |
||
PreferredCSSXHTML | <filename> Names a file in the module's DataPath that should be referenced for the renderer as CSS display controls. Generality is advised: Use controls that are not specific to any particular rendering engine, e.g. WebKit. |
- ↑ See https://en.wikipedia.org/wiki/Cantillation
- ↑ For detailed background, see https://en.wikipedia.org/wiki/Greek_diacritics
- ↑ This filter can have undesirable side-effects when applied to non-Greek text!
- ↑ See https://en.wikipedia.org/wiki/Niqqud
- ↑ See https://en.wikipedia.org/wiki/Arabic_diacritics
- ↑ Must precede OSISStrongs.
- ↑ Currently, only some JSword based front-ends seem to support this feature. The SWORD engine has the switch available, but no change in output is effected.
- ↑ 8.0 8.1 8.2 See https://en.wikipedia.org/wiki/Strong%27s_Concordance#Strong.27s_numbers
- ↑ 9.0 9.1 See https://en.wikipedia.org/wiki/Red_letter_edition
- ↑ Minimum SWORD version of 1.7.0 in the module .conf is required for OSISGlosses.
- ↑ The Samaritan Pentateuch module SP is an example of using xlit.
- ↑ The Samaritan Pentateuch module SP is an example of using enum.
- ↑ New in SWORD 1.7.0 - This filter requires six vertical bar-delimited fields, of which the following is an example.
GlobalOptionFilter=OSISReferenceLinks|Reference Material Links|Hide or show links to study helps in the Biblical text.|x-glossary||On
Here are the different field meanings:
- "OSISReferenceLinks" = option filter class name (option class name internal to the engine). Always the same for this kind of filter.
- "Reference Material Links" = Visible name of this OSISReferenceLinks filter. This is what the user will see in the Global Options toggle lists.
- "Hide or show..." = A readable user tip explaining what the filter does.
- "x-glossary" = Tells this OSISReferenceLinks filter to filter all references with type="x-glossary".
- (empty) = Tells this OSISReferenceLinks filter to also require that subType="something" in order to filter. Empty means ALL type="x-glossary" references will be filtered regardless of subType.
- "On" = Default filter toggle value ("On" or "Off")
Optional elements to support particular features
CaseInsensitiveKeys
Intended for use with Lexicon/Dictionary & Glossary modules. This field will make the order of the keys based upon the mixed case keys, but the index is still sorted by byte order of those keys. There are some scripts that don’t have upper/lower case (e.g. Arabic) and some languages where a naïve toUpper() will result in the wrong character (e.g. Turkish/Azeri lowercase dotted i and capital dotted İ).
CaseInsensitiveKeys=true|false
It is fine to use toUpper() for internal normalization, but having keys in all caps when showing to a user is annoying. The problem is that the display order needs to follow something that makes sense to a user when the dictionary is presented as a list.
xulsword has a different solution involving a configuration item not yet used by SWORD master.
LangSortOrder=AaBbCcDdEe...
This is used by xulsword to sort the keys of a dictionary/glossary in original alphabetical order. Here's an actual example for module TKLDICT which has Lang=tk-Latn (i.e. Türkmençe):
LangSortOrder=AaBbCcÇçDdEeÄ䯿FfGgHhIiJjKkLlMmNnŇňOoÖöPpQqRrSsŞşTtUuÜüVvWwXxYyÝýZzŽž
This method would need to be modified in order to support alphabets (such as Welsh) that include any digraphs.
StrongsPadding
At the heart of our lexicon/dictionary drivers, we have some old logic which tries to detect if a key value is a Strong's number, and if so, then pad it with leading zeros accordingly. To support this logic, the recognition has recently been added for an optional new .conf entry for lexicon/dictionary modules:
StrongsPadding=true|false
Notes:
- So as not to break everything, this currently defaults to true if it is not present in the lexdict module's .conf file
- It can be set to false if you are building a lexdict module which has entries which may be misconstrued as Strong's numbers.
- In a couple years, we'll probably switch the default to false, so it would be nice to add this line and set the value to true on modules which really do require the logic.
- This is only available in SWORD version 1.7 or later. JSword never had this problem.
Strip Filters
SWORD has the concept of "filtering" a module's text at different processing points for purposes other than rendering.
One of these filter-points is for searching and we call these filters Strip Filters.
Strip Filters are typically named something like OSISPlain or GBFPlain, etc. These typically take all the markup out of an entry and prepare the text to be searched, but anything can be done to the text to prepare it further for searching. We typically remove accents and vowel points from Greek and Hebrew respectively.
Any Strip Filter can be added to a module by the module author with a line in the .conf file, such as:
LocalStripFilter=GBFPlain
If diacritics need to be removed from Arabic, then we can certainly add a filter for this as well. The conf line would be:
LocalStripFilter=UTF8ArabicPoints
Our current list of filters can be found by browsing the source folder here:
http://crosswire.org/svn/sword/trunk/src/modules/filters/
They're pretty concise and don't involve much knowledge from the rest of the engine, making them easy to write if we need a new one.
This processing can replace or be complimentary to any processing done by clucene. Here's an example of what's used with the Duke Databank of Papyri with specialist software that's based on SWORD.
LocalStripFilter=PapyriPlain
Since we need to strip markup, and other things clucene will likely never support (see PapyriPlain – annotations like [,],?{,}, underdot) we need this preprocessing mechanism to prepare the text before searching. We also maintain searching functionality apart from "fast indexed searching".[1]
Note:
- ↑ Currently supplied by clucene, but could be implemented by any other fast search framework that we might want to integrate in future.
General informatic and installer elements
Element | Values (type or enumerated) | Default Value | Allows |
---|---|---|---|
About |
<string> |
Continuation RTF Localization | |
SwordVersionDate |
<yyyy-mm-dd> (ISO 8601 Date) |
||
Version |
<version string> CrossWire's standard practice is to indicate updates that only require a .conf-file update/download by incrementing the third most significant number (the revision number). For example, if module version 1.2 requires a .conf-file update. A new .conf file with version number 1.2.1 could be released. |
1.0 | |
History_x.x | <string>[1] x.x is taken from the Version value. Indicates what has changed between different versions. Each time a version is incremented a history line with that version number should explain the change. It is recommended that each explanation be suffixed by the corresponding SwordVersionDate value. |
Repeats Localization | |
MinimumVersion[2] |
<version string> |
1.5.1a | |
Category |
This is used by installers to further categorize the modules beyond what can be figured out by the ModDrv and Feature. |
Biblical Texts |
|
LCSH | <tree/string> Library of Congress Subject Heading. You may search the Library of Congress catalog or use it as a guide for determining an appropriate LCSH for books that are not in the Library of Congress. |
||
Lang |
<Language[-Script]?[-Region]?> Language sub-tag (Regex: /[a-z]{2,3}/):
The ISO639-3 registrar page gives up-to-date table on all of the above. Script sub-tag (Regex: /[A-Z][a-z]{3}/): Region sub-tag: (Regex: /[A-Z]{2}/) Combinations(Regex: /[a-z]{2,3}(-[A-Z][a-z]{3})?(-[A-Z]{2})?/): |
en | |
InstallSize |
<integer> For modules in the CrossWire repositories, this is automatically generated and overwritten if needed. |
||
Obsoletes |
<ModName> |
Repeats | |
OSISVersion |
<version string> It is recommended that this be present for every OSIS module. |
||
Companion[5] |
<ModName[, ModName]*> |
Note:
- ↑ Maximum line length is 1024 characters!
- ↑ See http://tracker.crosswire.org/browse/API-201
- ↑ Required to support a Bible/Commentary module that has an Alternate Versification.
- ↑ Essays is handled as a subset of Generic Books.
- ↑ Many (xulsword compatible) modules in the IBT Repository make use of this field. See also https://github.com/johnaustindev/osis-converters
Element | Values (type or enumerated) | Default Value | Allows |
---|---|---|---|
Copyright |
<string> |
Continuation Localization | |
CopyrightHolder |
<string> |
Localization | |
CopyrightDate |
<yyyy> (ISO 8601 Year) |
Localization | |
CopyrightNotes |
<string> |
Continuation Localization | |
CopyrightContactName |
<string> |
Continuation Localization | |
CopyrightContactNotes |
<string> |
Continuation Localization | |
CopyrightContactAddress |
<string> |
Continuation Localization | |
CopyrightContactEmail |
<string> |
Localization | |
ShortPromo |
<string> |
HTML Link Localization | |
ShortCopyright |
<string> |
Localization | |
DistributionLicense |
Public Domain |
||
DistributionNotes |
<string> |
Continuation Localization | |
TextSource |
<string> |
Continuation | |
UnlockInfo |
<string> |
RTF Continuation Localization HTML Link |
Note:
- ↑ Modules in other repositories may have a different organization name instead of CrossWire.
Comments and blank lines
Early in 2021, CrossWire began providing comment lines and blank lines in released and updated modules. Here's and example showing that there may sometimes be both even before the module ID.
## Sword module configuration file ## Required elements # Module Unique Identifier. [GerMenge] ...Some other module providers and repositories are already moving in the same direction.
Further details
Continuation
A value can span multiple lines by escaping the return with '\'. This is not a mechanism to make long lines more readable in the module.conf file. It is a means to introducing a break in the rendered output of that field when viewed by a front-end or installer. It is akin to a xHTML <br/>. That is, continuation is a formatting feature.
Most elements in a SWORD conf are expected to have short, one-line values. Elements that are expected to have multiple lines are noted.
RTF
A module.conf supports a very small, restricted subset of RTF markup. Only the following are allowed:
- \qc - for centering
- \par - for paragraph breaks
- \pard - for resetting paragraph attributes, i.e. turning off centering.
- \u{num}? - for unicode characters, where {num} is a signed, 16-bit representation of the code point and where ? is the ASCII character used in case unicode is not supported. If the {num} is less than 0 then add 65536 to it. This should only be used in modules that have an Encoding=UTF-8, but using the actual UTF-8 character is preferred.
The only uniqueness that RTF provides is centering. If centering is not needed, then use continuation lines instead of RTF.
Localization
Those .conf fields that are essentially text intended for presentation to the end-user may be localized by appending _locale to the field name, where locale is replaced by an appropriate locale code, according to BCP 47. See Lang key for details.
For example, to give a French description, you can have a key:
- Description_fr=....
In order to distinguish a regional form from the primary form of a language, e.g. Brazilian Portuguese vs. the Portuguese of Portugal, append the country code as in:
- Description_pt-BR=....
Script variants can be given as in:
- Description_zh-Hans=.... simplified Chinese ....
- Description_zh-Hant=.... traditional Chinese ....
In order for a .conf entry to appear in a localized form, a non-localized form of the same field must also occur within the .conf. For example, in order for a Description_en field to appear, a file must also possess a Description field. The locale of .conf entries without the locale modifier is the default and must reflect the locale/language of the module itself (as specified in Lang=) or English (if there are no localized versions of the field). In general, fields should be provided in the language of the module itself with English translations provided in parallel fields localized with _en. There is no explicit upper bound on the quantity of localized fields, but all localized and localizable fields should be unique.
- See also Localized Language Names.
Uniqueness
For comparing two versions of a module during module development, the module names and locations must be unique. For JSword based front-ends such as Bible Desktop, there is a further requirement, the Description items must be different.
Analysis Tools
- DMSmith and Domcox have created scripts to analyse conf files and report anomalies.
- David Haslam has created a User Defined Language called CONF as a Syntax Highlighter for Notepad++ (Windows). Download from [1].
Automated generation
- For new module submissions to CrossWire, DomCox now maintains a script that includes the automated generation of module conf files, given the minimum non-automatable requirements by the module submitter. See Module Submission for more details.