The Open Scriptures Hebrew Bible (OSHB) is a project to analyze the Hebrew Bible
by lemma and morphology. The project is marked up in OSIS XML and currently
contains lemma attributes for most words (using an augmentation of Strong’s
numbers). We are in the process of adding morphology attributes as well. These
files are found in the
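As a rough illustration of how the lemma and morphology attributes can be read, here is a minimal Python sketch. The embedded XML fragment is an assumption made for the example (its attribute values and layout are illustrative, not copied from the project files), and the helper name `words_with_annotations` is hypothetical. Words that have no morphology attribute yet simply yield `None`.

```python
import xml.etree.ElementTree as ET

# OSIS namespace URI; treated here as an assumption about the files' root element.
OSIS_NS = "http://www.bibletechnologies.net/2003/OSIS/namespace"

# Illustrative fragment only: the values are made up for this sketch.
SAMPLE = """<osis xmlns="http://www.bibletechnologies.net/2003/OSIS/namespace">
  <verse osisID="Gen.1.1">
    <w lemma="b/7225" morph="HR/Ncfsa">בְּ/רֵאשִׁית</w>
    <w lemma="1254 a" morph="HVqp3ms">בָּרָא</w>
    <w lemma="430">אֱלֹהִים</w>
  </verse>
</osis>"""


def words_with_annotations(xml_text):
    """Return (osisID, word text, lemma, morph) for every <w> inside a <verse>."""
    root = ET.fromstring(xml_text)
    rows = []
    for verse in root.iter(f"{{{OSIS_NS}}}verse"):
        for w in verse.iter(f"{{{OSIS_NS}}}w"):
            # .get() returns None when an attribute (e.g. morph) is absent.
            rows.append((verse.get("osisID"), w.text, w.get("lemma"), w.get("morph")))
    return rows


if __name__ == "__main__":
    for osis_id, text, lemma, morph in words_with_annotations(SAMPLE):
        print(osis_id, text, lemma, morph)
```

Reading attributes with `.get()` rather than indexing keeps the loop tolerant of words whose morphology has not been added yet.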
Lemma and morphology data are licensed under a Creative Commons Attribution 4.0 International license. For attribution purposes, credit the Open Scriptures Hebrew Bible Project. The text of the WLC remains in the Public Domain. See the LICENSE file for more information.
HomeFiles, along with
index.html, provides a simple home page for the project;
see The OpenScriptures Hebrew Bible.
MAPM contains our work with the
Miqra according to the Mesorah.
These were done in cooperation with Rabbi Kadish, for comparison with the OSHB.
The MapM has since progressed, so our files may not be up to date.
OSHB Graphics contains the OSHB logo graphics, for crediting or linking to
our resources. See the
readme in that directory for more details.
structure contains a demo of OSHB Verse, for displaying verses of the Hebrew
Bible divided by the cantillation hierarchy. This is a simple implementation
for using the
n attributes that have been added, for recording and editing
the cantillation divisions.
The SBL Hebrew User Manual has a section entitled "The normalisation issue" (pp. 8 ff.):
"Normalisation is a process by which sequences of characters in text that
can be variously encoded but are semantically identical are treated as identically encoded." (p. 8)
Because of the warnings in that manual, along with my experiences dealing with the MapM text from WikiSource, any uses of the OSHB should avoid NFC normalization.
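To make the concern concrete, here is a small Python sketch using the standard `unicodedata` module. The sample sequence (lamed followed by patah, then hiriq) is an illustrative choice made for this example, and the helper names `codepoints` and `is_nfc_stable` are hypothetical. NFC applies canonical ordering, which sorts combining marks by their Unicode combining class, so the two vowel points come out in a different order than they were written.

```python
import unicodedata

LAMED = "\u05DC"   # HEBREW LETTER LAMED
PATAH = "\u05B7"   # HEBREW POINT PATAH  (combining class 17)
HIRIQ = "\u05B4"   # HEBREW POINT HIRIQ  (combining class 14)

# Illustrative input: the marks are stored in the order patah, hiriq.
original = LAMED + PATAH + HIRIQ


def codepoints(text):
    """Render a string as a list of U+XXXX labels for easy comparison."""
    return [f"U+{ord(ch):04X}" for ch in text]


def is_nfc_stable(text):
    """True if NFC normalization would leave the text unchanged."""
    return unicodedata.normalize("NFC", text) == text


nfc = unicodedata.normalize("NFC", original)

if __name__ == "__main__":
    print("original:", codepoints(original))
    print("NFC:     ", codepoints(nfc))
    print("stable under NFC:", is_nfc_stable(original))
```

A check like `is_nfc_stable` can be used to detect whether a processing pipeline would silently alter the mark order, without actually normalizing the text.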
Updated: January 27, 2017
A Perl script generates a JSON version of the morphology, which is published to npm at https://www.npmjs.com/package/morphhb
The perl script which generates this is called morphhbXML-to-JSON.pl. It has several options:
You can run this script like so:
perl morphhbXML-to-JSON.pl --stripPointing --removeLemmaTypes --stripHFromMorph --prefixLemmasWithH --remapVerses