shared_doc_objects.rb

needed:
  tables
  code
  poem
  group
  etc.
  ocn numbers
  hash numbers

#consider a hash of:
  :metadata or :header or @md? #with hash or array of metadata, decide on which, may be possible to use param & @md
  :substance, :text or :body #array of hash objects
    :obj #actual substantive text or content of object, :obj preferred to :txt as may contain any of the items described in :type
    :type #which may be one of: paragraph, heading? code, verse, grouped text, table, image/figure?
    :ocn #object citation number
    :lv or :heading? #level, document structure, heading level
    :name #html segment name and location name
    :index #contains book index information if provided
    :hash? #consider, need hash for clean and marked up text, then also for each footnote contained within the object
    :meta #array or hash?
  :header and :data, however see suggestion that :header or :meta should be separate from :body or :substance

#paragraph operation
  indent 1-9 #if exists want to know how much to indent by, 1-9
  heading #isn't this synonymous with level?

#keep bold, italic attributes within text?

# :obj container for
  heading?
  paragraph
  grouped text
    code
    alt
    poem / verse
  table
  image / figure
  comment #retain comments?

later investigate use of extroprot (instead of or as well as Marshal)

consider making easy to represent also as XML

consider: additional structure fields to represent document, xml dom, node, sax
  pro: readily available, can be extended further
  con: additional (metaverse) parsing to create them initially

xml or html txt field?
  pro: readily available
  con: bigger objects, more memory
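
The hash layout considered above could be sketched roughly as follows. This is only an illustration, not a settled design: it assumes :meta and :body as the top-level keys and :lv for heading level (the notes leave all of these open), treats indent 0 as "no indent", and uses SHA-256 for the per-object :hash, none of which the notes decide. The helper name build_doc_object and the OBJECT_TYPES list are hypothetical. The round trip through Marshal at the end is there because the notes mention it as the current serialisation option.

```ruby
require 'digest/sha2'

# Hypothetical list of object types, taken from the ":obj container for" notes.
OBJECT_TYPES = %i[heading paragraph code verse group table image comment].freeze

# Build one document object as a plain hash.
# Field names follow the notes; the choice of :lv over :heading is an assumption.
def build_doc_object(type:, obj:, ocn: nil, lv: nil, name: nil, indent: 0)
  raise ArgumentError, "unknown type: #{type}" unless OBJECT_TYPES.include?(type)
  raise ArgumentError, 'indent must be 0-9' unless (0..9).cover?(indent)

  {
    type:   type,    # paragraph, heading, code, verse, ...
    obj:    obj,     # substantive text or content of the object
    ocn:    ocn,     # object citation number
    lv:     lv,      # heading level, nil for non-headings
    name:   name,    # html segment / location name
    indent: indent,  # 0 means no indent (assumption), otherwise 1-9
    hash:   Digest::SHA256.hexdigest(obj) # digest of the text (algorithm is an assumption)
  }
end

# Metadata kept separate from the body, as the notes suggest.
doc = {
  meta: { title: 'Example Document', creator: 'Anon' },
  body: [
    build_doc_object(type: :heading, obj: 'Introduction', ocn: 1, lv: 1, name: 'intro'),
    build_doc_object(type: :paragraph, obj: 'First paragraph of text.', ocn: 2, indent: 1)
  ]
}

# Plain hashes of symbols, strings, integers and nil survive a Marshal round trip.
restored = Marshal.load(Marshal.dump(doc))
puts restored == doc                              # prints "true"
puts restored[:body].map { |o| o[:ocn] }.inspect  # prints "[1, 2]"
```

Keeping each object a plain hash, rather than a custom class, means Marshal needs no extra support code, and the same structure could later be walked to emit XML. The cost is that nothing enforces the field set beyond the checks in the builder.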