As formulated on the GOALS page, we think it would help to offer a test suite of data for software programmers.
To show off the features and possibilities of Better GEDCOM, we should write/create separate files for various features. This way, it will be easier to see what the importing application did with the information in the BG-file.
What should be in a BetterGEDCOM Testsuite? (please edit / add your ideas!)
- A "sources and evidence only" BG file.
This should demonstrate how one can use BG to share sources and the directly derived evidence. It should include media files (like JPGs, PDFs, DjVu, TXT, MP3...), the transcriptions of the sources, and the evidence persons and events that one can extract from the sources. The sources should be given varying "surety/quality" assessment. There should be "primary" sources (or more likely copies of such) and "secondary" sources, down to simple notes of a conversation or similar. The sources should be organized in a tree so that a citation can be derived easily.
- A nested BG file
This should show that one can import a GEDCOM or another BG into a BG as a source, and that the imported file will not be changed. This could also show how UUIDs for persons work.
- A "straight from old GEDCOM" file
Showing how an example GEDCOM file will translate into BG if "assimilated" and not used as a source. This could show how a "conclusion only" way of working is still possible with BG.
The GEDCOM should also be provided. It should use all allowed tags and some user-defined tags. It should have all possible variations in structure (e.g. NOTEs inline and as records...)
- A simple demonstration of the handling of evidence-conclusion with reasoning
This would need only a few conclusion persons and events, but a few more (conflicting) evidence persons and events. There should probably be sources included for the evidence, but maybe a few unsourced/un-explained conclusions could be interesting.
- A demonstration of complex families and multi-role events
Showing adoptions, half-siblings, multiple marriages / divorces / re-marriages, non-married couples, missing persons, missing dates, names, PACTs and probably much more.
Other "groups" (ships?) could possibly go here, too.
- A "farm history"
Showing how people, groups, events and dates can be linked with a place
- A "places only" file
Showing how places can be linked (or not), and the way to record places over time.
I believe a really well thought-out test suite would be of enormous value for programmers and would help BG become successful.
I also think that building this suite will be an enormous task!
End-users should try and think of difficult real-life (and imagined) examples and just write them down. Try to "condense" as many difficulties into a few constructed "cases".
Techies would then build these examples into BG-files. The specification doesn't have to be finished for this. Working on examples while formulating the definitions of BG will help with finding and fixing problems.
Test Cases
- The mystery Aunt
- GJ Cases
Discussions
Then just write the whole "case" down in plain text, either on this page or on a new sub-page (but please don't use the discussion pages). You don't have to give all the source data or the true names and places, just as much as is needed to understand the problem(s) and your reasoning on the way to a possible conclusion.
Then the techies (Tom, Louis, Mike, Adrian, Christoffer, someone from GRAMPS,...) can show how this case would end up looking in their file format.
Then we can discuss differences and hopefully see what works best.
1. Cases that follow the research model--these should begin with a focused goal (a question that seeks the answer to a genealogically relevant question; these are questions of identity or relationship) and proceed through the research cycle to a conclusion.
2. Cases that follow the research model, but take very different paths:
(a) Cases where the genealogically relevant question is solved by relying only on the application of direct evidence
(b) Cases where the genealogically relevant question is solved by relying on the application of direct and indirect evidence
(c) Cases that involve where the genealogically relevant question is solved by relying on the application of all forms of evidence--direct, indirect, circumstantial, negative
3. Cases that involve conflicting evidence (of all forms), such that the genealogically relevant question may be solved only by weighing all forms of evidence--direct, indirect, circumstantial, negative.
4. Cases, as above, more specific to materials in increasing or different degrees of complexity:
(a) Materials intended to provide particular information directly, from within the four corners of the document. For example, a birth records is intended to answer specific questions that are genealogically relevant about a birth fact, a death record is intended to answer specific questions about the death fact.
(b) Materials intended to provide particular information, but from within the four corners of the document, that information either does not, or is not intended to answer genealogically relevant questions. For example, estate documents, deeds/real estate transactions; some census. These should range from simple to more complex.
(c) Materials that rely on the identity of the creator, event, annotation and provenience or some other context in order to make some determination about information therein. Of particular interest would be cases that contain multiple references or perspectives on what might or could be the same person or event. These should range from simple to more complex.