The merging after import is the best GRAMPS offers I think.
But indeed slow, and you can overlook things, and when in families, the
family is only merged when both parents are merged.
Perhaps some bugs in the code too...
If you really don't want to enter the information, and don't want to merge
too much, you need to know if there is some kind of unique identifier in the
GEDCOM to quickly retrieve identical info.
You could then merge manually with xml files, import every gedcom that is a
family in a seperate grdb, export to xml, and open the xml in a text editor
after unzipping. The xml files of the family should be small. Copying them
manually in the xml file of your large database might be feasible. That is,
in the family xml you change the handle to the handle of the persons already
in the database via the replace command. then copy all data not yet in the
larger xml. You could keep a spreadsheet with the mapping between GRAMPS
handle and the unique identifier (if present in the GEDCOM)
Error prone, but if you know how GRAMPS xml works, quite doable. All depends
how large the files are you download.
Another option is to first import all small GEDCOMS in a new empty database,
clean that as much as possible, then import in your real database when you
are satisfied. There is also a CSV import plugin written by Doug but not
part of GRAMPS (see our website) which might help somewhat (wouldn't know,
Import with merge is something many people already broke their heads on. It
is very difficult to do. Some programs offer a unique identifier (UID) for
this task, but even then, it is hard to know on differing data what to do.
GRAMPS has chosen not to do things automatically, but depend on the user to
start the merging and check the data.
2007/10/29, Jeremy C. Reed <reed@...>:
> So I found some of my wife's family at familysearch.org. But they only
> allow downloading one family at a time.
> So now I have a mess of triplicate records. Let me explain:
> 1) Person A and family lists parents (B)
> 2) Person B and family lists child A and lists B's parents (C) and
> 3) Person C and family lists child B and lists C's parents (D) and
> So 3 GEDCOM downloads. And now I have person A twice, person B three
> times, and person C twice. Then I have spouses and keeping following
> So I keep merging as I grab new records. Very time consuming.
> Then I start seeing in Gramps, triplicate identical birth events,
> and other identical events.
> And even worse, I see a husband with the same wife three times and with
> three sets of children (of course with identical names and identical data
> who are all merged together). And when I look at a person's siblings, I
> see that have duplicated brothers and sisters.
> So now I have hundreds of names and thousands of events. I downloaded
> probably about 50 GEDCOM files (50 families). And it is an unusable mess.
> I am going to start over from scratch.
> I'd prefer to not manually enter in the data.
> I asked their support and was told that they don't support multiple
> families in one download and it was suggested I try proprietary commercial
> PAF Insight.
> Do you have any suggestions on how I can put that data (one family GED
> file at a time) into my gramps database cleanly (no duplicated details)?
> I did see a perl-based crawler for grabbing data from familysearch.org but
> haven't tried it yet.
> Or do you have any suggestions on how I can merge this triplicate
> information without having the merged records containing triplicate
> events, duplicated siblings, etc.?
> I am using gramps as provided by Ubuntu -- version 2.2.6-1ubuntu1.
> Jeremy C. Reed
> p.s. Has anyone else seen this?
> This SF.net email is sponsored by: Splunk Inc.
> Still grepping through log files to find problems? Stop.
> Now Search log events and configuration files using AJAX and a browser.
> Download your FREE copy of Splunk now >> http://get.splunk.com/
> Gramps-users mailing list