Is there a way to map or associate one text string with another?

+2 votes
228 views

Is there a way to map or associate one text string with another (such as a GEDmatch kit number with a WikiTree ID)?

Let’s say you find a triangulated group in GEDmatch consisting of four GEDmatch kit numbers.   If they are in WikiTree then is there a quick way to know their WikiTree IDs so you can run WikiTree’s relationship finder to possibly see their shared ancestral couple?

Let’s say a DNA confirmation citation for autosomal DNA in WikiTree can’t use GEDmatch IDs (kit numbers) because GEDmatch’s use policy does not let you publish someone else’s GEDmatch kit number. Is there a quick way to know the corresponding GEDmatch kit numbers for the WikiTree IDs in that citation?

Here is what Perplexity AI suggests: This typically involves creating a lookup table or database that pairs the original string with the alternative string. To achieve this:

1. **Create a Database**: You can set up a database where one column contains the original text strings (e.g., user IDs), and another column contains their corresponding alternative strings (e.g., names).

2. Use **Scripting or Coding**: If you’re dealing with web pages, you can use scripting or coding to search for the original text string and then look it up in your database to display the alternative string. This could be done with JavaScript or other programming languages.

3. **Browser Extensions**: There might be browser extensions or add-ons that can assist with this. They can be customized to recognize specific text strings and display associated information.

4. **Content Management Systems (CMS)**: If you have control over the web pages, CMS platforms often allow for custom fields where you can associate data.

5. **Third-party Software**: Some third-party software solutions exist for data extraction and transformation. Tools like Beautiful Soup for Python can help extract and manipulate data on web pages.

in WikiTree Tech by Peter Roberts G2G6 Pilot (717k points)
edited by Peter Roberts

Regarding number 3, Perplexity AI found (but I’m unsure it is a good solution): Regex Search is a Chrome extension that allows you to search for regular expressions on web pages.

Regular expressions are patterns used to match character combinations in strings.

With regular expressions, you can search for specific patterns, as opposed to certain terms and phrases. You can customize Regex Search to recognize specific text strings and display associated information.

Regular expressions use metacharacters in conjunction with a search engine to retrieve specific patterns. Metacharacters are the building blocks of regular expressions. For example, “\d” in a regular expression is a metacharacter that represents a digit character. “d” stands for the literal character, “d.” You can use regular expressions to search for social security numbers, patent numbers, URLs, email addresses, Bates numbers, and other strings that follow a specific pattern.

There are several implementations of regular expressions. The differences in implementations usually include the way special characters are handled and how character classes are treated.

It should be possible to add an API for getting a wikitree if from a GEDMatch kit #, then it would be possible to add the wanted functionality in a browser extension.

yes  Many thanks John!

Peter, can you clarify your starting point?  What are you searching that already contains the GedMatch ID and WikiTree ID?

Is your example the case where the other match is on WikiTree and has their GedMatch code already attached to their profile?

If so, then it would be down to the nature of how the GedMatch code is stored within the Wikitree DB.

Ideally you would want a column that has only the GedMatch ID and then you could index that column for fast lookup. If the GedMatch ID is buried in a larger text field then it is far more expensive to search. You wouldn't need a regex for that, because you presumably already know the exact  GedMatch ID.
Hello Cameron,

The following WikiTree IDs share an ancestor who could have contributed to their X chromosome.  Do they also share an X chromosome segment? To easily answer that, there needs to be a quick way to know their GEDmatch IDs:

Higgs-1000

Lowe-2022

Lowe-10520

Sawyer-1294

Lowe-2382

You can see their X-DNA relationship if you go to

https://www.wikitree.com/index.php?title=Special:Relationship&action=calculate&person1Name=Higgs-1000&person2Name=Lowe-2022

and then enter the following in the "Advanced filter" field: Lowe-10520,Sawyer-1294,Lowe-2382

Then click the FILTER button and then select Martha Unknown as the common ancestor.  The others in that drop down list can't be the X chromosome ancestor because their relationship trail includes a father and son.

WikiTree's relationship finder has the names of the WikiTree DNA test takers.  Hovering a name creates a pop up window with their WikiTree ID.  One improvement would be if hovering over their name also included their GEDmatch ID with a scissor icon & ID (copy option).
The above may not answer your question well enough.  I'll work on a different answer.

In this example I am in GEDmatch: "Let’s say you find a triangulated group in GEDmatch consisting of four GEDmatch kit numbers.   If they are in WikiTree then is there a quick way to know their WikiTree IDs so you can run WikiTree’s relationship finder to possibly see their shared ancestral couple?"

GEDmatch has a round about way to see which GEDmatch IDs are in WikiTree and discover their WikiTree IDs.  See 

https://m.youtube.com/watch?v=1NgD9b3s0Gs

I wish there was a way I could tell my Web browser that if it encounters the text string (i.e. GEDmatch ID) LL4694669 then display Higgs-1000 and give me the option to copy that WikiTree ID.

In this example I'm in WikiTree:  "Let’s say a DNA confirmation citation for autosomal DNA in WikiTree can’t use GEDmatch IDs (kit numbers) because GEDmatch’s use policy does not let you publish someone else’s GEDmatch kit number. Is there a quick way to know the corresponding GEDmatch kit numbers for the WikiTree IDs in that citation?"

This example is the case where the other match is on WikiTree and has their GEDmatch ID already attached to their profile.

I believe one solution is if hovering over a name in the citation created a pop up window with their WikiTree ID and GEDmatch ID and each had a scissor icon & ID (copy option).

I now see that hovering over a name and getting a pop up window is called previews.

It would be much easier to associate GEDmatch IDs (i.e. kit #'s) with WikiTree IDs if previews also included GEDmatch IDs with scissor icon.

Also having a scissor icon next to a GEDmatch ID (where ever one appears in WikiTree) would make it much easier to make use of GEDmatch's tools.

1 Answer

+4 votes

Hello Peter!

Regular expressions (regexes) are a time honored tool for text processing. See [ https://en.wikipedia.org/wiki/Regular_expression ]. That article’s “Uses” section has this to say, “While regexes would be useful on Internet search engines, processing them across the entire database could consume excessive computer resources depending on the complexity and design of the regex. Although in many cases system administrators can run regex-based queries internally, most search engines do not offer regex support to the public. Notable exceptions include Google Code Search and Exalead. However, Google Code Search was shut down in January 2012.”

Another long-time text processing tool is “grep”. See [ https://en.wikipedia.org/wiki/Grep ].

One of your questions asked, “Let’s say you find a triangulated group in GEDmatch consisting of four GEDmatch kit numbers. If they are in WikiTree then is there a quick way to know their WikiTree IDs so you can run WikiTree’s relationship finder to possibly see their shared ancestral couple?” Perhaps the easiest quick way is to use a prefix in your search bar, “site:wikitree.com ”, with the identifier you are currently looking for. As an analogy, you would search the following to find your own WikiTree profile:
site:wikitree.com Roberts-7085

 

by A. Murphy G2G6 Mach 2 (22.8k points)
Or the other way around: I entered my gedMatch ID and "site:wikitree.com" and my profile was top of the list.

Related questions

+10 votes
2 answers
246 views asked Feb 25, 2023 in WikiTree Tech by Frances Piercy-Reins G2G6 Mach 9 (90.1k points)
+12 votes
1 answer
+23 votes
6 answers
505 views asked May 23, 2022 in The Tree House by Steven Harris G2G6 Pilot (760k points)
+11 votes
2 answers
323 views asked Apr 20, 2022 in The Tree House by Chase Ashley G2G6 Pilot (316k points)
+15 votes
2 answers
315 views asked Apr 23, 2014 in WikiTree Tech by Peter Roberts G2G6 Pilot (717k points)

WikiTree  ~  About  ~  Help Help  ~  Search Person Search  ~  Surname:

disclaimer - terms - copyright

...