I have been tracking several statistics that approximately represent the quality of the Wikitree database. I initially conducted an assessment every 6 months, but Wikitree is now large enough that the statistics change slowly, so I have switched to an annual update. The following is a summary as of November 2023:
Overall status: 36.2 M total profiles; 86% are connected; 34% have DNA test connections
Sourcing: about 22% with 3 or more sources, 36% with 1-2 sources, 12% poorly sourced, 22% unsourced, and 8% unavailable
Profiles with known consistency issues: 101,100 (up 4,600 since Nov 2022)
Undated profiles: 418,000 (down 26,000 since Nov 2022)
Duplicate profiles: 3-13% (Dec 2021 estimate)
Compared with Nov 2022, there are 3.9 M more profiles. The estimated fraction of profiles with 1 or more sources (the "3 or more" and "1-2" groups combined) is about 58%, up from the 2022 estimate.
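For anyone who wants to check the arithmetic, here is a small Python sketch that derives the implied figures from the numbers in this post. The variable names are my own, and the Nov 2022 total is not an independently reported figure; it is simply the 2023 total minus the stated increase.

```python
# Figures quoted in the summary above (profile counts in millions).
total_2023_m = 36.2
added_since_2022_m = 3.9

# Sourcing breakdown in percent, as listed in the summary.
sourcing_pct = {
    "3 or more sources": 22,
    "1-2 sources": 36,
    "poorly sourced": 12,
    "unsourced": 22,
    "unavailable": 8,
}

# Implied Nov 2022 total and year-over-year growth (derived, not reported).
total_2022_m = total_2023_m - added_since_2022_m
growth_pct = 100 * added_since_2022_m / total_2022_m

# "1 or more sources" is the sum of the first two categories.
sourced_pct = sourcing_pct["3 or more sources"] + sourcing_pct["1-2 sources"]

print(f"Implied Nov 2022 total: {total_2022_m:.1f} M")
print(f"Growth since Nov 2022: {growth_pct:.1f}%")
print(f"Profiles with 1 or more sources: {sourced_pct}%")
print(f"Breakdown sums to: {sum(sourcing_pct.values())}%")
```

This gives an implied Nov 2022 total of about 32.3 M profiles, or roughly 12% growth over the year.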
A Free Space page is available with graphs, historical data and technical details. In essence, the sourcing review is done by manually checking a random sample of profiles and looking at the sources listed on each.
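Because the sourcing figures come from a random sample, they carry some sampling uncertainty. The Python sketch below shows roughly how large that could be, using the standard normal approximation for a sample proportion. The sample size of 400 is purely a hypothetical value for illustration (the actual sample size is not stated in this post), and `margin_of_error` is a helper name made up for this example.

```python
import math

def margin_of_error(p: float, n: int, z: float = 1.96) -> float:
    """Approximate half-width of a confidence interval for a sample
    proportion p from n randomly chosen profiles (normal approximation).
    z = 1.96 corresponds to a 95% confidence level."""
    return z * math.sqrt(p * (1 - p) / n)

# ASSUMPTION: 400 is a hypothetical sample size, not the one actually used.
hypothetical_sample_size = 400
sourced_fraction = 0.58  # "1 or more sources" estimate from the summary

moe = margin_of_error(sourced_fraction, hypothetical_sample_size)
low, high = sourced_fraction - moe, sourced_fraction + moe
print(f"58% plus or minus {100 * moe:.1f} points ({100 * low:.1f}% to {100 * high:.1f}%)")
```

With a sample of that size the estimate would be good to within about 5 percentage points either way. Because the margin shrinks with the square root of the sample size, quadrupling the sample only halves it.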