This post is the most compelling assessment I've seen of the situation to date. Thank you for writing this up and sharing your analysis.
I've seen the full arguments from both sides of the Rootclaim debate, and regardless of people's opinions on how the debate played out with information available at the time, it seems undeniable that studies and key information have emerged since the debate which seem to overwhelmingly conclude that the wet market has extremely low odds of being the origin of the virus, as pointed out in your analysis, which would dramatically downward shift the probability assigned to the most important point in support of Zoonosis. Many commenters don't seem to be aware of how recent some of the key points of evidence are (some as recent as March 2024) which either were not available or may not have been fully understood at the time of the debate.
For those interested, here's a brief list of some recent information that updated me towards lab leak. This is for the sake of explaining my thoughts to others, but is in no way all-encompassing. Michael does a far superior job of explaining these in great depth.
- Study published March 5th, 2024 finding intermediate sequences between Lineage A and B (https://doi.org/10.1093/ve/veae020). This research shows that Lineage B very likely came from Lineage A. All cases in the market were Lineage B, but none were Lineage A. In short, the research shows that a single spillover is much more likely than a double-spillover Zoonotic event. The double-spillover theory is a foundational argument of the ZW theory that Peter Miller and others use. This is a massive blow to the probability that the wet market was the origin of the virus, to the point where it now seems extremely *unlikely* that the wet market was the origin.
- Wildlife trade in Wuhan is significantly less than Wuhan's percentage of the population, which significantly changes the probabilities downwards of a ZW origin in the bayesian calculations that Peter Miller and others use.
- Although the DEFUSE proposal leaked in 2021, more recent drafts were discovered in 2024 which contain what appears to be damning evidence. New information included their approach using restriction enzymes (BsaI/BsmBI) that ultimately matched precisely with what Bruttel et al. (2022) found as the assembly process that would create exactly this virus, years before this DEFUSE draft leak was even public. Michael describes the degree of how unlikely this would be if the origin was Zoonotic. The DEFUSE budget leak confirms that they were purchasing these enzymes. Additionally, the new documents contained draft comments that were not available in the original leaked proposal. Among many other things, the comments show that the research work was actually planned to be done at the WIV at BSL-2 levels for cost reduction, but they edited the final document to "BSL-3" because they thought "US researchers will likely freak out" if they knew this research was being done in lower safety BSL-2 labs. The researchers seemed to think the distinction didn't matter for their research and that it was bureaucratic tape slowing them down, so they fudged the proposal to hide this. Considering BSL-2 labs are not sufficiently designed to contain airborne disease (whereas BSL-3 labs are), this does not seem to be an insignificant point in this whole debate.
Peter Miller claimed that because the DEFUSE proposal was rejected meant that the proposed work never happened (or at least had an exceedingly low likelihood of occurring), which led to the proposal essentially being dismissed as evidence, at least when it came to evaluating the arguments and probabilities. However, it is unequivocally the case that many research scientists conduct research long before they apply for the grant for that research. It is also often the case that they apply for the grant while in the middle of conducting research. This is confirmed by countless research scientists online and it's just how research often is done due to the difficulty and delays of receiving funding. Peter's conclusion that the DEFUSE research did not happen because it was rejected is so at odds with how scientists conduct research that at best I would consider Peter ignorant of this point, and at worst it would seem Peter intentionally manipulated this point and tried to get the DEFUSE proposal dismissed as evidence because it would dramatically undermine his argument and potentially change the entire outcome of the debate.
There's more, but I'll wrap this up for now. My key takeaway is that Michael's approach and evidence seem more updated, complete, and compelling than anything else available online on the topic to date. It seems especially relevant and a better analysis than either sides of the Rootclaim debate, and I believe more people should be aware of this post and read it in full before making conclusions.
I think that the HSM market story has taken serious hits, negating the importance of the badly flawed Worobey/Pekar papers. If you'd asked 5 years ago, people warning of zoonosis wouldn't have put an especially high fraction of their priors on market spillovers, since there are also farms, transport, etc. So I think the big factors assigned by some to HSM are nonsense but removing them doesn't do much to reduce the overall ZW priors.
The biggest change has been from finding the DEFUSE restriction enzyme plans. Before, most people (including me) had not considered that pattern as usable evidence because it was unclear how much ex post facto choice of what to look for was involved, i.e. the old multiple comparisons problem. After Emily Kopp published the DEFUSE plans that were stunningly close to the pattern Bruttel et al. had noticed, that factor became worth taking very seriously.
I have one major suggestion, which is more discussion of how to interpret the numbers coming out. In particular, I was bothered by this analysis for a long time because the conclusion seemed too certain given the number of missing pieces. Intuitively, it seems like there should be some factor in there which has no information about the conclusion but vastly increases the uncertainty. Some or possibly all of this comes from me wanting to interpret the final numbers as representing degrees of certainty instead of subjective probabilities, but I doubt I'm the only one with that confusion.
I think this could be discussed in a few places - in the introduction, at the end when combining priors with evidence, and once at the point where you discuss the vast gap between RaTG13 and SC2. Any origin theory requires some sequences in this gap, and finding them could decide the case either way depending on whether they show up in wildlife or in a lab notebook. Maybe there's a way to evaluate counterfactual evidence formally but just discussing the significance of the missing evidence would help. For everybody's sanity there should be a clear distinction between "100:1 odds it was a lab leak" and "available evidence favors lab leak by 100:1".
There is a second small issue relating to the Bruttel et al paper and the seeming confirmation from the DEFUSE draft. I think it was Alina Chan who pointed out that the BsaI/BsmBI construction showed up in an earlier Baric paper, and that Bruttel et al probably read that paper. I didn't see any followup on that. Your wording seems to suggest that Bruttel et al predicted the choice of enzymes from looking at the genome, but that may be too strong. You didn't use it as evidence but it might be worth tweaking the text.
And a really small point: I would take the part about "quantifying friggin likely' out of the title. Most people won't get the joke, and you are so consistent about taking the high road in the rest of the document.
I can't thank you enough for this work: I really hope that you keep on this. Having it in a decent journal would be a huge step forward.
I'm sluggishly starting (with some possible coauthors) to convert this to peer-review-ready format. There will be no "friggin'" in the title. Meanwhile, I already gave the P.O. authors all the good lines because they spoke in such clear ordinary English.
I guess I should mention the preceding Baric paper. It is relevant to P(Bruttell guessing the right enzymes|ZW), but that doesn't enter directly in my calculations. To the extent that it affects P(pattern|LL), which does enter, it increases it and raises the odds. I'd thought about using it but was lazy and at some point the odds get extreme.
That brings me to the big philosophical point. I do discuss it in a narrow sense at the specific point about the sequence gap. "Although fully knowing what sequences were in Wuhan labs would be almost equivalent to answering the origins question, our current estimate of what’s there would mostly just be based on the other evidence leaning toward LL, ZL, or ZW, augmented a bit by a highly subjective sense of how forthright people are likely to be. We don’t want to either double-count our other evidence or introduce especially subjective terms."
I think this "what about missing evidence?" problem comes up in every Bayesian estimate. I've worried about it a lot, as have others. It sort of nags at the back of the mind. One can usually imagine various types of evidence that would, if available, outweigh everything that we have our hands on. If you tried to change odds based on pure ignorance of the non-evidence, all odds would approach 1/1. That would not only be wrong, it would be logically inconsistent. E.g. it would give odds (A or B) vs. C = 1 if A and B are lumped, but 2/1 if viewed as separate outcomes.
So how to deal with the sense that somehow the odds based on what we know can't be as extreme as what they seem to be? For people calling election results or horse races, they have enough cases to allow calibration. Do the ones they call 2/1 give about 2/1 in results? Etc. For pandemics, we fortunately don't have enough to calibrate.
If I were reading this blog and not writing it, I'd just take it in attenuated form. Once people start seeing things one way, whether by priors or by the first evidence they see, it starts to get hard to see other sides. So I think a reasonable reader who doesn't want to spend a huge amount of time tracking down individual factors would tend to discount the odds beyond the hierarchical and robust discounts I've used. But I can't think of a logically coherent way of saying that the odds aren't what they are because someday we may know them better.
Thanks for the explanation - discounting the odds was my first approach, and when I re-read the document looking specifically for this question I did see that you gestured towards it in the section about the sequence gap. Maybe your pedagogy is working as intended.
I guess what I'm thinking of is to estimate odds for a counterfactual like "nearby ancestral genome including FCS shows up in a wildlife sample around Wuhan" which most people would consider dispositive, and see if the method agrees. This seems like it would be a useful sanity check if it doesn't taint the analysis.
Happily, for a peer-reviewed version you can assume more expert readers.
I did sort of quickly run through a negative control when I saw someone claim that a paper using very different (non-Bayesian) methods might have indicated a possible lab origin of MERS. I think the method here comes down heavily for MERS being natural.
If a close ancestor with an FCS etc had showed up anywhere, even Yunnan, then I wouldn't even bother looking at LL. Given the possible routes to Wuhan, ZL would still be very much in the running.
I'm reading the part about the furine-cleavage site now. Of course there is the story about the sodiumchannel ENaC, but there are two other hypothesis. There is the possibility that the furine-cleavage site in the Feline Enteric Coronovirus inspired researchers, although I'm not very convinced of this possibility. But it is certain from Baric's testimony for the Subcommittee that it were these viruses that made him think about the function of the FCS. So it's not that strange to think that the Chinese also had knowledge about these viruses too.
I think this one is much more convincing. https://zenodo.org/records/10203207. This study describes a Nuclear Localisation Sequence PRRARSV that is in SARS-CoV-2. No other coronavirus in the family of bètacoronaviruses has this sequence: only a MERS-clone after thirty cycles of serial passaging has the same sequence. The study comes from a group of virologists of the University of Iowa, but the first author of the study did his PhD in virology in the Wuhan Institute of Virology from 2005 to 2010.
This would be an explanation for choosing for the 'suboptimal' FCS sequence. If they had chosen for the optimal sequence RARR, the sequence would not function as an NLS. Maybe this detail could be of help in determining the odds that this virus was genetically engineered.
I removed what seemed to have been a duplicate of this Comment.
I agree that the FIPV FCS and the MERS FCS are highly relevant to what WIV and its collaborators might have chosen, especially since zoonati have claimed that the proline would not have been used by a lab. I've been hesitant to add even more details to an over-long blog, but maybe could put something in the relevant Appendix.
That's perfectly fine (about the duplicate). I might have hit the post button twice. It indeed is a very long blog, but very, very useful. I read most of the studies that you referenced, but still there are some things that I didn't notice, after two years of reading and writing. Just wanted to let you know about the MERS-clone, because that's the only possibility that is somewhat watertight, especially about the proline in the lead, of which Baric stated that he never would have chosen this variant.
"These convey information complementary to the centroids because they emphasize the most clustered points rather than the more distance ones. In their supplementary material there is also a KDE map for the linked cases."
How curious... the supplement is very, very long, with many illustrations of the proces how to localise the early cases. But just this one figure, just the one figure that could cast doubt on their hypothesis, that figure is missing. I am certainly willing to look objectively at any study - although there is particularly little left of the study by Worobey et al after the extensive criticisms of several scientists - but this is quite extraordinary.
It is a shame, though, that this constantly updated Bayesian analysis has not received more attention, except from insiders. I understand that it is a difficult piece for people who do not know Bayes' Theorem. It was not an easy piece for me either, and I only now really understand where its principle is, whereas years ago I thought I understood it. And that too only because I got help with it from a professor of probability calculus. And yet this should be published in a mainstream or popular science journal somewhere.
A final question is whether the scenario of Line 2 by Steven Quay would add anything. This is because it can actually explain a lot of aspects very well, without getting into all sorts of twists and turns, especially including the early spread outside China of early variants of the virus. The only problem I have is that even this scenario need not directly point to a lab escape. A zoonotic disease could also spread in this way. The only thing then is that Line 2 also has the WIV and a military hospital, where apparently the first four samples were collected.
On publication, I checked informally with Ann of Appl Stats about preparing a shorter more formal version. An editor said, accurately, it wasn't quite suitable because they like to publish innovative techniques and mine were conventional. If somebody comes up with a likely journal, I may rouse enough energy (or find an energetic coauthor) to make that tighter version. It would remove introductory apologetics, the conventional Bayes intro, remarks in evolution of the versions, ... Also convert the refs to standard form rather than just links.
I've never been much for writing long papers, and dealing with some health issues reinforces that tendency.
Thanks for that link. First impressions: The 4 PLA-hospital patients with records starting on Dec. 10, 2019 are really interesting. On first reading, I'm dubious about the statistics on the Line-2 associated cases. Rejecting a toy null is a bad procedure regardless of who does it. Also, the connection of the international cases with Line 2 is kind of silly. You'd get a similar international pattern regardless of how the infected people got to the airport.
That is exactly my doubt. It is a very attractive hypothesis to explain how the virus spread across Wuhan - see also the New York analysis - but it is no more than that. A zoonotic disease would spread in exactly the same way, with then the fact that the PLA's hospital and the WIV on the Line 2 route would increase the likelihood of it being a lableak. On the other hand, it does not argue against the market hypothesis, because it too could spread through Line 2. So I'm not sure how to assess this piece: probably only as a possible hypothesis.
The link to the airport is in the fact that Line 2 is the only subway line that ends there. But that indeed does not rule out the possibility that the virus could have gotten there by other means. It is just a very fast and direct route, from a situation that is very ideal to spread a virus: the subway.
Thanks. Surely this is very special. Omitting the KDE from the cases that have a direct relationship with the Huanan market. Not including in the actual article nor in the supplement. Just don't bring it up and don't mention it. Surely this suggests that the authors had a particular goal with the publication and deliberately just ignored the details that did not fit with that. Because not only is the KDE quite different from that of all cases and those with no relation to the market, but it also does not center on the market. I cannot escape the impression that the authors did not have pure intentions here.
If you look closely at the KDEs in their paper, you see the "all" noticeably more spread out than the "unlinked", but the effect isn't all that striking because there are only a few "linked" included in "all". I made a .ppt for a talk with those maps superimposed to make it clear, but then found Walker's version that separates out the "linked" rather than hiding them in the "all".
"Wu’s data also show that over time SC2 variants drifted away from the initial pattern by picking up extra sites, as one would expect if the initial form was not a product of random evolution. Unlike other sequence features, there’s no non-random fitness associated with this pattern under ZW."
This seems like a very good argument by Wu, but of course it is not. The "drift" of a reverse-genetic system would occur with every possible clone if it were so massively distributed around the world. Because even if it were a reverse-genetic system, it is still an RNA virus, which mutates regularly. That's really a very different situation than using a reverse-genetics system in the laboratory. Even when creating a clone, the genome sequence is checked afterwards to make sure no significant mutations occurred during cloning. And Wu himself has been trained in the WIV, and knows full well that this is so. But this is really a complete fallacy.
I'm not sure I follow your point. What I'm saying (contra Wu) is that the drift toward a more typical RE site pattern is just what you'd expect if the initial pattern had been selected. Since there's no non-random natural selection for the pattern, that supports the existence of a lab selection. I can't really reproduce whatever Wu's reasoning was supposed to be. I think you agree but am not sure if you're adding another point.
A few other remarks: there is even stronger evidence, whereby the genome was not only found by PCR, but even with sequencing.
Amendola A, Bianchi S, Gori M, et al. Evidence of SARS-CoV-2 RNA in an Oropharyngeal Swab Specimen, Milan, Italy, Early December 2019. Emerg Infect Dis. 2021;27(2):648-650. doi:10.3201/eid2702.204632.
This one is about a child, four years old, that developed a cough and a runny nose on 21 november 2019. Also in Italy. PCR and sequencing confirmed SARS-CoV-2.
Amendola A, Canuti M, Bianchi S, et al. Molecular evidence for SARS-CoV-2 in samples collected from patients with morbilliform eruptions since late 2019 in Lombardy, northern Italy. Environ Res. 2022;215(Pt 1):113979. doi:10.1016/j.envres.2022.113979.
The second one is a study from a reference laboratory for measles and Rubella. The possible rash of Covid-19 is very much alike this infections. First positive sample is a four months old baby. Later it tested positive for IgG and IgM. These could have been false positives, but the complete picture tells us that this is very probably a real case."
About the sequences that were gathered after 2016, this study from 2022 tells us more about this.
Wu Z, Han Y, Wang Y, et al. A comprehensive survey of bat sarbecoviruses across China in relation to the origins of SARS-CoV and SARS-CoV-2. Natl Sci Rev. 2022;10(6):nwac213. Published 2022 Oct 11. doi:10.1093/nsr/nwac213
But the problem is whether this study can be trusted. Holmes as well as the english virologist Alice Hughes - who did a lot of fieldwork in China - say it is highly unlikely that they didn't find any other SARS-CoV-2 related coronavirus, not even in the coppermine in Moijang. And we know that the Chinese from early 2020 strongly censored all scientific publications on SARS-CoV-2 and Covid-19.
"Although the possibility of artifacts cannot be excluded, if La Rosa et al’s found wastewater evidence that SC2 started to show up substantially in northern Italy by December 18th 2019 and Fongaro et al. found similar evidence for November 27 in Brazil. If either of these seemingly careful reports is correct the spillover would have had to have been more in the range estimated by most of the papers rather than the late estimate of the 2022 Pekar paper'
Forget my last remark. Now I see how you got to the number of 19. These are all zoonotic diseases, although some of them are well known for ages. I'm not sure how this would impact your calculations, but I think it lowers the odds for the ZW, because the period is very long.
I checked the reference and the table. It indeed lists 19 pathogens, but they are not all new emerging diseases. The text remarks that roughly half of them are. The other ones are pathogens that we alreadly knew, and for which it is not clear that they are new emerging pathogens. For instance, tuberculosis has been known for ages? That would change the odds, wouldn't it?
"Since then, the number of notifiable diseases has increased from 15 to 41 (in 2011), with emerging zoonoses accounting for over 50% (Table 1 )"
"Before looking numerically at the LL probabilities, let’s look at the competing ZW background. A tabulation from 2014 of important pathogens emerging in China since the 1950’s lists 19 different ones, including one sarbecovirus—SC1, the original SARS. From that one can roughly estimate that the probability of a significant new pathogen in any year, e.g. 2019, would be
Yes. I bent over backwards to give a big ZW prior, part of being conservative. I should have been more explicit about that point.
But in the end it doesn't matter because the one SARS becomes a higher fraction. It basically comes down to one SARS coronavirus in several decades of comparable risk.
I'll try to patch up that explanation later today or tomorrow. But any effect on the priors will be cancelled by P(sarbecovirus|ZW) giving the same net odds.
I'd just been lazy about writing that up and appreciate your careful reading!
Tiny error I saw when reading again. I thought that in the earlier versions there was a literature list with the other Bayes analyses that were done, or am I mistaken?
Thanks... As your analyses is the most comprehensive and the most recent updated, I'm going to use him for the final part of my book - a form of final conclusion, as it were, how to integrate all the evidence and all the events. That will still require some study, I'm afraid, but I think it will be a nice conclusion.
"I’m definitely not endorsing the cruel opposition to strenuous public health measures that seems to have become associated with skepticism about the zoonotic account."
Well, when we talk of scientific proof, it would be wise to look at what the evidence was and is for the efficacy of all these stringent public health measures, and look at the possibility that the cure was far worse than the disease. There are some highly competent people like Jay Bhattacharya and Martin Kulldorff who warned that this would be the case. They are very experienced Public Health experts. And I think that Sweden has proved that it was so.
Apparently you're not allowed to speak out about these matters, without being called 'cruel'. Disappointing. Very disappointing. Doesn't make your story more credible, to say the least.
have you read Bhattacharya's early work in this area? E.g. the Santa Clara infection fatality rate paper? Failure to use standard confidence intervals for false positives gave an error of up to a factor of infinity in the estimate. Improper recruitment methods (not acknowledged in the paper) heavily slanted the results. Etc. So they came up with "we estimate a local infection fatality rate of 0.17%.". The *population* fatality rate in the US is more than twice that.
So I absolutely stand by the position that this is a serious disease. That's now personally reinforced by some fairly bad sequelae to a mild case. Air filtering and more use of masks are fully justified.
So Bhattacharya and I disagree intensely on a crucial issue (pubic health measures) but agree on another (origins). I don't think that judging arguments by whether someone is on your team's side or not is a good way of getting at the truth.
There is a steep gradient in the IFR in terms of age, noting that the IFR for children and adolescents is lower than for influenza. I have never heard mathematicians and physicists complain about that. Old people die, I am sorry for you, and they have been doing so for a very long time. I can know, because I have accompanied many neurological patients who were old and tired of their days to a comfortable death. And really, the IFR for SARS-CoV-2 is up to twice as high as for influenza. And yes, I have seen the necessary people die from influenza, including young people, as well as children.
Also, in terms of facemasks, you are completely uninformed. Every clinician has long known that the protection of a surgical mask is virtually zero with aerogenically transmitted viruses. We do it in the hospital as an "incantation ritual," but almost every physician knows it is utter nonsense. That was the consensus before 2020, and has been demonstrated in many studies.
Two of the best people in evidence-based medicine, Carl Heneghan and Tom Jeffersen, conducted a systematic review of the effect of facemasks. I know there has been a lot of criticism of that, but that what mainly for political purposes, because they did nothing different than they did before in writing their reviews. But somewhere politicians and policy makers had decided that facemasks must be effective, and so this did not come in handy.
I find it very striking that the hard betas in particular were so uncritical of the measures taken. I'm not even talking about the environmental impact of a measure that was based on nothing more than dressing down the public, and making critics immediately visible and scapegoating them. This had nothing, but nothing to do with public health. This had everything to do with indoctrination and enforcing certain behavior.
Then again. What does this whole discussion have to do with the origin of SARS-CoV-2? What is the point of a derogatory comment about people who are very justified, and very well reasoned criticisms of the measures taken? Which - when it comes to preventing mortality - have also had a very marginal effect.
Such a comment has no place in this piece at all. With a swing of the pen, very experienced and well-informed doctors and scientists are dismissed here as cruel. Really, I find it outrageous.
This post is the most compelling assessment I've seen of the situation to date. Thank you for writing this up and sharing your analysis.
I've seen the full arguments from both sides of the Rootclaim debate, and regardless of people's opinions on how the debate played out with information available at the time, it seems undeniable that studies and key information have emerged since the debate which seem to overwhelmingly conclude that the wet market has extremely low odds of being the origin of the virus, as pointed out in your analysis, which would dramatically downward shift the probability assigned to the most important point in support of Zoonosis. Many commenters don't seem to be aware of how recent some of the key points of evidence are (some as recent as March 2024) which either were not available or may not have been fully understood at the time of the debate.
For those interested, here's a brief list of some recent information that updated me towards lab leak. This is for the sake of explaining my thoughts to others, but is in no way all-encompassing. Michael does a far superior job of explaining these in great depth.
- Study published March 5th, 2024 finding intermediate sequences between Lineage A and B (https://doi.org/10.1093/ve/veae020). This research shows that Lineage B very likely came from Lineage A. All cases in the market were Lineage B, but none were Lineage A. In short, the research shows that a single spillover is much more likely than a double-spillover Zoonotic event. The double-spillover theory is a foundational argument of the ZW theory that Peter Miller and others use. This is a massive blow to the probability that the wet market was the origin of the virus, to the point where it now seems extremely *unlikely* that the wet market was the origin.
- Wildlife trade in Wuhan is significantly less than Wuhan's percentage of the population, which significantly changes the probabilities downwards of a ZW origin in the bayesian calculations that Peter Miller and others use.
- Although the DEFUSE proposal leaked in 2021, more recent drafts were discovered in 2024 which contain what appears to be damning evidence. New information included their approach using restriction enzymes (BsaI/BsmBI) that ultimately matched precisely with what Bruttel et al. (2022) found as the assembly process that would create exactly this virus, years before this DEFUSE draft leak was even public. Michael describes the degree of how unlikely this would be if the origin was Zoonotic. The DEFUSE budget leak confirms that they were purchasing these enzymes. Additionally, the new documents contained draft comments that were not available in the original leaked proposal. Among many other things, the comments show that the research work was actually planned to be done at the WIV at BSL-2 levels for cost reduction, but they edited the final document to "BSL-3" because they thought "US researchers will likely freak out" if they knew this research was being done in lower safety BSL-2 labs. The researchers seemed to think the distinction didn't matter for their research and that it was bureaucratic tape slowing them down, so they fudged the proposal to hide this. Considering BSL-2 labs are not sufficiently designed to contain airborne disease (whereas BSL-3 labs are), this does not seem to be an insignificant point in this whole debate.
Peter Miller claimed that because the DEFUSE proposal was rejected meant that the proposed work never happened (or at least had an exceedingly low likelihood of occurring), which led to the proposal essentially being dismissed as evidence, at least when it came to evaluating the arguments and probabilities. However, it is unequivocally the case that many research scientists conduct research long before they apply for the grant for that research. It is also often the case that they apply for the grant while in the middle of conducting research. This is confirmed by countless research scientists online and it's just how research often is done due to the difficulty and delays of receiving funding. Peter's conclusion that the DEFUSE research did not happen because it was rejected is so at odds with how scientists conduct research that at best I would consider Peter ignorant of this point, and at worst it would seem Peter intentionally manipulated this point and tried to get the DEFUSE proposal dismissed as evidence because it would dramatically undermine his argument and potentially change the entire outcome of the debate.
There's more, but I'll wrap this up for now. My key takeaway is that Michael's approach and evidence seem more updated, complete, and compelling than anything else available online on the topic to date. It seems especially relevant and a better analysis than either sides of the Rootclaim debate, and I believe more people should be aware of this post and read it in full before making conclusions.
Yes, nice summary of the newer evidence.
I think that the HSM market story has taken serious hits, negating the importance of the badly flawed Worobey/Pekar papers. If you'd asked 5 years ago, people warning of zoonosis wouldn't have put an especially high fraction of their priors on market spillovers, since there are also farms, transport, etc. So I think the big factors assigned by some to HSM are nonsense but removing them doesn't do much to reduce the overall ZW priors.
The biggest change has been from finding the DEFUSE restriction enzyme plans. Before, most people (including me) had not considered that pattern as usable evidence because it was unclear how much ex post facto choice of what to look for was involved, i.e. the old multiple comparisons problem. After Emily Kopp published the DEFUSE plans that were stunningly close to the pattern Bruttel et al. had noticed, that factor became worth taking very seriously.
Thank you so much for this work.
I have one major suggestion, which is more discussion of how to interpret the numbers coming out. In particular, I was bothered by this analysis for a long time because the conclusion seemed too certain given the number of missing pieces. Intuitively, it seems like there should be some factor in there which has no information about the conclusion but vastly increases the uncertainty. Some or possibly all of this comes from me wanting to interpret the final numbers as representing degrees of certainty instead of subjective probabilities, but I doubt I'm the only one with that confusion.
I think this could be discussed in a few places - in the introduction, at the end when combining priors with evidence, and once at the point where you discuss the vast gap between RaTG13 and SC2. Any origin theory requires some sequences in this gap, and finding them could decide the case either way depending on whether they show up in wildlife or in a lab notebook. Maybe there's a way to evaluate counterfactual evidence formally but just discussing the significance of the missing evidence would help. For everybody's sanity there should be a clear distinction between "100:1 odds it was a lab leak" and "available evidence favors lab leak by 100:1".
There is a second small issue relating to the Bruttel et al paper and the seeming confirmation from the DEFUSE draft. I think it was Alina Chan who pointed out that the BsaI/BsmBI construction showed up in an earlier Baric paper, and that Bruttel et al probably read that paper. I didn't see any followup on that. Your wording seems to suggest that Bruttel et al predicted the choice of enzymes from looking at the genome, but that may be too strong. You didn't use it as evidence but it might be worth tweaking the text.
And a really small point: I would take the part about "quantifying friggin likely' out of the title. Most people won't get the joke, and you are so consistent about taking the high road in the rest of the document.
I can't thank you enough for this work: I really hope that you keep on this. Having it in a decent journal would be a huge step forward.
Good points. Taken in reverse:
I'm sluggishly starting (with some possible coauthors) to convert this to peer-review-ready format. There will be no "friggin'" in the title. Meanwhile, I already gave the P.O. authors all the good lines because they spoke in such clear ordinary English.
I guess I should mention the preceding Baric paper. It is relevant to P(Bruttell guessing the right enzymes|ZW), but that doesn't enter directly in my calculations. To the extent that it affects P(pattern|LL), which does enter, it increases it and raises the odds. I'd thought about using it but was lazy and at some point the odds get extreme.
That brings me to the big philosophical point. I do discuss it in a narrow sense at the specific point about the sequence gap. "Although fully knowing what sequences were in Wuhan labs would be almost equivalent to answering the origins question, our current estimate of what’s there would mostly just be based on the other evidence leaning toward LL, ZL, or ZW, augmented a bit by a highly subjective sense of how forthright people are likely to be. We don’t want to either double-count our other evidence or introduce especially subjective terms."
I think this "what about missing evidence?" problem comes up in every Bayesian estimate. I've worried about it a lot, as have others. It sort of nags at the back of the mind. One can usually imagine various types of evidence that would, if available, outweigh everything that we have our hands on. If you tried to change odds based on pure ignorance of the non-evidence, all odds would approach 1/1. That would not only be wrong, it would be logically inconsistent. E.g. it would give odds (A or B) vs. C = 1 if A and B are lumped, but 2/1 if viewed as separate outcomes.
So how to deal with the sense that somehow the odds based on what we know can't be as extreme as what they seem to be? For people calling election results or horse races, they have enough cases to allow calibration. Do the ones they call 2/1 give about 2/1 in results? Etc. For pandemics, we fortunately don't have enough to calibrate.
If I were reading this blog and not writing it, I'd just take it in attenuated form. Once people start seeing things one way, whether by priors or by the first evidence they see, it starts to get hard to see other sides. So I think a reasonable reader who doesn't want to spend a huge amount of time tracking down individual factors would tend to discount the odds beyond the hierarchical and robust discounts I've used. But I can't think of a logically coherent way of saying that the odds aren't what they are because someday we may know them better.
Thanks for the explanation - discounting the odds was my first approach, and when I re-read the document looking specifically for this question I did see that you gestured towards it in the section about the sequence gap. Maybe your pedagogy is working as intended.
I guess what I'm thinking of is to estimate odds for a counterfactual like "nearby ancestral genome including FCS shows up in a wildlife sample around Wuhan" which most people would consider dispositive, and see if the method agrees. This seems like it would be a useful sanity check if it doesn't taint the analysis.
Happily, for a peer-reviewed version you can assume more expert readers.
I did sort of quickly run through a negative control when I saw someone claim that a paper using very different (non-Bayesian) methods might have indicated a possible lab origin of MERS. I think the method here comes down heavily for MERS being natural.
If a close ancestor with an FCS etc had showed up anywhere, even Yunnan, then I wouldn't even bother looking at LL. Given the possible routes to Wuhan, ZL would still be very much in the running.
https://x.com/john_bumblebee/status/1810335076623712265.
I'm reading the part about the furine-cleavage site now. Of course there is the story about the sodiumchannel ENaC, but there are two other hypothesis. There is the possibility that the furine-cleavage site in the Feline Enteric Coronovirus inspired researchers, although I'm not very convinced of this possibility. But it is certain from Baric's testimony for the Subcommittee that it were these viruses that made him think about the function of the FCS. So it's not that strange to think that the Chinese also had knowledge about these viruses too.
I think this one is much more convincing. https://zenodo.org/records/10203207. This study describes a Nuclear Localisation Sequence PRRARSV that is in SARS-CoV-2. No other coronavirus in the family of bètacoronaviruses has this sequence: only a MERS-clone after thirty cycles of serial passaging has the same sequence. The study comes from a group of virologists of the University of Iowa, but the first author of the study did his PhD in virology in the Wuhan Institute of Virology from 2005 to 2010.
This would be an explanation for choosing for the 'suboptimal' FCS sequence. If they had chosen for the optimal sequence RARR, the sequence would not function as an NLS. Maybe this detail could be of help in determining the odds that this virus was genetically engineered.
https://www.frontiersin.org/journals/microbiology/articles/10.3389/fmicb.2023.1073789/full. This proves the function of this sequence as being real.
I removed what seemed to have been a duplicate of this Comment.
I agree that the FIPV FCS and the MERS FCS are highly relevant to what WIV and its collaborators might have chosen, especially since zoonati have claimed that the proline would not have been used by a lab. I've been hesitant to add even more details to an over-long blog, but maybe could put something in the relevant Appendix.
That's perfectly fine (about the duplicate). I might have hit the post button twice. It indeed is a very long blog, but very, very useful. I read most of the studies that you referenced, but still there are some things that I didn't notice, after two years of reading and writing. Just wanted to let you know about the MERS-clone, because that's the only possibility that is somewhat watertight, especially about the proline in the lead, of which Baric stated that he never would have chosen this variant.
Can't share the 'summary' of your blog. It's too long... :)
"These convey information complementary to the centroids because they emphasize the most clustered points rather than the more distance ones. In their supplementary material there is also a KDE map for the linked cases."
Couldn't find in the supplement...
Aha. I didn't think that Walker had to prepare the KDE map. It does turn out Worobey et al. made it, included it on github (https://github.com/sars-cov-2-origins/huanan-market/blob/main/maps/geojson/who_cases_dec-2019.linked.KDE.contours.geojson) but left it out of the supplementary materials.
How curious... the supplement is very, very long, with many illustrations of the proces how to localise the early cases. But just this one figure, just the one figure that could cast doubt on their hypothesis, that figure is missing. I am certainly willing to look objectively at any study - although there is particularly little left of the study by Worobey et al after the extensive criticisms of several scientists - but this is quite extraordinary.
It is a shame, though, that this constantly updated Bayesian analysis has not received more attention, except from insiders. I understand that it is a difficult piece for people who do not know Bayes' Theorem. It was not an easy piece for me either, and I only now really understand where its principle is, whereas years ago I thought I understood it. And that too only because I got help with it from a professor of probability calculus. And yet this should be published in a mainstream or popular science journal somewhere.
A final question is whether the scenario of Line 2 by Steven Quay would add anything. This is because it can actually explain a lot of aspects very well, without getting into all sorts of twists and turns, especially including the early spread outside China of early variants of the virus. The only problem I have is that even this scenario need not directly point to a lab escape. A zoonotic disease could also spread in this way. The only thing then is that Line 2 also has the WIV and a military hospital, where apparently the first four samples were collected.
https://www.academia.edu/82244361/Where_Did_the_2019_Coronavirus_Pandemic_Begin_and_How_Did_it_Spread_The_Peoples_Liberation_Army_Hospital_in_Wuhan_China_and_Line_2_of_the_Wuhan_Metro_System_Are_Compelling_Answers
On publication, I checked informally with Ann of Appl Stats about preparing a shorter more formal version. An editor said, accurately, it wasn't quite suitable because they like to publish innovative techniques and mine were conventional. If somebody comes up with a likely journal, I may rouse enough energy (or find an energetic coauthor) to make that tighter version. It would remove introductory apologetics, the conventional Bayes intro, remarks in evolution of the versions, ... Also convert the refs to standard form rather than just links.
I've never been much for writing long papers, and dealing with some health issues reinforces that tendency.
Well, so far for science. So they weren't interested in a publication with a statistical technique that has proven it's value. Curious.
I'm going to ask around a bit. But I'm certainly no scientist, zo I'm afraid you would know better than me where this could be published.
I think AoAS was sincere. Their theme is innovative techniques, which I don't have.
Thanks for that link. First impressions: The 4 PLA-hospital patients with records starting on Dec. 10, 2019 are really interesting. On first reading, I'm dubious about the statistics on the Line-2 associated cases. Rejecting a toy null is a bad procedure regardless of who does it. Also, the connection of the international cases with Line 2 is kind of silly. You'd get a similar international pattern regardless of how the infected people got to the airport.
That is exactly my doubt. It is a very attractive hypothesis to explain how the virus spread across Wuhan - see also the New York analysis - but it is no more than that. A zoonotic disease would spread in exactly the same way, with then the fact that the PLA's hospital and the WIV on the Line 2 route would increase the likelihood of it being a lableak. On the other hand, it does not argue against the market hypothesis, because it too could spread through Line 2. So I'm not sure how to assess this piece: probably only as a possible hypothesis.
The link to the airport is in the fact that Line 2 is the only subway line that ends there. But that indeed does not rule out the possibility that the virus could have gotten there by other means. It is just a very fast and direct route, from a situation that is very ideal to spread a virus: the subway.
Yipes. I can't either. The points are there but not the KDE map. Dan Walker must have prepared that from the points. I'll revise right away.
Thanks. Surely this is very special. Omitting the KDE from the cases that have a direct relationship with the Huanan market. Not including in the actual article nor in the supplement. Just don't bring it up and don't mention it. Surely this suggests that the authors had a particular goal with the publication and deliberately just ignored the details that did not fit with that. Because not only is the KDE quite different from that of all cases and those with no relation to the market, but it also does not center on the market. I cannot escape the impression that the authors did not have pure intentions here.
If you look closely at the KDEs in their paper, you see the "all" noticeably more spread out than the "unlinked", but the effect isn't all that striking because there are only a few "linked" included in "all". I made a .ppt for a talk with those maps superimposed to make it clear, but then found Walker's version that separates out the "linked" rather than hiding them in the "all".
"Wu’s data also show that over time SC2 variants drifted away from the initial pattern by picking up extra sites, as one would expect if the initial form was not a product of random evolution. Unlike other sequence features, there’s no non-random fitness associated with this pattern under ZW."
This seems like a very good argument by Wu, but of course it is not. The "drift" of a reverse-genetic system would occur with every possible clone if it were so massively distributed around the world. Because even if it were a reverse-genetic system, it is still an RNA virus, which mutates regularly. That's really a very different situation than using a reverse-genetics system in the laboratory. Even when creating a clone, the genome sequence is checked afterwards to make sure no significant mutations occurred during cloning. And Wu himself has been trained in the WIV, and knows full well that this is so. But this is really a complete fallacy.
I'm not sure I follow your point. What I'm saying (contra Wu) is that the drift toward a more typical RE site pattern is just what you'd expect if the initial pattern had been selected. Since there's no non-random natural selection for the pattern, that supports the existence of a lab selection. I can't really reproduce whatever Wu's reasoning was supposed to be. I think you agree but am not sure if you're adding another point.
Now I understand. Thanks.
A few other remarks: there is even stronger evidence, whereby the genome was not only found by PCR, but even with sequencing.
Amendola A, Bianchi S, Gori M, et al. Evidence of SARS-CoV-2 RNA in an Oropharyngeal Swab Specimen, Milan, Italy, Early December 2019. Emerg Infect Dis. 2021;27(2):648-650. doi:10.3201/eid2702.204632.
This one is about a child, four years old, that developed a cough and a runny nose on 21 november 2019. Also in Italy. PCR and sequencing confirmed SARS-CoV-2.
Amendola A, Canuti M, Bianchi S, et al. Molecular evidence for SARS-CoV-2 in samples collected from patients with morbilliform eruptions since late 2019 in Lombardy, northern Italy. Environ Res. 2022;215(Pt 1):113979. doi:10.1016/j.envres.2022.113979.
The second one is a study from a reference laboratory for measles and Rubella. The possible rash of Covid-19 is very much alike this infections. First positive sample is a four months old baby. Later it tested positive for IgG and IgM. These could have been false positives, but the complete picture tells us that this is very probably a real case."
About the sequences that were gathered after 2016, this study from 2022 tells us more about this.
Wu Z, Han Y, Wang Y, et al. A comprehensive survey of bat sarbecoviruses across China in relation to the origins of SARS-CoV and SARS-CoV-2. Natl Sci Rev. 2022;10(6):nwac213. Published 2022 Oct 11. doi:10.1093/nsr/nwac213
But the problem is whether this study can be trusted. Holmes as well as the english virologist Alice Hughes - who did a lot of fieldwork in China - say it is highly unlikely that they didn't find any other SARS-CoV-2 related coronavirus, not even in the coppermine in Moijang. And we know that the Chinese from early 2020 strongly censored all scientific publications on SARS-CoV-2 and Covid-19.
"Although the possibility of artifacts cannot be excluded, if La Rosa et al’s found wastewater evidence that SC2 started to show up substantially in northern Italy by December 18th 2019 and Fongaro et al. found similar evidence for November 27 in Brazil. If either of these seemingly careful reports is correct the spillover would have had to have been more in the range estimated by most of the papers rather than the late estimate of the 2022 Pekar paper'
Useful. Among other things, it implies I'd better go back over my grammar.
Forget my last remark. Now I see how you got to the number of 19. These are all zoonotic diseases, although some of them are well known for ages. I'm not sure how this would impact your calculations, but I think it lowers the odds for the ZW, because the period is very long.
Another remark:
I checked the reference and the table. It indeed lists 19 pathogens, but they are not all new emerging diseases. The text remarks that roughly half of them are. The other ones are pathogens that we alreadly knew, and for which it is not clear that they are new emerging pathogens. For instance, tuberculosis has been known for ages? That would change the odds, wouldn't it?
"Since then, the number of notifiable diseases has increased from 15 to 41 (in 2011), with emerging zoonoses accounting for over 50% (Table 1 )"
"Before looking numerically at the LL probabilities, let’s look at the competing ZW background. A tabulation from 2014 of important pathogens emerging in China since the 1950’s lists 19 different ones, including one sarbecovirus—SC1, the original SARS. From that one can roughly estimate that the probability of a significant new pathogen in any year, e.g. 2019, would be
P0(2019, ZW) = ~ 1/3."
Yes. I bent over backwards to give a big ZW prior, part of being conservative. I should have been more explicit about that point.
But in the end it doesn't matter because the one SARS becomes a higher fraction. It basically comes down to one SARS coronavirus in several decades of comparable risk.
I'll try to patch up that explanation later today or tomorrow. But any effect on the priors will be cancelled by P(sarbecovirus|ZW) giving the same net odds.
I'd just been lazy about writing that up and appreciate your careful reading!
Thanks! When I'm done with the section from the book, could I post it here? So you can see it is an adequate sort of abstract?
sure
Sorry! Already found the other ones in your piece!
"There gave been about a dozen more...'
Tiny error I saw when reading again. I thought that in the earlier versions there was a literature list with the other Bayes analyses that were done, or am I mistaken?
I go over them in Appendix 1, with links. As more came in it got a bit too detailed to summarize more in the main text.
Thanks... As your analyses is the most comprehensive and the most recent updated, I'm going to use him for the final part of my book - a form of final conclusion, as it were, how to integrate all the evidence and all the events. That will still require some study, I'm afraid, but I think it will be a nice conclusion.
"I’m definitely not endorsing the cruel opposition to strenuous public health measures that seems to have become associated with skepticism about the zoonotic account."
Well, when we talk of scientific proof, it would be wise to look at what the evidence was and is for the efficacy of all these stringent public health measures, and look at the possibility that the cure was far worse than the disease. There are some highly competent people like Jay Bhattacharya and Martin Kulldorff who warned that this would be the case. They are very experienced Public Health experts. And I think that Sweden has proved that it was so.
Apparently you're not allowed to speak out about these matters, without being called 'cruel'. Disappointing. Very disappointing. Doesn't make your story more credible, to say the least.
have you read Bhattacharya's early work in this area? E.g. the Santa Clara infection fatality rate paper? Failure to use standard confidence intervals for false positives gave an error of up to a factor of infinity in the estimate. Improper recruitment methods (not acknowledged in the paper) heavily slanted the results. Etc. So they came up with "we estimate a local infection fatality rate of 0.17%.". The *population* fatality rate in the US is more than twice that.
So I absolutely stand by the position that this is a serious disease. That's now personally reinforced by some fairly bad sequelae to a mild case. Air filtering and more use of masks are fully justified.
So Bhattacharya and I disagree intensely on a crucial issue (pubic health measures) but agree on another (origins). I don't think that judging arguments by whether someone is on your team's side or not is a good way of getting at the truth.
There is a steep gradient in the IFR in terms of age, noting that the IFR for children and adolescents is lower than for influenza. I have never heard mathematicians and physicists complain about that. Old people die, I am sorry for you, and they have been doing so for a very long time. I can know, because I have accompanied many neurological patients who were old and tired of their days to a comfortable death. And really, the IFR for SARS-CoV-2 is up to twice as high as for influenza. And yes, I have seen the necessary people die from influenza, including young people, as well as children.
Also, in terms of facemasks, you are completely uninformed. Every clinician has long known that the protection of a surgical mask is virtually zero with aerogenically transmitted viruses. We do it in the hospital as an "incantation ritual," but almost every physician knows it is utter nonsense. That was the consensus before 2020, and has been demonstrated in many studies.
Two of the best people in evidence-based medicine, Carl Heneghan and Tom Jeffersen, conducted a systematic review of the effect of facemasks. I know there has been a lot of criticism of that, but that what mainly for political purposes, because they did nothing different than they did before in writing their reviews. But somewhere politicians and policy makers had decided that facemasks must be effective, and so this did not come in handy.
https://www.cebm.net/covid-19/covid-19-masks-on-or-off/
I find it very striking that the hard betas in particular were so uncritical of the measures taken. I'm not even talking about the environmental impact of a measure that was based on nothing more than dressing down the public, and making critics immediately visible and scapegoating them. This had nothing, but nothing to do with public health. This had everything to do with indoctrination and enforcing certain behavior.
Then again. What does this whole discussion have to do with the origin of SARS-CoV-2? What is the point of a derogatory comment about people who are very justified, and very well reasoned criticisms of the measures taken? Which - when it comes to preventing mortality - have also had a very marginal effect.
Such a comment has no place in this piece at all. With a swing of the pen, very experienced and well-informed doctors and scientists are dismissed here as cruel. Really, I find it outrageous.
Q.E.D.
https://portal.research.lu.se/en/publications/a-literature-review-and-meta-analysis-of-the-effects-of-lockdowns