What is the true incidence of Coronavirus? An empirical model using evacuees from Wuhan

Many people think that the official estimates of total infections from nCov-2019 are lower than the true incidence because of failure to count mild infections, or even worse, for some this is due to a cover-up by the Chinese government trying to hide the real entitity of the epidemic. I came up with the idea of using infection counts from evacuees (people repatriated by other countries) to have the most reliable data.

Using estimates of pathogen prevalence following phylodynamic methods ( which measure how quickly observed genomes share ancestry to estimate the rate of exponential growth) Trevord Bedford estimates a median total incidence on 8 Feb of 55,800 total infections since start of epidemic with a 95% uncertainty interval of between 17,500 and 194,400 total infections. This approach estimates an infection-to-case reporting rate of between 18% and 100%.

MRC’s Neil Ferguson was estimating that official figures were underreporting by a factor of 10x the real number of total infections.

I adopted a rather straightforward approach to this, by using data on evacuees from Wuhan to other countries. These are groups of non Chinese citizens living in Wuhan that were repatriated to their home countries during the last few weeks following the coronavirus outbreak. Most countries placed these individuals in strict quarantine and monitored them for flu-like symptoms. The suspicious cases were tested for coronavirus.

An advantage of using infection data from evacuees is that all the individuals were screened for the virus, at least at a symptomatological level, and the individuals showing symptoms were tested in the lab for the virus. This will result in a complete report of infections, which includes also mild cases. Moreover, fears of a cover-up by China are dissipated because the tests are carried out in other countries.

The official number of infected individuals in Wuhan was obtained from the CSSE interactive map, on the day the evacuees were boarded on the flight back to their home countries. Dividing the number of infected individuals by the Wuhan population (11 millions) gives the incidence of the virus. This figure was compared to the incidence among the evacuees (N. infected/number of people evacuated) for each country. Dividing the incidence among the evacuees by the estimated incidence in Wuhan gives the extent of the reporting rate (or underestimation factor). The reporting rate is estimated at 17.3%, or an “underestimation factor” of 5.78x, lower than Neil Ferguson’s estimate but closer to the lower bound (18%) of Trevor Bedford’s prediction.

Today’s confirmed cases according to CSSE is 43,126. I estimate the true total infections to be 250,000 as of February 11th.

Table 1. Infections among Wuhan evacuees.

JapanItalyGermanyFranceUSAS.KoreaFrance 2USA 2TaiwanCanada
Infected5120121110
Total20654124179195700254600247174
DateJan 29Feb 3Feb 1Jan 31Jan 29Jan 31Feb 2Feb 5Feb 3Feb 7
Wuhan official infected693017730128701080069301080015480246601773030600
Wuhan total pop11100000111000001110000011100000111000001110000011100000111000001110000011100000
Wuhan official infected rate0.00062432432430.0015972972970.0011594594590.0009729729730.00062432432430.0009729729730.0013945945950.0022216216220.0015972972970.002756756757
Repatriates infected rate0.024271844660.018518518520.0161290322600.0051282051280.0028571428570.0039370078740.0016666666670.0040485829960
Underestimation coefficient38.8769806211.5936579613.9108203608.2140082142.9365079372.8230482820.75020275752.5346458690

The updated infections among Wuhan evacuees from different countries can be seen here.

One thought on “What is the true incidence of Coronavirus? An empirical model using evacuees from Wuhan

Leave a comment

Design a site like this with WordPress.com
Get started