From:  Considering the fragility index in reporting trials on diverticular disease

 Effects and importance of smaller and larger sample sizes on different fragility indices

IndexSmall sample effect (example)Large sample effectExplanation and importance
Fragility index (FI)A small RCT with 50 patients shows that Drug A is significantly better (p = 0.04). Changing 2 non-events to events makes p > 0.05 where FI = 2.A large RCT with 1,000 patients shows that drug B is significantly better (p = 0.04). Changing 15 non-events to events makes p > 0.05 where FI = 15.In the small trial (FI = 2), a tiny change in outcomes alters significance, meaning that the results are fragile.
In the larger trial (FI = 15), the results are more robust since more changes are needed in order to affect significance.
Reverse fragility index (rFI)A trial with 60 patients shows no significant difference (p = 0.07). Adding 3 new events to the treatment group makes p < 0.05 where rFI = 3.A large trial with 1,200 patients shows no significant difference (p = 0.07). Adding 20 new events to the treatment group makes p < 0.05 where rFI = 20.The small study (rFI = 3) is close to significance, requiring just 3 event reversals. The large study (rFI = 20) needs a much bigger shift, showing greater confidence that no real effect exists.
Fragility quotient (FQ)A study with FI = 3 and sample size = 200 results in an FQ = 0.015 (1.5%; 3/200).A study with FI = 20 and sample size = 2,000 results in an FQ = 0.01 (1%; 20/2,000).In the small study only 1.5% of the participants changing their outcome can alter significance. In the larger study an FQ = 1% is consistent with greater stability.
Incidence fragility index (FIq)A trial with an FI = 5 and a control group event rate = 10% (15/150 patients), results in an FIq = 0.33 (33%; 5/15).A larger trial with an FI = 30 and a control group event rate = 12% (360/3,000 patients), results in an FIq = 0.083 (8.3%; 30/360).In the small study a significant portion of the control group
must switch outcomes to change the significance. In the larger trial the result (8.3%) is more reliable across different event rates.
Generalized fragility index (GFIq)A study with an FI = 4, an rFI = 3, a sample size = 250, and a control group event rate = 12% (30/250) uses both the FI and rFI to generate a GFIq.A study with an FI = 18, an rFI = 15, a sample size = 5,000, and a control group event rate = 10% (500/5,000) results in a more stable GFIq.The small study has a lower GFIq, meaning both FI and rFI are low, making the results less stable. The large study’ has a higher GFIq demonstrating more resilience to statistical shifts.