Explanations
mentions of the 9/11 attacks and related conspiracy theories
Negative Logits
wine      -0.69
MLA       -0.67
cil       -0.67
Tile      -0.66
Mono      -0.65
Huawei    -0.65
DRAGON    -0.65
Thom      -0.65
aird      -0.63
raw       -0.62
Positive Logits
anniversary    0.97
mastermind     0.83
truth          0.83
devastation    0.81
Anniversary    0.81
Truth          0.80
bombings       0.80
Commission     0.80
Truth          0.79
victims        0.79
Activations Density: 0.102%