INDEX
Explanations
references to Osama bin Laden
New Auto-Interp
Negative Logits
aic
-0.72
antioxid
-0.70
Brew
-0.70
Ott
-0.68
Constantin
-0.67
Celt
-0.67
Lex
-0.67
Cullen
-0.66
Balt
-0.66
Rate
-0.66
POSITIVE LOGITS
Laden
0.95
himself
0.93
Hussein
0.91
addafi
0.85
ensis
0.85
abad
0.84
Jinping
0.83
assassinated
0.82
istani
0.81
terrorists
0.75
Activations Density 0.011%