INDEX
Explanations
mentions of specific names or references to people or places
New Auto-Interp
Negative Logits
Qiao
-0.68
Duterte
-0.65
Versions
-0.63
Paddock
-0.56
ivably
-0.55
ufact
-0.54
ctuary
-0.52
Kissinger
-0.52
ashtra
-0.52
0000000000000000
-0.51
POSITIVE LOGITS
ikuman
0.78
lyak
0.74
nik
0.73
insky
0.73
inski
0.72
oha
0.72
enei
0.70
nis
0.70
ewski
0.65
itsch
0.64
Activations Density 7.291%