Explanations
mentions of specific names or surnames
mentions of specific individuals and their roles or actions
Negative Logits
Token              Logit
CLASSIFIED         -0.75
EED                -0.74
(no token text)    -0.72
flies              -0.72
OTAL               -0.71
200000             -0.69
dress              -0.67
mails              -0.67
cade               -0.66
ãĥ¼ãĥ³             -0.65
Positive Logits
Token     Logit
Luk       1.14
uania     0.87
rils      0.86
ijn       0.78
inated    0.76
owitz     0.75
inate     0.74
ifer      0.74
itsch     0.73
lihood    0.73
Activations Density 0.032%