INDEX
Explanations
proper nouns, specifically names like "Russ" or "CIS"
references to individuals named "Russ" and related geopolitical concepts, particularly involving Russia
Negative Logits
Token          Logit
conspicuous    -0.75
nces           -0.66
decipher       -0.65
frequent       -0.63
sections       -0.62
thirst         -0.62
undai          -0.62
hypers         -0.62
disposal       -0.61
accompanying   -0.61
Positive Logits
Token          Logit
Russ           1.29
Russ           1.22
Russo          0.98
uese           0.87
Magnus         0.83
iets           0.81
illo           0.79
achev          0.79
acebook        0.78
keye           0.77
Activations Density 0.004%
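
The page does not say how the density figure is computed. A minimal sketch, assuming it is the fraction of sampled tokens on which the feature's activation is above zero; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and used only for illustration:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature fires at all. The dashboard itself does not define the term.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 4 of 100,000 tokens.
acts = [0.0] * 100_000
for i in (10, 2_000, 50_000, 99_999):
    acts[i] = 1.5

density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.004%
```

Under that assumption, a density of 0.004% corresponds to roughly 4 active tokens per 100,000, which fits a sparse feature that responds to a narrow set of tokens such as "Russ".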