INDEX
Explanations
names of Russian and Ukrainian individuals
names of individuals, particularly those of Russian origin
New Auto-Interp
Negative Logits
nels
-0.82
ãĥ¼ãĥĨ
-0.76
apple
-0.74
Fiesta
-0.73
plain
-0.73
finding
-0.72
isons
-0.72
roads
-0.70
neys
-0.68
cess
-0.68
POSITIVE LOGITS
Mikhail
1.10
Lavrov
1.09
Dmitry
1.02
Sergei
0.99
Putin
0.98
ovych
0.96
Sergey
0.95
Gork
0.95
Dmit
0.93
Ily
0.89
Activations Density 0.029%