INDEX
Explanations
specific identifiers or names associated with people and events
New Auto-Interp
Negative Logits
aepernick
-0.17
zyst
-0.17
_gem
-0.16
оген
-0.15
.Microsoft
-0.15
Yug
-0.15
ÅĻiv
-0.14
yon
-0.14
IRON
-0.14
anke
-0.14
POSITIVE LOGITS
Pred
0.24
Dark
0.23
Gor
0.23
pred
0.21
Dam
0.21
Ves
0.19
Milan
0.19
Bo
0.19
Dark
0.19
Rank
0.19
Activations Density 0.012%