INDEX
Explanations
names of places or individuals
the names of individuals or entities
New Auto-Interp
Negative Logits
====
-0.80
cx
-0.74
CY
-0.74
ELE
-0.73
Retro
-0.72
Ao
-0.71
ACS
-0.71
Gy
-0.70
CoC
-0.69
ATES
-0.69
POSITIVE LOGITS
man
1.87
mans
1.62
mann
1.57
MAN
1.44
men
1.31
eman
1.23
linger
1.16
woman
1.13
heimer
1.12
father
1.12
Activations Density 0.085%