INDEX
Explanations
names of places and organizations
New Auto-Interp
Negative Logits
staking
-0.62
roman
-0.58
forgiving
-0.56
lap
-0.53
allowances
-0.53
scratch
-0.52
borne
-0.52
pling
-0.51
blazing
-0.51
depress
-0.50
POSITIVE LOGITS
a
0.91
o
0.86
icz
0.85
opol
0.83
iak
0.82
i
0.81
Ã¥
0.81
vous
0.80
din
0.77
theless
0.76
Activations Density 1.156%