INDEX
Explanations
words related to specific names or terms
proper names of people, particularly those involved in sports or entertainment
New Auto-Interp
Negative Logits
ovych
-0.88
dfx
-0.78
CLASS
-0.70
#$
-0.64
gobl
-0.62
POSE
-0.61
¯
-0.60
reckoning
-0.59
PASS
-0.59
oppable
-0.59
POSITIVE LOGITS
xus
0.89
igham
0.86
eros
0.80
cius
0.78
oglu
0.77
antes
0.76
ortium
0.73
abre
0.70
ople
0.70
loe
0.70
Activations Density 0.332%