INDEX
Explanations
phrases related to significant accomplishments or characteristics
Negative Logits
oneself       -0.76
ÃŁ            -0.69
orks          -0.62
yourselves    -0.61
Federation    -0.60
Yose          -0.60
Brunswick     -0.59
annon         -0.58
Saud          -0.58
Spawn         -0.58

Positive Logits
mates         0.85
mate          0.84
counterparts  0.83
fulness       0.80
lessness      0.79
abroad        0.79
foray         0.78
ings          0.76
exploits      0.76
spree         0.76
Activations Density 1.615%