Explanations
expressions of plans, goals, or aspirations related to development and progress
Negative Logits
lsen      -0.18
anik      -0.17
union     -0.16
udiant    -0.16
ainer     -0.15
lista     -0.14
ongo      -0.14
_EXTERN   -0.14
essler    -0.14
isma      -0.14
Positive Logits
aries     0.21
naire     0.20
ning      0.17
egg       0.16
ight      0.16
naires    0.16
odiac     0.15
erchant   0.14
oss       0.14
ibus      0.14
Activations Density 0.019%