Explanations
words related to the purpose or intent behind actions or entities
references to the concept of purpose
Negative Logits
Chance     -0.65
Pick       -0.65
Leopard    -0.64
Drops      -0.62
alus       -0.61
Temper     -0.61
slopes     -0.61
aq         -0.60
Polo       -0.59
igans      -0.58
Positive Logits
ful        1.50
fully      1.23
fulness    1.13
FUL        1.01
lessness   0.98
full       0.96
lessly     0.93
ories      0.83
²¾         0.80
less       0.77
Activations Density: 0.064%