INDEX
Explanations
adjectives that describe capability or usability
Negative Logits
Token      Logit
ed         -0.27
ing        -0.25
arily      -0.22
edb        -0.21
ical       -0.19
ers        -0.19
fulness    -0.18
ième       -0.18
istry      -0.18
icals      -0.18
Positive Logits
Token      Logit
enough     0.27
atable     0.26
able       0.25
/un        0.22
(blank)    0.21
/non       0.20
ABLE       0.20
ble        0.20
mente      0.20
Enough     0.19
Activations Density 0.165%