Explanations
words related to the concept of being able or capable of something
Negative Logits
Token    Logit
ing      -0.87
n        -0.82
m        -0.74
es       -0.71
<eos>    -0.69
9        -0.68
th       -0.66
2        -0.65
<h2>     -0.63
↵↵       -0.62
Positive Logits
Token    Logit
izable   1.34
vable    1.25
urable   1.24
^(@)     1.22
Theſe    1.20
chable   1.19
Efq      1.19
myſelf   1.18
asable   1.17
Jefus    1.14
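
As a rough check of the explanation against the positive logits, the sketch below splits the ten listed tokens by whether they end in "able". The token/logit pairs are copied from the list above; the helper name `split_by_suffix` and the choice of "able" as the suffix to test are illustrative assumptions, not part of the dashboard.

```python
# Top positive logits copied from the list above: (token, logit).
positive_logits = [
    ("izable", 1.34), ("vable", 1.25), ("urable", 1.24), ("^(@)", 1.22),
    ("Theſe", 1.20), ("chable", 1.19), ("Efq", 1.19), ("myſelf", 1.18),
    ("asable", 1.17), ("Jefus", 1.14),
]

def split_by_suffix(pairs, suffix="able"):
    """Hypothetical helper: separate tokens that end with `suffix` from the rest."""
    matching = [tok for tok, _ in pairs if tok.endswith(suffix)]
    other = [tok for tok, _ in pairs if not tok.endswith(suffix)]
    return matching, other

able_tokens, other_tokens = split_by_suffix(positive_logits)
print(able_tokens)   # tokens consistent with the "able/capable" explanation
print(other_tokens)  # tokens the explanation does not obviously cover
```

On this list the split is even: five tokens end in "able" and five do not, so the explanation seems to describe only part of what the feature promotes.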
Activations Density: 0.253%
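
To put the density figure in more familiar terms, the short calculation below converts it to a fraction and to an approximate "one in N tokens" rate. It assumes that activation density means the share of sampled tokens on which the feature fires at all; the page itself does not define the term.

```python
# Activation density as reported above, in percent.
density_percent = 0.253

# Assumption: density = share of sampled tokens with a nonzero activation.
density_fraction = density_percent / 100

# Average number of tokens per activation under that assumption.
tokens_per_activation = 1 / density_fraction

print(f"fraction of tokens: {density_fraction:.5f}")
print(f"roughly one activation every {round(tokens_per_activation)} tokens")
```

Under that reading, the feature is sparse: it fires on roughly one token in every few hundred.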