INDEX
Explanations
verb forms related to actions and expectations in various contexts
New Auto-Interp
Negative Logits
ss
-0.18
ennon
-0.17
496
-0.15
igans
-0.15
ourselves
-0.15
_NAMESPACE
-0.15
hart
-0.14
ahr
-0.14
sworth
-0.14
tn
-0.14
POSITIVE LOGITS
-Mart
0.16
bic
0.15
heets
0.15
dol
0.15
akh
0.14
ì¶ĺ
0.14
edn
0.14
ansa
0.14
begr
0.14
med
0.14
Activations Density 0.114%