INDEX
Explanations
verbs and their variations related to action and agency
New Auto-Interp
Negative Logits
éĢĶ
-0.16
eri
-0.15
pus
-0.14
opa
-0.14
heavens
-0.14
dra
-0.14
an
-0.13
/UIKit
-0.13
ilda
-0.13
asn
-0.13
POSITIVE LOGITS
out
0.28
up
0.27
-up
0.24
-out
0.23
off
0.21
-off
0.20
åĩºåĵģ
0.17
down
0.16
-in
0.16
-down
0.16
Activations Density 0.242%