INDEX
Explanations
references to the word "hop" and its variations, particularly in contexts related to movement or action
New Auto-Interp
Negative Logits
opposite
-0.16
uns
-0.16
antee
-0.16
bre
-0.15
functional
-0.15
.arg
-0.14
quez
-0.14
иÑģÑĮ
-0.14
past
-0.14
βολή
-0.14
POSITIVE LOGITS
Ñģи
0.17
.struts
0.16
gor
0.15
gett
0.15
leg
0.15
oved
0.15
344
0.15
_COMPAT
0.14
PROTO
0.14
shit
0.14
Activations Density 0.011%