INDEX
Explanations
operations or actions that are physically impactful or intense
words related to actions or activities that imply movement or completion
New Auto-Interp
Negative Logits
REF
-0.63
UE
-0.61
ISH
-0.60
yss
-0.59
Narr
-0.58
اÙĦ
-0.58
LESS
-0.58
stewards
-0.57
âĸ¬
-0.56
SOURCE
-0.55
POSITIVE LOGITS
paces
1.43
ettings
1.40
pace
1.39
creen
1.32
mith
1.24
omething
1.23
hift
1.20
hops
1.19
hips
1.16
peed
1.13
Activations Density 0.520%