INDEX
Explanations
phrases related to sequential actions or events
phrases associated with animals or animal-related actions
New Auto-Interp
Negative Logits
interstitial
-0.73
³
-0.70
ת
-0.68
ersive
-0.67
-0.65
obil
-0.65
âĶľâĶĢâĶĢ
-0.64
ÙĪ
-0.64
arcer
-0.64
ocumented
-0.63
POSITIVE LOGITS
incompetence
0.67
inaction
0.64
whoever
0.62
whine
0.61
bait
0.61
shove
0.59
quo
0.59
throttle
0.59
wink
0.59
fail
0.58
Activations Density 1.129%