Explanations
verbs related to actions or behaviors
action verbs and their usage in context
Negative Logits
berus          -0.75
Applic         -0.74
fired          -0.72
winner         -0.71
struction      -0.68
apo            -0.68
éĹ             -0.66
affer          -0.65
shut           -0.65
pletion        -0.63
Positive Logits
themselves     1.13
differently    0.97
theirs         0.95
their          0.92
THEIR          0.84
alike          0.80
instinctively  0.76
endlessly      0.74
freely         0.73
ingly          0.69
Activations Density: 0.523%