INDEX
Explanations
verbs related to combat or physical confrontation
New Auto-Interp
Negative Logits
Dun
-0.17
Brad
-0.16
appen
-0.16
870
-0.15
conting
-0.15
ceipt
-0.15
Tele
-0.14
Bras
-0.14
Hort
-0.14
Richardson
-0.14
POSITIVE LOGITS
blow
0.20
blows
0.19
balance
0.18
ellan
0.18
struck
0.17
balance
0.17
Strike
0.16
strike
0.16
Pose
0.16
ģm
0.16
Activations Density 0.037%