INDEX
Explanations
expressions of conflict or confrontation
New Auto-Interp
Head Attr Weights
0:0.08
1:0.03
2:0.07
3:0.05
4:0.03
5:0.06
6:0.19
7:0.04
8:0.09
9:0.22
10:0.03
11:0.04
Negative Logits
fram
-3.51
iy
-3.45
Dund
-3.45
NG
-3.35
ð
-3.34
Yug
-3.31
maths
-3.30
oga
-3.28
Leicester
-3.22
letal
-3.21
POSITIVE LOGITS
Pepper
9.18
pepper
8.61
peppers
8.09
Pe
4.63
chili
4.47
Lemon
4.46
Sauce
4.36
sauces
4.14
Melvin
4.10
Tomato
4.08
Activations Density 0.005%