INDEX
Explanations
the use of conjunctions or other connecting words in the text
New Auto-Interp
Head Attr Weights
0:0.09
1:0.08
2:0.08
3:0.08
4:0.07
5:0.08
6:0.08
7:0.09
8:0.08
9:0.09
10:0.07
11:0.08
Negative Logits
apples
-2.87
tofu
-2.77
othy
-2.71
Cooking
-2.70
hao
-2.69
Kung
-2.58
murd
-2.56
hunts
-2.56
Cheng
-2.53
bung
-2.53
POSITIVE LOGITS
elect
2.83
Mat
2.78
iov
2.66
Nev
2.57
Lieberman
2.51
stood
2.47
ROM
2.44
secure
2.43
idem
2.42
Ev
2.38
Activations Density 0.000%