INDEX
Explanations
the conjunction "and" in the text
New Auto-Interp
Head Attr Weights
0:0.09
1:0.08
2:0.08
3:0.09
4:0.08
5:0.06
6:0.07
7:0.08
8:0.08
9:0.08
10:0.08
11:0.08
Negative Logits
Baal
-2.94
bian
-2.58
Jaguars
-2.53
JACK
-2.47
atered
-2.45
Kos
-2.39
Homs
-2.36
Antioch
-2.33
Samson
-2.30
SAL
-2.30
POSITIVE LOGITS
terness
3.03
veyard
2.89
ゼウス
2.88
ioxid
2.74
row
2.60
velength
2.60
diffusion
2.51
endish
2.47
neurot
2.47
aceous
2.46
Activations Density 0.000%