INDEX
Explanations
phrases related to conditional statements and contrasting situations
New Auto-Interp
Head Attr Weights
0:0.24
1:0.04
2:0.05
3:0.07
4:0.05
5:0.03
6:0.06
7:0.17
8:0.02
9:0.07
10:0.12
11:0.03
Negative Logits
�
-4.05
Krypt
-3.65
BCE
-3.62
hipp
-3.47
poppy
-3.40
hemp
-3.34
glyphosate
-3.25
Grateful
-3.23
psychedel
-3.20
Monsanto
-3.20
POSITIVE LOGITS
Irma
6.44
Naples
4.82
Brow
3.73
iami
3.71
Miami
3.52
Miami
3.52
nton
3.51
Pens
3.45
Maria
3.40
Maria
3.40
Activations Density 0.000%