INDEX
Explanations
the letter 'Q' and phrases with significance or impact
New Auto-Interp
Head Attr Weights
0:0.08
1:0.09
2:0.07
3:0.06
4:0.07
5:0.09
6:0.08
7:0.07
8:0.07
9:0.08
10:0.08
11:0.09
Negative Logits
aeda
-3.02
20439
-2.96
submit
-2.90
�
-2.78
�
-2.77
プ
-2.77
ン
-2.76
�
-2.57
�
-2.55
ه
-2.54
POSITIVE LOGITS
Hopkins
2.63
Cock
2.57
Aber
2.52
brill
2.49
cock
2.49
CLS
2.46
etsk
2.43
Brut
2.43
loft
2.42
gent
2.41
Activations Density 0.000%