INDEX
Explanations
interrogative statements and responses related to political and diplomatic issues
New Auto-Interp
Head Attr Weights
0:0.04
1:0.07
2:0.06
3:0.03
4:0.18
5:0.19
6:0.06
7:0.04
8:0.05
9:0.14
10:0.04
11:0.04
Negative Logits
kilomet
-2.16
izont
-1.79
symbol
-1.79
photos
-1.71
Symb
-1.62
Pok
-1.62
dot
-1.61
Nem
-1.56
Siber
-1.55
photo
-1.55
POSITIVE LOGITS
answered
2.40
hee
2.13
iott
2.09
Answer
1.99
cheat
1.82
gage
1.82
��極
1.81
answer
1.80
Spending
1.79
Answer
1.78
Activations Density 0.006%