INDEX
Explanations
statements or quotes from individuals, particularly those that express feelings or opinions
New Auto-Interp
Head Attr Weights
0:0.18
1:0.12
2:0.07
3:0.05
4:0.04
5:0.02
6:0.08
7:0.10
8:0.02
9:0.05
10:0.16
11:0.04
Negative Logits
Shell
-3.18
Delta
-3.17
Tank
-2.96
scroll
-2.86
Snake
-2.62
Sar
-2.60
�
-2.58
aghd
-2.56
displayText
-2.54
Delta
-2.53
POSITIVE LOGITS
Fel
9.42
Fel
8.40
fel
5.64
Felix
5.14
Fidel
3.85
Fang
3.71
Maur
3.67
Ele
3.58
Calder
3.46
Fant
3.43
Activations Density 0.010%