INDEX
Explanations
parentheses and their contents
New Auto-Interp
Head Attr Weights
0:0.07
1:0.07
2:0.07
3:0.08
4:0.07
5:0.07
6:0.09
7:0.09
8:0.09
9:0.07
10:0.09
11:0.08
Negative Logits
chieve
-2.74
paio
-2.68
ende
-2.65
quit
-2.60
.</
-2.56
Chero
-2.54
unte
-2.52
witch
-2.50
�
-2.50
interstate
-2.49
POSITIVE LOGITS
Sonia
2.71
JJ
2.66
Musk
2.65
Jinn
2.65
CLS
2.63
Eggs
2.58
LSD
2.58
champagne
2.55
Klein
2.55
Hawking
2.51
Activations Density 0.000%