INDEX
Explanations
mentions of the name "Graham" in various contexts
New Auto-Interp
Head Attr Weights
0:0.07
1:0.03
2:0.24
3:0.09
4:0.07
5:0.08
6:0.03
7:0.04
8:0.04
9:0.18
10:0.06
11:0.03
Negative Logits
UTERS
-1.46
abytes
-1.32
selves
-1.30
imeters
-1.29
illions
-1.24
guiActiveUnfocused
-1.24
iatus
-1.22
foundland
-1.20
axy
-1.20
UGE
-1.18
POSITIVE LOGITS
wine
1.25
bold
1.20
espie
1.19
Hansen
1.13
eele
1.13
heit
1.11
adam
1.10
bert
1.09
confid
1.08
Welch
1.06
Activations Density 0.007%