INDEX
Explanations
specific proper nouns and names, particularly related to individuals
Head Attr Weights
0:0.06
1:0.07
2:0.13
3:0.04
4:0.03
5:0.04
6:0.05
7:0.03
8:0.03
9:0.04
10:0.40
11:0.03
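
The head attribution list above can be read programmatically. The following is a minimal sketch, assuming each entry is a `head:weight` pair as shown; the helper name `parse_head_weights` is hypothetical and not part of any dashboard tooling. It picks out the head with the largest weight and sums the listed values.

```python
# Head attribution weights copied verbatim from the list above ("head:weight").
RAW_HEAD_WEIGHTS = """0:0.06
1:0.07
2:0.13
3:0.04
4:0.03
5:0.04
6:0.05
7:0.03
8:0.03
9:0.04
10:0.40
11:0.03"""


def parse_head_weights(text):
    """Parse 'head:weight' lines into a {head_index: weight} dict (hypothetical helper)."""
    weights = {}
    for line in text.splitlines():
        head, weight = line.split(":")
        weights[int(head)] = float(weight)
    return weights


weights = parse_head_weights(RAW_HEAD_WEIGHTS)

# Head with the largest attribution weight.
top_head = max(weights, key=weights.get)

# Sum of the listed (rounded) weights.
total = sum(weights.values())

print(f"top head: {top_head} (weight {weights[top_head]:.2f})")
print(f"sum of listed weights: {total:.2f}")
```

With the values listed, head 10 carries the largest weight (0.40), and the twelve displayed weights sum to about 0.95; the shortfall from 1 may simply reflect rounding in the displayed values.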
Negative Logits
inyl: -2.34
dos: -2.23
hol: -2.23
ials: -2.16
isphere: -2.10
perty: -2.04
Eternity: -2.02
Golem: -1.99
thing: -1.98
Dy: -1.95
Positive Logits
Scott: 3.94
Scott: 3.84
Scot: 2.57
Walker: 2.48
Wall: 2.43
Walker: 2.40
Wallace: 2.34
Stuart: 2.34
displayText: 2.26
CAP: 2.26
Activations Density 0.000%