INDEX
Explanations
personal pronouns and their variations in context
Head Attr Weights
0:0.20
1:0.20
2:0.04
3:0.05
4:0.05
5:0.10
6:0.03
7:0.05
8:0.06
9:0.06
10:0.04
11:0.06
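The listing above can be read programmatically. The following is a minimal sketch, assuming each line has the form `head_index:weight`; the helper names `parse_head_weights` and `top_heads` are hypothetical and not part of any dashboard API.

```python
# Head attribution weights copied verbatim from the listing above.
# Assumption: each line is "head_index:weight".
RAW = """0:0.20
1:0.20
2:0.04
3:0.05
4:0.05
5:0.10
6:0.03
7:0.05
8:0.06
9:0.06
10:0.04
11:0.06"""


def parse_head_weights(text):
    """Parse 'head:weight' lines into a {head_index: weight} dict."""
    weights = {}
    for line in text.splitlines():
        head, _, value = line.strip().partition(":")
        if head and value:
            weights[int(head)] = float(value)
    return weights


def top_heads(weights, k=3):
    """Return the k (head, weight) pairs with the largest weight.

    Ties are broken by the lower head index.
    """
    return sorted(weights.items(), key=lambda kv: (-kv[1], kv[0]))[:k]


weights = parse_head_weights(RAW)
total = sum(weights.values())
ranked = top_heads(weights)

print(ranked)           # heads with the largest weights
print(round(total, 2))  # sum of the listed (rounded) weights
```

With the values listed, heads 0 and 1 tie for the largest weight at 0.20 each, followed by head 5 at 0.10, and the twelve rounded weights sum to 0.94.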
Negative Logits
Marginal: -1.55
minist: -1.44
Says: -1.42
said: -1.39
��: -1.38
¶: -1.37
Showtime: -1.37
Stuff: -1.37
Peel: -1.35
HEAD: -1.34
Positive Logits
rusty: 1.66
enium: 1.37
athan: 1.35
ivan: 1.33
aru: 1.33
is: 1.31
iron: 1.31
reminis: 1.31
thal: 1.28
ambul: 1.28
Activations Density 0.005%