INDEX
Explanations
proper nouns or names, particularly related to people
repeated names or references to specific individuals in the text
New Auto-Interp
Negative Logits
Aad
-0.71
Bie
-0.69
Maya
-0.68
thing
-0.68
FACE
-0.68
âĢ¢âĢ¢
-0.66
Afgh
-0.65
answ
-0.64
bubble
-0.64
stink
-0.63
POSITIVE LOGITS
Berger
1.99
Macron
1.88
alle
1.76
Robertson
1.34
Rou
1.24
LC
1.21
olin
1.21
oton
1.20
Rogue
1.09
rift
1.08
Activations Density 0.034%