INDEX
Explanations
personal pronouns and verbs related to following or supporting a person
repeated references to a specific individual
Negative Logits
reach       -0.68
Limit       -0.68
Carrie      -0.67
Siege       -0.66
Pastebin    -0.65
ornia       -0.64
Deal        -0.63
Molly       -0.63
Sara        -0.63
Start       -0.63
Positive Logits
redes       0.79
tremend     0.77
atically    0.77
conduc      0.75
detractors  0.75
atic        0.74
personally  0.72
enthusi     0.71
orally      0.69
occas       0.69
Activations Density: 0.070%