INDEX
Explanations
personal names and entities in the text
New Auto-Interp
Negative Logits
itſelf
-1.04
pleaſure
-0.91
myſelf
-0.85
ſmall
-0.77
Conſ
-0.75
Monfieur
-0.75
fhew
-0.74
ſche
-0.73
Diſ
-0.71
ſever
-0.71
POSITIVE LOGITS
Charles
0.77
("%.0.74
Edward
0.73
George
0.73
PhysRev
0.70
djangoproject
0.70
Robert
0.70
Henry
0.69
William
0.69
Ed
0.68
Activations Density 0.897%