INDEX
Explanations
URL patterns or references in the text
New Auto-Interp
Negative Logits
Efq
-0.98
pleaſure
-0.98
myſelf
-0.92
doubtnut
-0.90
itſelf
-0.89
raiſ
-0.87
Monfieur
-0.86
ſeveral
-0.86
Conſ
-0.85
Haarlem
-0.83
POSITIVE LOGITS
/
1.55
/\
1.04
Mathis
0.99
/=
0.98
iwa
0.94
Rosenthal
0.91
/(\
0.91
Jansen
0.85
Pfeiffer
0.85
-
0.84
Activations Density 0.096%