Explanations
references to budget cuts
Negative Logits
Token         Logit
myſelf        -1.96
itſelf        -1.76
whoſe         -1.69
Jefus         -1.65
Monfieur      -1.65
Efq           -1.64
doubtnut      -1.63
themſelves    -1.62
pleaſure      -1.62
Majefty       -1.60
Positive Logits
Token         Logit
(not shown)   0.96
=             0.95
=             0.92
(             0.90
-             0.83
I             0.80
'             0.78
.             0.78
F             0.76
(not shown)   0.75
Activations Density 0.430%