INDEX
Explanations
phrases related to situations of legal and personal accountability
New Auto-Interp
Negative Logits
Efq
-1.11
Monfieur
-0.95
LookAnd
-0.88
Jefus
-0.85
myſelf
-0.84
fubject
-0.82
poffible
-0.81
Gave
-0.80
ſtate
-0.79
whoſe
-0.78
POSITIVE LOGITS
be
0.59
is
0.58
reportedly
0.57
are
0.57
initially
0.52
actually
0.52
ultimately
0.50
currently
0.49
generally
0.48
donc
0.48
Activations Density 0.799%