INDEX
Explanations
phrases indicating a decision or conclusion
instances of the word "ruled" or its variations, often in a legal or declarative context
New Auto-Interp
Negative Logits
issance
-0.85
illin
-0.76
velength
-0.74
akra
-0.72
Newsletter
-0.72
ufact
-0.70
ription
-0.70
ionage
-0.68
ickr
-0.68
sqor
-0.65
POSITIVE LOGITS
phas
0.80
ihad
0.70
inker
0.69
maker
0.69
unfit
0.67
uled
0.65
against
0.65
differently
0.64
decisively
0.64
out
0.63
Activations Density 0.024%