INDEX
Explanations
references to legal terminologies related to justice and fair treatment
phrases related to the concept of due process
Negative Logits
Token       Logit
sat         -0.69
Seymour     -0.69
Offline     -0.68
Hots        -0.68
adia        -0.65
Sheep       -0.64
Sek         -0.63
anon        -0.62
Wolfgang    -0.61
flo         -0.61
Positive Logits
Token       Logit
diligence   1.47
lling       1.12
dilig       0.90
giving      0.90
lled        0.88
process     0.85
process     0.83
notice      0.83
respect     0.81
allowance   0.81
Activations Density 0.028%