INDEX
Explanations
phrases related to evaluations or judgments
terms related to justification and validity in contexts involving decisions or statements
Negative Logits
domest      -0.74
kindred     -0.68
indo        -0.67
oppressed   -0.64
etus        -0.64
resil       -0.61
decad       -0.60
ut          -0.60
plent       -0.60
everyday    -0.59
Positive Logits
due          1.04
owing        0.98
because      0.93
due          0.93
because      0.91
citing       0.82
considering  0.81
despite      0.79
pending      0.78
barring      0.77
Activations Density 0.416%