INDEX
Explanations
words related to excuses or justifications
terms related to justifications or rationalizations, particularly focusing on "excuses."
Negative Logits
Token        Logit
emouth       -0.80
acid         -0.80
marks        -0.78
mark         -0.78
itals        -0.77
tein         -0.77
ipeg         -0.75
utenberg     -0.73
weeney       -0.72
semble       -0.71
Positive Logits
Token          Logit
excuse         1.13
excuses        1.04
why            1.02
WHY            1.00
abl            0.87
rationale      0.84
explanations   0.79
justifying     0.78
explanation    0.78
why            0.76
Activations Density 0.056%