Explanations
phrases related to recognizing, acknowledging, or accepting something
Negative Logits
Token          Logit
ograp          -0.80
quer           -0.73
teasp          -0.73
ciating        -0.71
agues          -0.70
aspx           -0.68
ä½ľ            -0.67
idav           -0.64
subur          -0.62
description    -0.62
Positive Logits
Token             Logit
that              1.08
realities         1.05
how               0.97
shortcomings      0.90
deficiencies      0.89
reality           0.88
responsibility    0.88
firsthand         0.87
flaws             0.86
weaknesses        0.83
Activations Density 0.201%