INDEX
Explanations
weaknesses or areas lacking in something
expressions related to deficiencies or lack in various contexts
New Auto-Interp
Negative Logits
è¦ļéĨĴ
-0.72
dr
-0.69
escal
-0.65
Disp
-0.62
flo
-0.61
dry
-0.61
eds
-0.60
pill
-0.60
si
-0.60
offs
-0.59
POSITIVE LOGITS
lust
0.93
luster
0.92
avorite
0.89
patience
0.88
willpower
0.79
adequate
0.77
ocally
0.74
confidence
0.74
empathy
0.73
specificity
0.73
Activations Density 0.027%