INDEX
Explanations
phrases related to past behavior or performance
phrases indicating a history or track record of actions or behaviors
New Auto-Interp
Negative Logits
ando
-0.75
agers
-0.72
hap
-0.70
tein
-0.69
ulp
-0.68
Stars
-0.68
ateurs
-0.67
ishers
-0.67
ople
-0.66
elsen
-0.66
POSITIVE LOGITS
documented
0.79
dealings
0.77
acqu
0.74
proven
0.69
revolving
0.69
histories
0.68
reliability
0.67
abuser
0.67
conflicts
0.67
penchant
0.67
Activations Density 0.080%