INDEX
Explanations
phrases indicating a need for improvement or enhancement
Negative Logits
Token           Logit
nown            -0.72
azar            -0.70
abal            -0.67
umbered         -0.65
vantage         -0.62
Associ          -0.62
emies           -0.60
RANT            -0.60
uthor           -0.60
aneously        -0.60
Positive Logits
Token           Logit
refres          1.00
urgently        0.88
HELP            0.88
help            0.82
patience        0.79
assurances      0.78
reinforcement   0.77
clarification   0.75
assurance       0.74
luck            0.74
Activations Density: 0.150%