INDEX
Explanations
the word "deserve" or variations of it
phrases indicating entitlement or deservingness
New Auto-Interp
Negative Logits
zyme
-0.71
ula
-0.67
ullivan
-0.64
Briggs
-0.62
azar
-0.62
ki
-0.61
band
-0.61
Ou
-0.60
au
-0.59
kj
-0.58
POSITIVE LOGITS
precedence
0.88
arna
0.82
bage
0.81
applause
0.80
FINE
0.80
Citation
0.75
iour
0.73
justice
0.72
Ô
0.71
deserve
0.71
Activations Density 0.026%