INDEX
Explanations
phrases indicating a sense of deserving or merit
New Auto-Interp
Negative Logits
Marín
-0.46
hulls
-0.39
Gupta
-0.38
Đi
-0.37
nodeList
-0.37
gangsters
-0.36
Straß
-0.35
principal
-0.35
kayaks
-0.35
pani
-0.35
POSITIVE LOGITS
deserve
1.89
deserves
1.77
deserved
1.47
deserving
1.34
deserved
1.31
mérit
1.14
merece
1.09
mérite
1.03
merec
0.97
worthy
0.95
Activations Density 0.009%