INDEX
Explanations
references to humility and its related concepts
New Auto-Interp
Negative Logits
thumbs
-0.16
itto
-0.15
ům
-0.14
helm
-0.14
volley
-0.14
Wire
-0.14
elig
-0.14
å¼ı
-0.13
Helm
-0.13
affirmative
-0.13
POSITIVE LOGITS
humble
0.18
arily
0.17
kker
0.16
Ñģобой
0.14
ardy
0.14
ERRU
0.14
Hum
0.14
ÙĪØ§Ø±
0.14
/simple
0.14
usi
0.14
Activations Density 0.016%