INDEX
Explanations
concepts related to honor, particularly in contexts of recognition and valor
Negative Logits
parsedMessage   -0.71
dedans          -0.69
Saltar          -0.69
passés          -0.61
kuh             -0.61
âgé             -0.61
coûte           -0.60
GARET           -0.59
yonder          -0.59
πάντα           -0.57
Positive Logits
Honor           1.34
honor           1.34
Honor           1.28
HONOR           1.23
honor           1.17
honour          1.17
hon             1.04
Honour          1.01
Hon             0.98
HON             0.97
Activations Density: 0.021%