INDEX
Explanations
greetings and conversational interjections
New Auto-Interp
Negative Logits
scriptcase
-0.76
useRouter
-0.68
coa
-0.68
Rivière
-0.66
Brasileiro
-0.66
AfterClass
-0.65
oficina
-0.64
Cordialement
-0.64
trebui
-0.63
ówczas
-0.63
POSITIVE LOGITS
Hey
2.20
Hey
2.19
hey
2.09
HEY
1.96
hey
1.84
HEY
1.74
Heywood
1.35
Heya
1.27
嘿
1.11
Hej
1.10
Activations Density 0.028%