INDEX
Explanations
phrases that express opinions on clarity and transparency in communication
New Auto-Interp
Negative Logits
nemlig
-0.52
なんですよ
-0.51
なのです
-0.50
totiž
-0.49
يكب
-0.48
beginnetje
-0.48
んだよ
-0.45
なのだ
-0.45
んだよね
-0.44
orsese
-0.43
POSITIVE LOGITS
obvious
2.54
obvious
2.32
obviously
2.10
obviously
2.08
Obvious
2.03
obvio
1.91
Obviously
1.86
Obviously
1.80
duh
1.72
duh
1.67
Activations Density 0.405%