Explanations
instances of strong declarative statements or conclusions in discussions
Negative Logits
пожалуйста    -0.52
为止    -0.52
こいつ    -0.49
Vergnügen    -0.48
featureID    -0.48
numerusform    -0.46
ándalo    -0.44
这家伙    -0.43
kenstock    -0.43
颼    -0.43
Positive Logits
referring    0.86
Asked    0.85
Referring    0.84
Referring    0.83
Asked    0.82
pointing    0.73
citing    0.72
noting    0.71
Descri    0.69
speaking    0.67
Activations Density: 0.306%