INDEX
Explanations
expressions of doubt or skepticism regarding beliefs
Negative Logits
Ñĸдно       -0.16
ervas       -0.13
ãĥ£         -0.13
Schneider   -0.13
мÑı         -0.13
ÑĪов        -0.12
zl          -0.12
olio        -0.12
MW          -0.12
.annotate   -0.12
Positive Logits
talking     0.92
talk        0.75
Talking     0.69
speaking    0.67
Talking     0.65
-talk       0.62
talk        0.60
talks       0.60
referring   0.59
talked      0.58
Activations Density 0.141%