INDEX
Explanations
expressions of personal emotions and concerns
New Auto-Interp
Negative Logits
wikipagina
-0.61
िखित
-0.58
étrie
-0.58
Wikiseite
-0.57
↹
-0.57
újo
-0.57
раздо
-0.56
voici
-0.55
kollek
-0.55
ofire
-0.54
POSITIVE LOGITS
Pristupljeno
0.63
really
0.56
still
0.55
also
0.55
maybe
0.55
definitely
0.54
Also
0.53
хь
0.53
o
0.52
Примітки
0.52
Activations Density 0.326%