INDEX
Explanations
specific references to technology, communication devices, and security concerns
New Auto-Interp
Negative Logits
Diweddarwch
-0.70
zarówno
-0.69
nakalista
-0.63
zowel
-0.63
titolata
-0.60
především
-0.59
appunto
-0.59
fjspx
-0.59
期刊论文
-0.58
gekomen
-0.58
POSITIVE LOGITS
while
0.90
whilst
0.78
mientras
0.75
WHILE
0.72
pretending
0.71
sambil
0.66
lmao
0.65
enquanto
0.65
every
0.64
mientras
0.63
Activations Density 0.987%