INDEX
Explanations
phrases indicating statistical or numerical values
New Auto-Interp
Negative Logits
keď
-0.52
{}".-0.50
protože
-0.48
wenn
-0.47
puisque
-0.46
whereas
-0.46
dass
-0.45
if
-0.45
ieważ
-0.45
perché
-0.44
POSITIVE LOGITS
habits
0.97
Wikimedijinoj
0.95
ients
0.92
stood
0.91
BeginInit
0.90
bited
0.89
følgelig
0.86
estekak
0.86
etheless
0.84
pires
0.82
Activations Density 0.332%