INDEX
Explanations
phrases indicating reasons and justifications, particularly in a legal or formal context
New Auto-Interp
Negative Logits
PYX
-0.75
بوابة
-0.57
جغرافيا
-0.52
готовка
-0.50
Kjelder
-0.49
Дереккөздер
-0.49
참고
-0.48
niająca
-0.47
ISSE
-0.45
erts
-0.44
POSITIVE LOGITS
these
1.09
this
1.06
thefe
0.86
этого
0.82
این
0.81
these
0.79
этих
0.78
queste
0.78
diesen
0.74
těchto
0.74
Activations Density 1.180%