INDEX
Explanations
definitions using relative clauses
New Auto-Interp
Negative Logits
другие
1.00
Он
0.90
После
0.88
других
0.84
Как
0.84
други
0.84
это
0.83
ವಾರು
0.83
respectivas
0.81
Sebagai
0.81
POSITIVE LOGITS
that
2.73
whose
2.60
which
2.56
που
2.33
that
2.31
which
2.27
الذي
2.25
که
2.22
التي
2.17
whose
2.10
Activations Density 0.219%