INDEX
Explanations
conjunctions and relative pronouns
New Auto-Interp
Negative Logits
собенно
0.49
hatta
0.39
Особенно
0.39
এমনকি
0.39
尤其是
0.39
thậm
0.39
특히
0.37
甚至
0.36
genden
0.36
znovu
0.36
POSITIVE LOGITS
ซึ่ง
0.61
because
0.59
потому
0.58
which
0.57
ซึ่ง
0.57
which
0.56
Because
0.55
because
0.55
பொதுவாக
0.53
což
0.51
Activations Density 0.387%