INDEX
Explanations
the word "that" in various contexts
New Auto-Interp
Negative Logits
disponibilités
-0.67
ร์
-0.64
er
-0.63
hoga
-0.62
Argo
-0.60
ER
-0.59
ẩu
-0.59
alu
-0.57
cellaneous
-0.57
Melayu
-0.56
POSITIVE LOGITS
that
1.52
that
1.38
THAT
1.08
THAT
1.05
That
0.96
chiunque
0.94
That
0.94
mikä
0.88
bahwa
0.88
they
0.85
Activations Density 0.502%