INDEX
Explanations
words conveying reasoning, communication and agreement
say "that"
New Auto-Interp
Negative Logits
which
-1.34
Which
-1.29
Which
-1.15
WHICH
-1.04
quelles
-0.96
οποία
-0.93
laquelle
-0.90
οποίο
-0.85
quels
-0.84
hich
-0.81
POSITIVE LOGITS
that
2.20
rằng
0.95
bahwa
0.88
że
0.72
ότι
0.72
propOrder
0.66
कि
0.63
multirow
0.62
kwamba
0.60
mà
0.58
Activations Density 0.694%