INDEX
Explanations
verb or adjective followed by preposition
New Auto-Interp
Negative Logits
और
0.80
وغ
0.75
֩
0.75
そして
0.75
এবং
0.74
وع
0.74
ঃ
0.71
અને
0.71
এবং
0.70
ು
0.68
POSITIVE LOGITS
.].
1.29
.).
1.29
."
1.24
].
1.23
.</
1.21
.}
1.21
unless
1.20
.]
1.19
."""
1.18
?).
1.18
Activations Density 0.121%