INDEX
Explanations
attends to the token "ll" from tokens that are ending in "id," excluding statements that lack companion tokens for "ll."
New Auto-Interp
Head Attr Weights
0:0.12
1:0.24
2:0.12
3:0.13
4:0.15
5:0.08
6:0.04
7:0.07
Negative Logits
تقاوى
-0.57
RegressionTest
-0.56
MigrationBuilder
-0.52
:✨
-0.52
IVEREF
-0.51
"..\..\..\
-0.50
sizeCache
-0.49
তথ্যসূত্র
-0.48
adə
-0.47
Réponses
-0.47
POSITIVE LOGITS
last
0.29
true
0.28
<blockquote>
0.27
venir
0.26
coming
0.26
زد
0.26
<i>
0.26
effects
0.25
tài
0.25
more
0.25
Activations Density 0.002%