INDEX
Explanations
the occurrences of the letter 'b' and its variations in the text
New Auto-Interp
Negative Logits
>"+
-0.86
"):
-0.74
تح
-0.73
Danilo
-0.72
SuppressMessage
-0.72
Donahue
-0.69
Lire
-0.69
zel
-0.69
verſ
-0.67
}^{+\-0.67
POSITIVE LOGITS
b
1.21
B
1.21
B
1.14
b
1.11
getB
1.08
cB
0.92
bB
0.90
Xb
0.88
xb
0.85
ب
0.85
Activations Density 0.256%