INDEX
Explanations
mentions of the letter 'B' associated with genetic or biological terms
New Auto-Interp
Negative Logits
"):
-0.92
verſ
-0.89
iſt
-0.84
)");
-0.82
"):
-0.81
―――――
-0.81
ſind
-0.79
ſelf
-0.79
muſt
-0.79
ſever
-0.77
POSITIVE LOGITS
B
2.88
B
2.64
b
1.95
getB
1.88
getB
1.59
b
1.59
bB
1.35
B
1.33
Б
1.30
ب
1.29
Activations Density 0.250%