Explanations
instances of the phrase "bos"
Negative Logits
Бахар     -0.51
Co        -0.48
ma        -0.45
都在      -0.45
estimés   -0.45
pre       -0.44
R         -0.42
New       -0.42
"         -0.42
St        -0.42
Positive Logits
poffible  0.86
ſche      0.85
Jefus     0.84
purpoſe   0.82
raiſ      0.81
poffe     0.81
juſ       0.80
faſt      0.80
ſtate     0.79
myſelf    0.79
Activations Density: 0.593%