INDEX
Explanations
mentions of the word "best" in various contexts
Negative Logits
Token         Logit
okit          -0.15
ughters       -0.15
woo           -0.15
doch          -0.15
Pis           -0.14
orre          -0.14
oriously      -0.14
ipsis         -0.14
oru           -0.14
uchos         -0.14
Positive Logits
Token         Logit
ilig          0.17
emer          0.16
Cah           0.15
../../../../  0.15
erman         0.14
Assurance     0.14
анк           0.14
tangent       0.14
ep            0.14
RIX           0.13
Activations Density: 0.003%