INDEX
Explanations
mentions of the name "Nathan."
New Auto-Interp
Negative Logits
ัà¸į
-0.19
ofs
-0.18
verted
-0.17
uchs
-0.16
tee
-0.16
atto
-0.16
reta
-0.15
hek
-0.15
lund
-0.15
ucher
-0.15
POSITIVE LOGITS
ial
0.31
aniel
0.29
alie
0.28
645
0.21
iele
0.20
IEL
0.19
iel
0.19
iales
0.18
@nate
0.18
anson
0.16
Activations Density 0.008%