INDEX
Explanations
words related to people's names
the end-of-text token and variations of the name "Alexey."
New Auto-Interp
Negative Logits
mint
-0.79
istani
-0.77
ULAR
-0.72
atoon
-0.71
ãĥķãĤ©
-0.63
Present
-0.62
ivity
-0.62
atchewan
-0.61
ï
-0.61
EED
-0.61
POSITIVE LOGITS
ewitness
1.11
kj
0.86
outube
0.80
er
0.80
oshi
0.79
ield
0.78
giene
0.78
alty
0.75
estinal
0.74
esy
0.73
Activations Density 0.031%