Explanations
references to the name "Jordan."
Negative Logits
Token      Value
ven        -0.17
kil        -0.17
aud        -0.17
compreh    -0.16
erties     -0.16
unh        -0.15
gu         -0.15
owl        -0.14
dür        -0.14
Sdk        -0.14
Positive Logits
Token      Value
bove       0.17
IAN        0.16
ien        0.16
ians       0.15
ian        0.15
ells       0.15
stown      0.15
503        0.14
.gg        0.14
azzi       0.14
Activations Density: 0.008%