INDEX
Explanations
proper nouns or names containing the letters "ko"
references to the name "Ko"
New Auto-Interp
Negative Logits
glass
-0.76
ب
-0.73
Ö¼
-0.72
Creed
-0.71
ingham
-0.71
narrator
-0.70
senal
-0.67
à¨
-0.65
天
-0.65
ibly
-0.63
POSITIVE LOGITS
zzi
1.21
pport
0.96
osta
0.95
essler
0.94
pper
0.93
jo
0.92
ppa
0.90
pps
0.90
zy
0.89
unin
0.89
Activations Density 0.025%