INDEX
Explanations
references to specific individuals named Ken
Negative Logits
Token     Logit
oling     -0.07
tố        -0.07
encer     -0.07
indexes   -0.07
erot      -0.07
illery    -0.07
eyn       -0.07
cela      -0.06
itesse    -0.06
lando     -0.06
Positive Logits
Token     Logit
yon       0.09
zie       0.08
ya        0.08
worthy    0.07
zik       0.07
igma      0.07
yal       0.06
mare      0.06
485       0.06
ran       0.06
Activations Density: 0.011%