Explanations
mentions of the name "Sid."
Negative Logits
iola     -0.20
asco     -0.16
kettle   -0.15
ÃĹ↵↵     -0.15
site     -0.15
imid     -0.14
maal     -0.14
fetch    -0.14
table    -0.14
idge     -0.14
Positive Logits
ney      0.22
eways    0.19
har      0.18
NEY      0.18
este     0.18
ней      0.17
hir      0.17
kick     0.17
à¥įध     0.16
arat     0.16
Activations Density: 0.010%