INDEX
Explanations
the name "Carlo" followed by a number
occurrences of the name "Carlo"
Negative Logits
Token      Logit
manship    -0.85
imental    -0.75
icles      -0.75
ICLE       -0.74
orship     -0.71
iments     -0.70
glass      -0.69
orial      -0.68
chell      -0.67
uality     -0.65
Positive Logits
Token      Logit
zzi         1.14
pper        1.01
fty         1.00
zzle        0.97
ppy         0.95
ven         0.93
oping       0.90
tto         0.88
pping       0.88
ppe         0.87
Activations Density 0.020%