INDEX
Explanations
the name "Ronaldini"
instances of a particular name or phrase related to a person
New Auto-Interp
Negative Logits
ĻĤ
-0.75
¥µ
-0.74
worthy
-0.72
pages
-0.70
steps
-0.67
friends
-0.67
names
-0.65
¥
-0.64
groups
-0.63
soph
-0.61
POSITIVE LOGITS
ini
1.30
etta
0.89
etti
0.89
otto
0.87
zzle
0.87
otti
0.86
ptin
0.85
emi
0.83
igne
0.83
eta
0.83
Activations Density 0.004%