INDEX
Explanations
names, specifically last names
occurrences of the suffix 'ini' used in various contexts
New Auto-Interp
Negative Logits
¥µ
-0.77
é¾įå¥ij士
-0.71
ĻĤ
-0.70
spin
-0.68
deck
-0.65
friends
-0.65
pages
-0.64
shows
-0.64
worthy
-0.63
names
-0.63
POSITIVE LOGITS
ini
1.02
zzle
1.00
zzo
0.94
emi
0.94
etta
0.90
zzi
0.90
opsis
0.88
aga
0.88
ya
0.87
qi
0.86
Activations Density 0.010%