INDEX
Explanations
names of individuals, particularly focusing on Japanese names like "Kawashima"
references to prominent individuals or notable names
New Auto-Interp
Negative Logits
istically
-0.74
xual
-0.74
pmwiki
-0.72
llor
-0.71
owder
-0.70
udic
-0.69
Murdoch
-0.69
vain
-0.68
Flight
-0.68
ugu
-0.67
POSITIVE LOGITS
asaki
1.07
Kaw
1.05
©¶æ¥µ
0.98
enne
0.95
aii
0.94
eco
0.86
apon
0.82
ota
0.81
halla
0.80
KEN
0.77
Activations Density 0.010%