INDEX
Explanations
proper nouns, specifically names
references to particular individuals, especially those named Mau and Piet
New Auto-Interp
Negative Logits
angers
-0.84
Interstitial
-0.81
nikov
-0.81
ãģį
-0.79
enegger
-0.77
é¾į
-0.75
é¾įå
-0.74
OUGH
-0.69
ãĤ¤ãĥĪ
-0.68
ãĤ¼
-0.68
POSITIVE LOGITS
Mau
1.11
ertodd
0.96
Piet
0.90
aeper
0.82
iflower
0.82
catentry
0.81
pillar
0.80
vine
0.79
leted
0.76
edIn
0.76
Activations Density 0.012%