INDEX
Explanations
names of individuals
proper nouns, particularly names of people
New Auto-Interp
Negative Logits
LEASE
-0.77
Michigan
-0.71
Colossus
-0.71
IUM
-0.71
Westbrook
-0.69
CLASSIFIED
-0.67
Ohio
-0.67
GGGGGGGG
-0.66
EEE
-0.64
Bloom
-0.64
POSITIVE LOGITS
oub
1.01
awar
0.98
aya
0.96
ulla
0.94
iani
0.93
hani
0.92
ibi
0.92
ij
0.91
angan
0.90
abis
0.89
Activations Density 0.205%