INDEX
Explanations
names of people and locations
proper nouns related to people and institutions
New Auto-Interp
Negative Logits
BuyableInstoreAndOnline
-0.73
ãĤ¯
-0.72
cast
-0.69
casting
-0.69
ggle
-0.68
ĪĴ
-0.66
ãĤ¦
-0.65
oxide
-0.64
omaly
-0.64
Ïĥ
-0.64
POSITIVE LOGITS
intosh
0.99
donald
0.91
enzie
0.88
artney
0.84
rons
0.79
icut
0.78
ermott
0.76
ufact
0.75
awar
0.75
worth
0.74
Activations Density 0.012%