INDEX
Explanations
names of political figures
proper nouns and names, particularly related to individuals and entities
New Auto-Interp
Negative Logits
Invention
-0.67
olulu
-0.66
cffffcc
-0.65
retty
-0.62
ŃĶ
-0.59
ļéĨĴ
-0.59
};
-0.59
Gloria
-0.56
.�
-0.56
Palest
-0.55
POSITIVE LOGITS
will
0.97
intends
0.93
could
0.93
somehow
0.92
might
0.89
should
0.89
someday
0.88
would
0.88
qualifies
0.86
succeeds
0.85
Activations Density 0.479%