INDEX
Explanations
proper nouns related to political figures or organizations
names of prominent individuals and organizations
New Auto-Interp
Negative Logits
interstitial
-0.60
ãĥ¼ãĥĨãĤ£
-0.53
Els
-0.49
è¦ļéĨĴ
-0.49
ãĥ¼ãĥĨ
-0.48
代
-0.47
guiName
-0.47
*.
-0.47
asta
-0.47
Cth
-0.46
POSITIVE LOGITS
could
0.62
opted
0.62
has
0.60
spokesman
0.60
insisted
0.59
couldn
0.59
intends
0.59
responded
0.59
had
0.58
chose
0.58
Activations Density 0.980%