INDEX
Explanations
proper nouns or names of specific entities
instances of the verb "to be" in various forms
Negative Logits
Token        Logit
Inventory    -0.67
ctive        -0.67
ionics       -0.67
retty        -0.67
athy         -0.66
erald        -0.66
Guys         -0.63
Puzzles      -0.63
lex          -0.60
Passive      -0.60
Positive Logits
Token          Logit
supposed       1.11
purportedly    1.10
supposedly     1.09
incidentally   1.07
subsequently   1.04
allegedly      1.01
purported      0.98
ostensibly     0.97
overseen       0.92
besie          0.91
Activations Density: 0.164%