INDEX
Explanations
phrases related to imagining scenarios or events
conjectures or hypothetical scenarios
New Auto-Interp
Negative Logits
isphere
-0.60
Kings
-0.55
canopy
-0.54
stuff
-0.52
Cologne
-0.52
Euph
-0.52
Spoiler
-0.52
oster
-0.52
Gran
-0.52
Gab
-0.51
POSITIVE LOGITS
govtrack
0.61
ļé
0.59
ultane
0.57
alternatively
0.56
secondly
0.56
erest
0.55
anwhile
0.55
vertisement
0.54
any
0.54
lesbians
0.53
Activations Density 1.079%