INDEX
Explanations
questions directed at someone to explain or provide information
instances of direct address or questions directed at the audience
New Auto-Interp
Negative Logits
phrine
-0.77
BuyableInstoreAndOnline
-0.71
ikan
-0.62
Wikimedia
-0.61
mathemat
-0.61
sharing
-0.61
ITED
-0.61
itement
-0.60
untarily
-0.59
externalToEVAOnly
-0.59
POSITIVE LOGITS
why
0.72
auga
0.72
goodbye
0.71
how
0.70
something
0.68
çīĪ
0.67
plainly
0.66
beforehand
0.66
about
0.65
dn
0.65
Activations Density 0.037%