INDEX
Explanations
verbs indicating contemplation or consideration
phrases that initiate questions or address the reader's curiosity
New Auto-Interp
Negative Logits
nown
-0.71
©¶æ¥µ
-0.67
imar
-0.66
realise
-0.64
ÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤ
-0.64
pex
-0.64
qua
-0.62
udeb
-0.62
%.
-0.62
tu
-0.61
POSITIVE LOGITS
warr
0.66
anything
0.64
dessert
0.63
exclus
0.62
inspiration
0.62
yourself
0.62
finer
0.60
throne
0.60
athlet
0.60
caffeine
0.58
Activations Density 0.359%