INDEX
Explanations
words related to observation and speculation about actions or events
pronouns and related expressions of perception or belief
New Auto-Interp
Negative Logits
Effective
-0.66
é£
-0.66
Restoration
-0.62
Affordable
-0.61
286
-0.60
Mercenary
-0.60
Chong
-0.60
monog
-0.59
Vi
-0.58
Fug
-0.58
POSITIVE LOGITS
wonder
0.94
wondered
0.88
searched
0.87
watched
0.84
hear
0.84
realise
0.83
wish
0.82
pity
0.81
overlook
0.80
admire
0.80
Activations Density 0.553%