INDEX
Explanations
words related to people, such as verbs like 'are' and 'were'
the verb "to be" in different forms and contexts
New Auto-Interp
Negative Logits
pedia
-0.75
imation
-0.70
ricks
-0.68
Geh
-0.63
ooters
-0.62
Footnote
-0.62
goodbye
-0.62
fail
-0.62
cancellation
-0.61
wake
-0.60
POSITIVE LOGITS
tein
0.82
fluent
0.73
dinand
0.72
supposed
0.70
held
0.67
subscribed
0.67
behold
0.67
married
0.67
caught
0.66
blinded
0.66
Activations Density 0.186%