INDEX
Explanations
phrases related to actions or characteristics associated with individuals
conjugated forms of the verb "to be" in various contexts
New Auto-Interp
Negative Logits
oire
-0.82
osate
-0.77
lag
-0.77
ð
-0.72
lished
-0.71
culosis
-0.70
cture
-0.66
ographer
-0.66
poke
-0.65
orthern
-0.64
POSITIVE LOGITS
selves
1.07
themselves
0.99
wolves
0.90
able
0.86
outnumbered
0.85
selves
0.85
interchangeable
0.84
idiots
0.82
supposed
0.80
aware
0.80
Activations Density 0.317%