INDEX
Explanations
adjectives describing characteristics or qualities of objects or actions
adjectives describing various qualities or states of objects and experiences
Negative Logits
moon      -0.63
¬¼        -0.63
thood     -0.62
xit       -0.62
©¶æ       -0.61
usalem    -0.61
CHA       -0.59
aleb      -0.58
rera      -0.57
bey       -0.57
Positive Logits
compared  0.91
insofar   0.84
nowadays  0.82
sounding  0.81
indeed    0.80
––        0.77
(~        0.75
anyway    0.75
anyways   0.74
looking   0.72
Activations Density: 0.284%