INDEX
Explanations
adjectives describing high intensity or extremeness
extreme adjectives that describe conditions or situations
New Auto-Interp
Negative Logits
Ô
-0.65
cember
-0.59
zsche
-0.59
Brief
-0.54
chieve
-0.53
ighth
-0.53
ilan
-0.52
livest
-0.52
ãĤ´ãĥ³
-0.51
Vert
-0.50
POSITIVE LOGITS
that
1.40
that
1.24
THAT
1.04
it
0.94
they
0.90
That
0.90
That
0.86
thats
0.83
they
0.75
you
0.70
Activations Density 0.144%