INDEX
Explanations
terms related to contrasting elements or co-existing entities
conjunctions and comparative phrases
New Auto-Interp
Negative Logits
uploads
-0.51
xious
-0.48
alike
-0.48
alde
-0.47
doms
-0.46
ãĥ³ãĤ¸
-0.44
è¦
-0.43
psc
-0.43
bers
-0.43
ersen
-0.43
POSITIVE LOGITS
an
1.38
a
1.37
another
1.11
someone
1.05
someone
0.94
something
0.94
somebody
0.92
another
0.86
a
0.84
an
0.81
Activations Density 1.300%