INDEX
Explanations
comparative and superlative adjectives indicative of performance or quality
Negative Logits
eniable
-0.18
sto
-0.16
比較
-0.16
plr
-0.15
比较
-0.15
efon
-0.15
pll
-0.15
atori
-0.14
柏
-0.14
ンチ
-0.14
Positive Logits
than
0.44
Than
0.30
then
0.30
THAN
0.26
than
0.26
_than
0.25
Than
0.24
then
0.20
THEN
0.20
Then
0.19
Activations Density 0.116%
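
Tokens from byte-level BPE vocabularies are often displayed with stand-in characters for raw bytes, so a CJK token can appear as a string such as `æ¯Ķè¼ĥ` instead of readable text. The sketch below assumes the GPT-2 style byte-to-character table (the dashboard does not state which tokenizer it uses, so this is an assumption) and shows how such a surface form can be mapped back to UTF-8. The names `bytes_to_unicode`, `BYTE_DECODER`, and `decode_token` are illustrative, not part of the dashboard.

```python
def bytes_to_unicode():
    """Build a GPT-2 style table mapping each byte value to a display character.

    Assumption: printable bytes keep their own code point, and every other
    byte is assigned the next free code point starting at 256.
    """
    bs = list(range(33, 127)) + list(range(161, 173)) + list(range(174, 256))
    cs = bs[:]
    n = 0
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: display character -> original byte value.
BYTE_DECODER = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(surface: str) -> str:
    """Turn a byte-level surface form back into text (UTF-8 assumed)."""
    raw = bytes(BYTE_DECODER[ch] for ch in surface)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    # Surface forms written with escapes so the exact code points are unambiguous.
    for surface in ("\u00e6\u00af\u0136\u00e8\u00bc\u0125",  # displayed as æ¯Ķè¼ĥ
                    "\u00e3\u0125\u00b3\u00e3\u0125\u0123"):  # displayed as ãĥ³ãĥģ
        print(surface, "->", decode_token(surface))
```

Under this assumption the first surface form decodes to 比較 ("compare"), which fits a feature tied to comparative constructions. A token that is only a fragment of a multi-byte character would decode to a replacement character instead of raising an error.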