INDEX
Explanations
phrases related to comparisons and contrasts
New Auto-Interp
Negative Logits
827
-0.16
amp
-0.16
ias
-0.15
chn
-0.15
i
-0.14
798
-0.14
478
-0.14
-0.13
looph
-0.13
cel
-0.13
POSITIVE LOGITS
ones
0.19
others
0.16
abbo
0.15
edla
0.15
ÐIJÑĢÑħÑĸв
0.15
regor
0.15
egrator
0.15
ëĿ½
0.14
iyim
0.14
usual
0.14
Activations Density 0.097%