INDEX
Explanations
comparative phrases emphasizing depth or complexity
comparisons suggesting complexity or depth
New Auto-Interp
Negative Logits
ella
-0.75
taboola
-0.75
¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯
-0.74
âĹ¼
-0.74
Winged
-0.71
ECT
-0.69
soType
-0.69
ALLY
-0.68
Used
-0.66
ãģ®éŃĶ
-0.66
POSITIVE LOGITS
mere
1.21
merely
1.11
simply
1.10
just
0.91
superficial
0.89
meets
0.87
simple
0.87
brute
0.80
purely
0.80
ordinary
0.78
Activations Density 0.129%