INDEX
Explanations
phrases related to challenges and improvements
New Auto-Interp
Negative Logits
Less
-0.17
Less
-0.15
ÅĽÄĩ
-0.15
rosso
-0.14
moins
-0.14
Lesser
-0.14
-less
-0.13
olsun
-0.13
ë¥
-0.13
least
-0.13
POSITIVE LOGITS
even
0.81
further
0.68
even
0.68
EVEN
0.66
still
0.61
yet
0.59
Even
0.56
Even
0.56
jeszcze
0.55
еÑīе
0.54
Activations Density 0.252%