INDEX
Explanations
the presence of the suffix "inz" and related variations
New Auto-Interp
Negative Logits
rientation
-0.14
ataires
-0.14
ipy
-0.14
erot
-0.14
nek
-0.14
erval
-0.14
ivery
-0.14
å³
-0.14
irth
-0.14
udget
-0.14
POSITIVE LOGITS
eln
0.23
ylinder
0.18
s
0.17
linger
0.17
elm
0.17
yl
0.16
ç¿Ķ
0.15
ather
0.15
els
0.14
ehler
0.14
Activations Density 0.002%