INDEX
Explanations
numerical values and their contextual significance
Negative Logits
ernet      -0.17
drv        -0.15
essler     -0.15
-anchor    -0.15
à¸Ķ        -0.15
ãĥ¼ãĥĨ     -0.14
Poly       -0.14
åĺĽ        -0.14
-widgets   -0.14
ayi        -0.14
Positive Logits
ipa        0.15
agr        0.14
PY         0.14
upa        0.14
æĬ¥åijĬ     0.14
ylko       0.14
еÑĪ        0.14
utenberg   0.13
/tos       0.13
ÌĨ         0.13
Activations Density 0.003%