INDEX
Explanations
instances of teaching and educational roles
New Auto-Interp
Negative Logits
itan
-0.16
znik
-0.15
ukan
-0.15
buster
-0.15
busters
-0.14
лиÑĪ
-0.14
asso
-0.14
inspace
-0.14
rh
-0.13
ãĤ·
-0.13
POSITIVE LOGITS
μεν
0.15
bron
0.15
Coc
0.14
ipo
0.14
Encoded
0.14
FI
0.14
ÏħÏĦÏĮ
0.14
ırı
0.13
respected
0.13
427
0.13
Activations Density 0.048%