INDEX
Explanations
mentions of prominent individuals and their affiliations or contributions in various contexts
New Auto-Interp
Negative Logits
ffic
-0.14
ç¿Ķ
-0.14
å¸Ī
-0.14
olarak
-0.14
_mgr
-0.14
ربÛĮ
-0.14
èĥ¸
-0.14
å®ĺ
-0.13
師
-0.13
RAINT
-0.13
POSITIVE LOGITS
another
0.29
former
0.24
fellow
0.22
another
0.18
one
0.18
our
0.17
åı¦ä¸Ģ
0.17
erst
0.17
longtime
0.17
none
0.16
Activations Density 0.241%