INDEX
Explanations
significant numerical values, particularly years and monetary amounts
New Auto-Interp
Negative Logits
777
-0.15
owe
-0.14
oga
-0.14
åŃĹå¹ķ
-0.14
ãĤ¦ãĥĪ
-0.14
uyên
-0.13
fame
-0.13
·¸
-0.13
amb
-0.13
owell
-0.13
POSITIVE LOGITS
å¹´
0.25
å¹´ãģ®
0.20
marks
0.20
ëħĦëıĦ
0.20
ëħĦ
0.20
yılı
0.19
ëħĦ
0.19
欧
0.19
saw
0.19
å¹´çļĦ
0.18
Activations Density 0.085%