INDEX
Explanations
proper nouns and specific entities related to organizations, events, and notable individuals
Negative Logits
Dit         -0.15
och         -0.15
able        -0.15
ove         -0.14
Haus        -0.14
au          -0.14
au          -0.14
804         -0.14
buch        -0.13
ickerView   -0.13
Positive Logits
regarding   0.35
concerning  0.29
about       0.26
щодо        0.24
关于        0.22
Regarding   0.21
về          0.21
about       0.20
tentang     0.19
towards     0.19
Activations Density 0.607%