INDEX
Explanations
instances of reported speech or quotations
New Auto-Interp
Negative Logits
nier
-0.17
hurst
-0.16
borough
-0.15
.googlecode
-0.15
ürger
-0.15
etti
-0.15
angelo
-0.14
ubu
-0.14
Trang
-0.14
.setter
-0.14
POSITIVE LOGITS
udo
0.15
doc
0.14
eger
0.14
ÄĽtÅ¡
0.14
Felix
0.14
omen
0.14
Broad
0.13
afari
0.13
neutral
0.13
Tiny
0.13
Activations Density 0.048%