Explanations
web links and specific references embedded in the text
references to organizations and information sources
Negative Logits
Token        Logit
Yugoslavia   -0.56
unforeseen   -0.55
exagger      -0.54
balk         -0.54
unintended   -0.52
Bai          -0.52
ordinate     -0.52
aunted       -0.51
forbid       -0.51
utters       -0.50
Positive Logits
Token        Logit
HERE         1.64
below        1.29
here         1.28
https        1.19
http         1.12
http         1.10
below        1.10
https        1.08
www          1.06
online       0.94
Activations Density: 0.550%