INDEX
Explanations
references to legal or criminal activities
Negative Logits
dissu        -0.76
lihood       -0.75
xual         -0.66
overth       -0.65
discourage   -0.64
next         -0.63
undecided    -0.63
someday      -0.63
outweigh     -0.63
offline      -0.61
Positive Logits
Posted       1.07
WASHINGTON   1.06
Contents     0.98
SHARE        0.97
³³³³³³³³     0.95
SCP          0.93
LOS          0.92
Published    0.92
BBC          0.92
ccording     0.91
Activations Density 2.615%