INDEX
Explanations
statements regarding authorship and personal opinions
Negative Logits
Token            Logit
itter            -0.14
оÑħ              -0.14
excuse           -0.14
asing            -0.14
ÙĬØ«             -0.14
aser             -0.14
aje              -0.14
Sist             -0.14
deed             -0.14
simultaneously   -0.14
Positive Logits
Token      Logit
views      0.27
Views      0.23
opinions   0.21
Views      0.21
views      0.20
/views     0.18
apult      0.17
expres     0.17
inions     0.17
_views     0.16
Activations Density: 0.014%