INDEX
Explanations
references to various societal and economic issues
Negative Logits
Token         Logit
Ratings       -0.59
ournal        -0.57
amins         -0.56
vitamins      -0.56
livest        -0.54
abase         -0.54
Statistical   -0.54
Printing      -0.53
hovah         -0.53
ˈ             -0.53
Positive Logits
Token         Logit
hers          0.79
SPONSORED     0.73
~~~~~~~~      0.69
theirs        0.69
;;;;;;;;;;;;  0.68
anza          0.66
swer          0.65
itiz          0.65
affair        0.64
sie           0.64
Activations Density: 3.961%