INDEX
Explanations
references to events or places where things are being viewed by the public
references to public viewing or the act of observing
Negative Logits
trap        -0.82
redo        -0.78
lying       -0.70
wealth      -0.70
contract    -0.70
breaking    -0.69
nesses      -0.69
working     -0.68
ursed       -0.67
accompan    -0.67
Positive Logits
viewing     0.93
"$:/        0.89
Ratings     0.81
obser       0.80
wat         0.79
screens     0.77
experien    0.74
âĸ¬         0.74
ById        0.73
ipers       0.73
Activations Density: 0.012%