INDEX
Explanations
references to wealthy or influential individuals
references to various types of businesses or corporations
Negative Logits
PORT             -0.81
ATK              -0.77
姫               -0.72
Ferr             -0.69
Sale             -0.68
Accessory        -0.68
Participant      -0.67
Murd             -0.67
Accountability   -0.66
Annotations      -0.66
Positive Logits
etime            0.90
icol             0.89
iddling          0.86
gee              0.86
pex              0.83
iatrics          0.82
ogly             0.81
iatric           0.81
ction            0.81
idd              0.79
Activations Density: 0.135%