INDEX
Explanations
statements or declarations made by individuals
Negative Logits
RH          -0.72
Rh          -0.68
dayName     -0.67
pps         -0.67
alez        -0.67
umbn        -0.66
isoft       -0.64
rero        -0.64
anuts       -0.64
iddling     -0.63
Positive Logits
bankruptcy          1.05
phas                1.00
allegiance          0.89
unequivocally       0.83
independence        0.82
victory             0.82
unfit               0.80
martial             0.80
unconstitutional    0.79
war                 0.79
Activations Density 0.087%