Explanations
Instances of the word "declare" and its variations across different contexts, indicating a focus on declarations or formal statements.
Negative Logits
RH         -0.75
Rh         -0.70
dayName    -0.67
isoft      -0.67
alez       -0.65
anuts      -0.64
engers     -0.63
pps        -0.63
umbn       -0.63
Reply      -0.62
Positive Logits
bankruptcy       1.08
phas             0.97
allegiance       0.92
unequivocally    0.86
victory          0.85
independence     0.82
aloud            0.80
martial          0.79
war              0.79
moratorium       0.77
Activations Density: 0.017%
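
To give a sense of scale for the density figure, here is a minimal Python sketch. It assumes that activation density means the fraction of sampled tokens on which the feature has a nonzero activation; the page itself does not define the term. The helper name `expected_activations` is hypothetical and used only for illustration.

```python
def expected_activations(density_percent: float, n_tokens: int) -> float:
    """Expected number of activating tokens, given a density in percent.

    Assumption: density is the share of tokens with nonzero activation.
    """
    return density_percent / 100.0 * n_tokens


# Density value reported on this page.
density_percent = 0.017

# Scale to one million tokens to get a readable count.
per_million = expected_activations(density_percent, 1_000_000)
print(f"about {per_million:.0f} activating tokens per million")
```

Under that assumption, the feature fires on only a small number of tokens per million, which fits a feature tied to one word family such as "declare".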