INDEX
Explanations
adjectives related to harm or negative impact
terms related to destruction and harmful effects
Negative Logits
Token        Logit
Cosponsors   -0.76
ask          -0.74
Clar         -0.72
shire        -0.71
HR           -0.66
Patel        -0.66
icip         -0.66
≤            -0.64
Lilly        -0.64
Kislyak      -0.64
Positive Logits
Token        Logit
destructive  3.19
destruct     2.38
destruct     2.01
destroying   1.88
destruction  1.79
dest         1.67
Dest         1.66
Destruction  1.64
Dest         1.55
sabot        1.44
Activations Density 0.032%
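
For readers unfamiliar with these panels, the sketch below shows one common way that per-token logit lists and an activation density figure can be derived for a feature. It is an assumption, not a description of how this page was produced: the logit for each token is taken to be the dot product of the feature's decoder direction with that token's unembedding vector, and density is taken to be the fraction of token positions where the feature fires. All names (feature_logits, top_and_bottom, activation_density) and all numbers in the toy data are hypothetical.

```python
# Hypothetical sketch: function names, shapes, and toy data are illustrative
# assumptions and are not taken from the dashboard above.


def feature_logits(decoder_direction, unembedding):
    """Project a feature's decoder direction onto each vocabulary token.

    decoder_direction: list of d floats.
    unembedding: dict mapping token string -> list of d floats.
    Returns a dict mapping token -> logit contribution (a dot product).
    """
    return {
        token: sum(w * x for w, x in zip(vector, decoder_direction))
        for token, vector in unembedding.items()
    }


def top_and_bottom(logits, k):
    """Return (k most positive, k most negative) as lists of (token, logit)."""
    ranked = sorted(logits.items(), key=lambda item: item[1], reverse=True)
    return ranked[:k], ranked[::-1][:k]


def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds the threshold."""
    if not activations:
        return 0.0
    return sum(1 for a in activations if a > threshold) / len(activations)


# Toy 2-dimensional example with made-up numbers.
toy_unembedding = {
    "destructive": [2.0, 0.5],
    "destroying": [1.0, 0.0],
    "ask": [-0.5, 0.0],
    "Clar": [-0.25, -0.5],
}
toy_direction = [1.0, 1.0]

toy_logits = feature_logits(toy_direction, toy_unembedding)
positive, negative = top_and_bottom(toy_logits, k=2)
toy_density = activation_density([0.0, 0.0, 1.5, 0.0])
```

In the toy data the feature fires on one of four positions, so the density is 0.25. A real density such as 0.032% would come from the same ratio computed over a much larger sample of tokens.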