INDEX
Explanations
terms related to public figures and their statements or actions
statements related to political remarks or opinions
New Auto-Interp
Negative Logits
Located
-0.76
houses
-0.74
ILCS
-0.71
[];
-0.70
Exper
-0.70
Printing
-0.68
geries
-0.67
Archdemon
-0.67
Agric
-0.66
)/
-0.66
POSITIVE LOGITS
remarks
0.97
praise
0.95
upbeat
0.94
scathing
0.94
clarify
0.92
reiterate
0.90
categ
0.89
lique
0.88
sarcast
0.88
clarification
0.87
Activations Density 0.513%