INDEX
Explanations
political figures and their related activities or positions
negative connotations or sentiments associated with various political figures or events
Negative Logits
llah            -0.83
DragonMagazine  -0.77
ransom          -0.76
prompt          -0.71
obyl            -0.70
glean           -0.70
intrinsic       -0.69
structure       -0.69
poets           -0.66
astronomers     -0.66
Positive Logits
California      1.05
Portland        1.03
Seattle         1.02
Calif           0.96
Minnesota       0.95
Washington      0.95
Nation          0.95
Mont            0.92
nom             0.92
East            0.91
Activations Density: 0.042%