INDEX
Explanations
names and terms related to politics and current events
names of notable individuals and organizations involved in political or societal issues
New Auto-Interp
Negative Logits
Weston
-0.69
Lomb
-0.59
ciplinary
-0.58
SHIP
-0.58
Natural
-0.57
doors
-0.55
Major
-0.53
Sutherland
-0.53
Naples
-0.53
ento
-0.52
POSITIVE LOGITS
[/
0.90
»
0.89
ãĢį
0.88
%"
0.85
''
0.84
ãĢı
0.82
,''
0.80
_.
0.79
"—
0.75
\)
0.73
Activations Density 0.931%