INDEX
Explanations
proper nouns, specifically names of locations, organizations, and individuals
references to specific political parties or movements
New Auto-Interp
Negative Logits
Wall
-0.67
ãĤ¯
-0.67
SHARE
-0.66
Bear
-0.66
strong
-0.66
Strength
-0.66
protected
-0.66
Lieberman
-0.65
Wide
-0.65
options
-0.62
POSITIVE LOGITS
eers
0.87
entimes
0.78
bourg
0.76
charm
0.76
ernel
0.69
)]
0.68
aspir
0.67
faculties
0.66
ciating
0.65
olitics
0.64
Activations Density 0.090%