Explanations
No Explanations Found
Negative Logits
ufact      -0.94
boycott    -0.81
hesda      -0.75
boycot     -0.74
divest     -0.73
osponsors  -0.70
alions     -0.69
orah       -0.69
issions    -0.66
arers      -0.66
Positive Logits
Grey       0.71
Tex        0.71
INTON      0.69
Xan        0.68
Gall       0.67
Giles      0.64
kids       0.62
ouston     0.62
Dane       0.61
Avalon     0.61
Activations Density 0.000%
No Known Activations
This feature has no known activations.