INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
fri
-0.76
forest
-0.75
asons
-0.72
rien
-0.67
taboola
-0.66
cryst
-0.66
pins
-0.65
oug
-0.64
burden
-0.64
atars
-0.64
POSITIVE LOGITS
Buchanan
0.78
RN
0.74
Buckley
0.70
Mellon
0.69
TRUMP
0.66
Commando
0.66
Eliot
0.66
Bastard
0.64
McMaster
0.63
Dartmouth
0.62
Activations Density 0.000%
No Known Activations
This feature has no known activations.