INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
Cir
-0.68
Soros
-0.67
Burr
-0.65
Rai
-0.65
Riding
-0.63
Slot
-0.62
Ort
-0.60
Irwin
-0.59
Forbes
-0.59
Passive
-0.58
POSITIVE LOGITS
ysis
0.86
illin
0.86
enh
0.85
isha
0.75
eeper
0.71
peror
0.70
ibo
0.70
fax
0.70
itous
0.69
obyl
0.68
Activations Density 0.000%
No Known Activations
This feature has no known activations.