INDEX
Explanations
No Explanations Found
Negative Logits
enegger  -0.77
iago     -0.73
Enough   -0.72
acters   -0.67
aukee    -0.67
ween     -0.66
Azerb    -0.66
Kers     -0.65
atar     -0.64
eri      -0.64
Positive Logits
isSpecialOrderable  0.72
Pilgrim             0.71
itent               0.70
instincts           0.68
bonded              0.64
ifles               0.64
natureconservancy   0.64
acious              0.63
rouse               0.61
bars                0.61
Activations Density: 0.000%
No Known Activations
This feature has no known activations.