INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
Bonnie
-0.70
Deborah
-0.68
Aval
-0.67
Norton
-0.64
opathy
-0.62
Salv
-0.62
McH
-0.61
Mast
-0.59
Whit
-0.59
Bernstein
-0.59
POSITIVE LOGITS
abouts
0.79
rarily
0.76
gage
0.75
paralle
0.74
atmosp
0.72
verty
0.68
̶
0.67
pse
0.67
â̦]
0.66
irements
0.65
Activations Density 0.000%
No Known Activations
This feature has no known activations.