INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
antid
-0.70
fug
-0.70
+++
-0.64
Bundy
-0.63
bargaining
-0.62
etheless
-0.61
absentee
-0.60
unions
-0.60
VERTISEMENT
-0.59
discretion
-0.59
POSITIVE LOGITS
eki
0.79
osit
0.71
ico
0.70
cycl
0.69
psc
0.68
aye
0.67
onen
0.67
bsite
0.67
osite
0.66
zie
0.66
Activations Density 0.000%
No Known Activations
This feature has no known activations.