INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
seed
-0.72
owski
-0.71
inspected
-0.68
oulos
-0.67
resist
-0.67
crush
-0.64
amia
-0.63
ulous
-0.62
icum
-0.61
oka
-0.61
POSITIVE LOGITS
GOODMAN
0.69
showc
0.67
addons
0.67
glim
0.66
EGIN
0.65
midfield
0.64
features
0.63
lineback
0.61
ilities
0.61
IRE
0.61
Activations Density 0.000%
No Known Activations
This feature has no known activations.