INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
eton
-0.82
odox
-0.76
isks
-0.75
isky
-0.75
SHIP
-0.72
acular
-0.72
IUM
-0.71
arium
-0.71
unn
-0.69
inav
-0.68
POSITIVE LOGITS
folds
0.62
collagen
0.59
gelatin
0.59
drawer
0.58
metab
0.58
millisec
0.58
genic
0.57
fused
0.57
vein
0.55
atel
0.55
Activations Density 0.000%
No Known Activations
This feature has no known activations.