INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
craw
-0.15
aub
-0.15
banks
-0.15
wand
-0.14
seafood
-0.14
odom
-0.14
Elder
-0.13
cdr
-0.13
dumpster
-0.13
kred
-0.13
POSITIVE LOGITS
horse
0.54
horses
0.52
Horse
0.49
horse
0.46
riders
0.39
rider
0.38
pony
0.36
riding
0.35
polo
0.35
Riders
0.34
Activations Density 0.000%
No Known Activations
This feature has no known activations.