Explanations
No Explanations Found
Negative Logits
Token        Logit
senal        -0.95
esson        -0.81
Downloadha   -0.78
ichick       -0.76
icut         -0.73
auntlets     -0.70
govtrack     -0.70
olicy        -0.70
itone        -0.69
archment     -0.69
Positive Logits
Token        Logit
veget        0.67
eaten        0.64
starving     0.64
population   0.62
hungry       0.61
Animal       0.60
grave        0.60
remaining    0.59
hum          0.59
food         0.58
Activations Density 0.000%
This feature has no known activations.