Explanations
No Explanations Found
Negative Logits
Token      Logit
coin       -0.75
oy         -0.70
arnaev     -0.67
gym        -0.66
Coin       -0.66
meal       -0.66
enta       -0.63
eat        -0.63
trainer    -0.63
sung       -0.62
Positive Logits
Token        Logit
ID           0.63
icter        0.62
population   0.62
livest       0.61
Doctors      0.61
Hos          0.60
Definitive   0.60
tis          0.59
Disability   0.59
thumbnails   0.59
Activations Density 0.000%
No Known Activations
This feature has no known activations.