Explanations
No Explanations Found
Negative Logits
JD           -0.75
Fly          -0.73
retri        -0.71
HOU          -0.69
traveller    -0.69
pige         -0.67
Airbus       -0.67
LIA          -0.66
ADS          -0.66
ircraft      -0.66
Positive Logits
uch          1.67
atism        0.75
ita          0.67
inky         0.67
punished     0.66
apon         0.63
itas         0.63
ogie         0.62
imprint      0.62
Gab          0.60
Activations Density: 0.000%
No Known Activations
This feature has no known activations.