INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
ABE
-0.79
defe
-0.75
Ens
-0.69
Liberties
-0.67
Protect
-0.67
²¾
-0.65
Ples
-0.65
Flag
-0.62
yip
-0.62
Bye
-0.62
POSITIVE LOGITS
ums
0.93
uum
0.90
usha
0.77
hea
0.76
heit
0.76
zes
0.75
ractor
0.73
rolled
0.70
packs
0.70
IUM
0.69
Activations Density 0.000%
No Known Activations
This feature has no known activations.