INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
ropri
-0.67
person
-0.67
ela
-0.67
woman
-0.65
gal
-0.62
Sounds
-0.62
ZI
-0.62
wife
-0.61
Si
-0.61
cow
-0.61
POSITIVE LOGITS
unker
0.71
ģĸ
0.69
Thunderbolt
0.67
acers
0.67
Seraph
0.66
leep
0.66
Archangel
0.66
phrine
0.65
cipled
0.65
ĸļ
0.65
Activations Density 0.000%
No Known Activations
This feature has no known activations.