INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
amina
-0.76
arie
-0.75
uana
-0.74
uffy
-0.70
vine
-0.69
Baal
-0.68
asma
-0.68
Shant
-0.66
ataka
-0.63
reel
-0.62
POSITIVE LOGITS
GMT
0.74
integration
0.73
fragmentation
0.70
iqueness
0.70
otin
0.70
inav
0.64
IG
0.63
Integration
0.63
ohl
0.63
nov
0.62
Activations Density 0.000%
No Known Activations
This feature has no known activations.