Explanations
No Explanations Found
Negative Logits
Token        Logit
Moo          -0.15
abox         -0.14
Regardless   -0.14
eselect      -0.14
å¯           -0.14
marque       -0.14
akedirs      -0.13
Montserrat   -0.13
unlikely     -0.13
malar        -0.13
Positive Logits
Token        Logit
Sharon       0.21
(“           0.20
occupation   0.19
Occupation   0.18
Israeli      0.18
ynet         0.18
occupation   0.18
IDF          0.17
pione        0.17
occup        0.17
Activations Density 0.000%
No Known Activations
This feature has no known activations.