INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
tz
-0.80
stadt
-0.76
Gingrich
-0.76
ice
-0.74
zers
-0.74
ende
-0.71
vin
-0.68
Wasserman
-0.66
Cummings
-0.65
rir
-0.64
POSITIVE LOGITS
catentry
0.82
Surv
0.75
advertisement
0.74
Narr
0.70
POST
0.68
mere
0.68
)</
0.66
reet
0.64
Medium
0.63
rehens
0.62
Activations Density 0.000%
No Known Activations
This feature has no known activations.