Explanations
No Explanations Found
Negative Logits
Token          Logit
iqueness       -0.72
actionDate     -0.63
approval       -0.60
Production     -0.59
Manip          -0.59
viability      -0.57
signalling     -0.56
omorph         -0.56
Judd           -0.55
disapproval    -0.55
Positive Logits
Token          Logit
should          1.05
should          0.94
shouldn         0.92
SHOULD          0.83
ought           0.83
book            0.79
books           0.71
book            0.71
BOOK            0.67
Book            0.67
Activations Density: 0.000%
No Known Activations
This feature has no known activations.