INDEX
Explanations
details related to legal or regulatory issues as well as specific naming instances
New Auto-Interp
Negative Logits
grown
-0.63
-0.62
Russ
-0.62
isSpecialOrderable
-0.57
animate
-0.56
©
-0.56
Spread
-0.56
Availability
-0.55
rehearsal
-0.55
sear
-0.55
POSITIVE LOGITS
told
0.97
tells
0.95
urged
0.95
wrote
0.94
commented
0.90
testified
0.89
says
0.87
told
0.86
explains
0.86
warns
0.85
Activations Density 0.111%