INDEX
Explanations
instances where actions or events are suggested to happen differently than the norm
phrases or constructions that specify alternatives or comparisons
New Auto-Interp
Negative Logits
CSS
-0.72
may
-0.72
izens
-0.69
cia
-0.69
soon
-0.68
WIND
-0.67
cue
-0.67
Corn
-0.66
RAW
-0.64
Aren
-0.64
POSITIVE LOGITS
necessarily
1.16
relying
1.14
bothering
1.10
merely
1.03
simply
0.93
letting
0.93
risking
0.90
outright
0.90
allowing
0.86
focusing
0.85
Activations Density 0.060%