INDEX
Explanations
phrases related to the potential occurrence or non-occurrence of events
expressions related to events occurring or not occurring
New Auto-Interp
Negative Logits
itton
-0.68
osi
-0.65
onomy
-0.64
ross
-0.62
usra
-0.62
ription
-0.62
ortment
-0.61
brace
-0.60
curious
-0.57
irin
-0.57
POSITIVE LOGITS
anymore
1.64
unless
1.09
nor
1.06
anywhere
1.00
anytime
0.96
whatsoever
0.95
necessarily
0.94
yet
0.90
until
0.87
unless
0.87
Activations Density 0.234%