INDEX
Explanations
phrases related to changes or transitions
phrases indicating current actions or transitions
New Auto-Interp
Negative Logits
LOCK
-0.67
Logged
-0.62
ANA
-0.61
bodily
-0.60
encyclopedia
-0.59
Gallery
-0.56
worthy
-0.55
deserve
-0.55
chy
-0.54
Mystic
-0.54
POSITIVE LOGITS
uddenly
1.02
instead
0.92
iott
0.82
suddenly
0.77
reversed
0.76
emer
0.72
today
0.71
Instead
0.67
shifted
0.67
rosso
0.66
Activations Density 0.989%