INDEX
Explanations
words related to changing or transitioning
phrases related to changing states or transitions
Negative Logits
EMENT      -0.72
INO        -0.68
EST        -0.66
LIB        -0.63
MU         -0.60
Spoiler    -0.59
UAL        -0.59
Brave      -0.59
fullest    -0.58
POST       -0.57
Positive Logits
imester    0.85
ipolar     0.84
ricular    0.78
izoph      0.77
apters     0.74
rients     0.74
levels     0.74
acco       0.72
rencies    0.71
drive      0.70
Activations Density: 0.112%