INDEX
Explanations
phrases related to changes or transitions
phrases indicating a change or transition in status or condition
New Auto-Interp
Negative Logits
iosyn
-0.81
QUI
-0.78
jong
-0.77
cise
-0.76
resy
-0.70
vic
-0.67
DAQ
-0.66
ourage
-0.66
etimes
-0.64
Enlarge
-0.64
POSITIVE LOGITS
obscurity
0.90
hating
0.80
afar
0.73
humble
0.71
mildly
0.66
whence
0.66
dormant
0.65
laughing
0.65
novice
0.64
seed
0.64
Activations Density 0.059%