INDEX
Explanations
mentions of the presence or continuation of certain states or actions over time
New Auto-Interp
Negative Logits
insula
-0.72
Stim
-0.69
elimination
-0.65
iHUD
-0.61
stice
-0.60
alties
-0.59
diversion
-0.59
ongyang
-0.59
é¾
-0.57
Moody
-0.57
POSITIVE LOGITS
birth
1.14
intact
0.93
born
0.89
unanswered
0.88
undecided
0.78
reeling
0.78
alive
0.77
atile
0.74
extant
0.72
Nadu
0.72
Activations Density 0.978%