INDEX
Explanations
expressions indicating a strong personal sentiment or opinion
expressions of strong emotional emphasis or commitment
Negative Logits
guiActiveUnfocused   -0.72
Gothic               -0.69
icist                -0.67
senal                -0.63
aic                  -0.62
ORN                  -0.62
Delivery             -0.61
Gad                  -0.61
YING                 -0.61
srfAttach            -0.60
Positive Logits
knew                 1.18
owe                  1.15
forgot               1.08
want                 1.04
despise              1.03
need                 1.02
ate                  0.98
saw                  0.98
got                  0.97
intend               0.97
Activations Density: 0.223%