INDEX
Explanations
expressions of personal feelings and reactions
First-person statements expressing feelings
states of being or opinion
New Auto-Interp
Negative Logits
ftate
-0.65
fubject
-0.58
Chriftian
-0.57
צלחה
-0.56
neceff
-0.55
ктан
-0.55
Relo
-0.53
ſelf
-0.53
Pliocene
-0.52
saraba
-0.52
POSITIVE LOGITS
intrigued
0.78
impressed
0.78
amazed
0.66
convinced
0.63
skeptical
0.62
curious
0.60
thrilled
0.59
impress
0.58
perplexed
0.58
puzzled
0.57
Activations Density 0.214%