INDEX
Explanations
words related to feelings of concern or unease
phrases and words that express concern or anxiety
New Auto-Interp
Negative Logits
ewitness
-0.84
guided
-0.72
inters
-0.71
ingers
-0.68
inka
-0.68
atha
-0.65
adr
-0.65
chnology
-0.64
avour
-0.64
dating
-0.64
POSITIVE LOGITS
warts
0.98
ingly
0.94
wart
0.91
about
0.89
ABOUT
0.89
lessly
0.84
crow
0.78
worry
0.78
aloud
0.77
der
0.76
Activations Density 0.025%