INDEX
Explanations
phrases related to taking actions or stances
references to confrontation or disagreement regarding actions or events
New Auto-Interp
Negative Logits
ambo
-0.72
ovi
-0.69
Waste
-0.67
ailability
-0.67
Ü
-0.66
apo
-0.65
rived
-0.64
ells
-0.62
etheless
-0.62
uga
-0.61
POSITIVE LOGITS
largeDownload
0.92
çİĭ
0.82
seriously
0.76
lightly
0.69
mans
0.65
stance
0.64
WATCHED
0.64
Fax
0.63
emoji
0.63
rogens
0.62
Activations Density 0.108%