INDEX
Explanations
phrases expressing opinions or beliefs
clauses or phrases that indicate belief, doubt, or opinion statements
New Auto-Interp
Negative Logits
ãĥĦ
-0.73
ĸļ士
-0.65
flag
-0.64
ActionCode
-0.64
EStream
-0.64
Said
-0.64
efe
-0.64
ocative
-0.64
tained
-0.63
Redditor
-0.63
POSITIVE LOGITS
nowadays
1.35
nobody
1.14
there
1.10
they
1.09
alot
1.06
lately
1.04
although
1.03
everybody
1.02
whenever
1.02
whereas
1.02
Activations Density 0.447%