INDEX
Explanations
disagreements or discussions mentioned in written content
references to social media interactions and discussions
New Auto-Interp
Negative Logits
ÂŃ
-0.65
å§
-0.65
"},"
-0.64
bribes
-0.61
fitted
-0.60
ballet
-0.60
elve
-0.59
atile
-0.58
Fif
-0.58
ichever
-0.57
POSITIVE LOGITS
commenter
1.19
mentioning
1.11
Quote
1.09
regarding
1.08
commenters
1.06
mentioned
1.06
1.05
thread
1.03
comments
1.03
referenced
1.01
Activations Density 0.616%