Explanations
arguments or attempts to persuade others in written texts
words related to debating or making arguments
Negative Logits
gallery     -0.68
pta         -0.65
ILCS        -0.63
cffffcc     -0.62
ascript     -0.62
anie        -0.61
hook        -0.60
Pastebin    -0.60
lio         -0.60
beam        -0.60
Positive Logits
against         1.28
convinc         1.16
passionately    1.15
vehemently      1.11
forcefully      1.08
persu           1.07
against         1.03
stren           0.93
vigorously      0.89
Against         0.88
Activations Density 0.046%