INDEX
Explanations
words related to contradiction or opposing views
occurrences of the word "Cont" and its variations, indicating a focus on contrasting statements or narratives
New Auto-Interp
Negative Logits
EStream
-1.12
ħĭ
-0.85
STER
-0.81
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
-0.80
senal
-0.76
Downloadha
-0.75
OPLE
-0.74
Reviewer
-0.72
terday
-0.72
steen
-0.72
POSITIVE LOGITS
ributed
1.21
ribution
1.20
ainment
1.16
ribut
1.09
rast
1.07
ribute
1.05
roversial
1.02
ainers
0.97
rollers
0.94
rad
0.93
Activations Density 0.010%