Explanations
sentences containing words related to controversies or arguments
Negative Logits
igslist       -0.65
mosqu         -0.62
apiece        -0.61
misdem        -0.61
culus         -0.61
»Ĵ            -0.60
trailed       -0.58
culminated    -0.58
respectively  -0.58
ppa           -0.58
Positive Logits
unless     0.84
except     0.80
Its        0.78
Therefore  0.78
Therefore  0.78
Its        0.77
Picture    0.69
:(         0.68
doesnt     0.67
nor        0.66
Activations Density 0.638%