INDEX
Explanations
HTML tags indicating links or buttons
HTML or XML tags and attributes
Negative Logits
Token          Logit
offensive      -0.60
boycot         -0.59
Dund           -0.59
Faust          -0.58
needed         -0.55
boycott        -0.54
earning        -0.54
temporarily    -0.54
Falk           -0.53
fellow         -0.53
POSITIVE LOGITS
Token    Logit
">       3.74
"><      2.79
"></     2.50
"/>      2.50
'>       2.39
\">      2.38
"]       2.01
)</      1.88
>"       1.79
</       1.71
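
As a rough sanity check of the two explanations against the positive-logit tokens listed above, the sketch below counts how many of those tokens contain a tag delimiter. The helper names `has_angle_bracket` and `markup_fraction` are hypothetical, and treating "contains `<` or `>`" as a proxy for "HTML or XML markup" is an assumption made here for illustration; it is not how the dashboard derives its explanations.

```python
# Top positive-logit tokens and values, copied from the list above.
positive_logits = [
    ('">', 3.74),
    ('"><', 2.79),
    ('"></', 2.50),
    ('"/>', 2.50),
    ("'>", 2.39),
    ('\\">', 2.38),  # backslash, double quote, '>'
    ('"]', 2.01),
    (')</', 1.88),
    ('>"', 1.79),
    ('</', 1.71),
]


def has_angle_bracket(token: str) -> bool:
    """Return True if the token contains '<' or '>' (assumed markup proxy)."""
    return "<" in token or ">" in token


def markup_fraction(pairs) -> float:
    """Fraction of (token, logit) pairs whose token contains '<' or '>'."""
    hits = [tok for tok, _ in pairs if has_angle_bracket(tok)]
    return len(hits) / len(pairs)


if __name__ == "__main__":
    frac = markup_fraction(positive_logits)
    print(f"{frac:.0%} of the top positive-logit tokens contain '<' or '>'")
```

Under this heuristic, every listed token except `"]` counts as markup-like, which is consistent with the tag-related explanations but does not by itself confirm them.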
Activations Density: 0.009%