INDEX
Explanations
words related to legal matters and political figures in news contexts
terms related to elements and their occurrences
New Auto-Interp
Negative Logits
America
-0.68
Craigslist
-0.63
Omaha
-0.63
kills
-0.62
dirty
-0.62
offensive
-0.60
kill
-0.60
papers
-0.60
bro
-0.60
Grind
-0.59
POSITIVE LOGITS
lement
5.03
lements
2.58
lem
1.21
ling
1.09
ment
1.06
lers
1.04
LE
1.03
lez
1.01
LES
1.00
ement
0.99
Activations Density 0.014%