Explanations
proper nouns and names of people or entities
names of individuals involved in legal or political contexts
Negative Logits
eatures                           -0.78
cellaneous                        -0.68
</                                -0.64
eaturing                          -0.63
Adds                              -0.62
rawdownloadcloneembedreportprint  -0.61
ibilities                         -0.61
abound                            -0.58
umph                              -0.58
Characters                        -0.58
Positive Logits
violated       1.48
deserved       1.40
misled         1.35
interfered     1.29
lacked         1.27
acted          1.25
misunderstood  1.24
discriminated  1.24
lied           1.23
stole          1.21
Activations Density 0.355%