Explanations
pronouns referring to people
references to individuals or groups in the context of struggles or issues
Negative Logits
Token      Logit
Eleven     -0.70
CCC        -0.68
Rating     -0.59
Offline    -0.58
Binding    -0.58
Witt       -0.58
Links      -0.58
stellar    -0.58
Sequ       -0.58
Dome       -0.57
Positive Logits
Token        Logit
're          1.70
've          1.33
are          1.24
themselves   1.21
'll          1.21
deserve      1.12
aren         1.11
were         1.08
perceive     1.07
'd           1.06
Activations Density: 0.240%