Explanations
personal pronouns and verbs related to actions towards others
references to individuals and their relationships in a narrative context
Negative Logits
NetMessage      -0.72
largeDownload   -0.71
heny            -0.61
Variety         -0.58
Ruff            -0.57
PLA             -0.57
mega            -0.56
Memor           -0.56
inelli          -0.55
Federation      -0.55
Positive Logits
selves          1.16
self            0.92
atic            0.85
atically        0.83
atics           0.77
senseless       0.76
alian           0.73
unconscious     0.73
selves          0.72
asleep          0.70
Activations Density: 0.216%