INDEX
Explanations
personal pronouns and verbs indicating personal actions or interactions
terms related to various digital media formats and contexts
New Auto-Interp
Negative Logits
favor
-0.90
accrued
-0.82
eleph
-0.80
lim
-0.78
conflicted
-0.78
charter
-0.78
cher
-0.77
scaling
-0.77
ow
-0.76
consolidation
-0.76
POSITIVE LOGITS
T
1.37
B
1.36
P
1.34
Mal
1.32
N
1.30
H
1.30
Bl
1.29
L
1.29
C
1.29
World
1.29
Activations Density 0.344%