Explanations
The conjunction 'and', with high activation values, suggesting the feature tracks relationships between ideas or concepts
Negative Logits
ãĥ¢     -0.76
uria    -0.70
park    -0.70
file    -0.63
HQ      -0.62
leneck  -0.61
Locked  -0.61
wake    -0.59
olitan  -0.59
rison   -0.59
Positive Logits
etc           1.30
respectively  1.01
etc           0.98
whatever      0.96
blah          0.93
assorted      0.88
whatever      0.84
thence        0.83
oh            0.81
consequently  0.80
Activations Density 0.200%
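
The first entry under Negative Logits looks like a raw byte-level BPE vocabulary string rather than readable text. The sketch below shows one way such a string could be mapped back to the character it encodes. It assumes a GPT-2-style byte-to-unicode table, which the page does not confirm, and the helper names `byte_level_table` and `decode_token` are hypothetical.

```python
def byte_level_table():
    """Map each vocabulary character back to its raw byte value.

    Assumption: the tokenizer uses the GPT-2-style byte-level scheme, where
    printable bytes stand for themselves and every other byte is shifted
    to code points starting at 256.
    """
    keep = set(range(0x21, 0x7F)) | set(range(0xA1, 0xAD)) | set(range(0xAE, 0x100))
    table = {}
    extra = 0
    for b in range(256):
        if b in keep:
            table[chr(b)] = b            # printable byte: same code point
        else:
            table[chr(256 + extra)] = b  # non-printable byte: shifted code point
            extra += 1
    return table


def decode_token(token):
    """Convert a byte-level vocabulary string to the text it represents."""
    table = byte_level_table()
    raw = bytes(table[ch] for ch in token)
    # Partial multi-byte sequences are possible, so replace undecodable bytes.
    return raw.decode("utf-8", errors="replace")


# The first negative-logit token, written with escapes to avoid copy errors.
print(decode_token("\u00e3\u0125\u00a2"))
```

Under that assumption the token corresponds to the bytes E3 83 A2, which is the UTF-8 encoding of the katakana character U+30E2. Tokens made only of printable ASCII characters, such as `park` or `file`, pass through unchanged.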