INDEX
Explanations
past tense verbs indicating actions affecting various situations or individuals
Negative Logits
urai     -0.62
enos     -0.57
CoC      -0.56
Cliff    -0.56
nature   -0.55
DRAGON   -0.55
Poké     -0.53
Bees     -0.53
Eth      -0.53
WH       -0.53
Positive Logits
by       1.15
aback    0.93
bys      0.84
by       0.84
BY       0.84
ĸļ       0.83
By       0.75
By       0.72
linger   0.72
pez      0.71
Activations Density: 0.278%