INDEX
Explanations
mentions of the word "Con" followed by a high number
instances of the prefix "Con" suggesting it looks for words related to connections or conversations
New Auto-Interp
Negative Logits
OHN
-0.66
TPS
-0.65
wip
-0.64
Rover
-0.64
Wem
-0.63
practicable
-0.62
terday
-0.61
hydra
-0.60
aceutical
-0.60
Uzbek
-0.58
POSITIVE LOGITS
stant
1.26
ventions
1.15
cept
1.14
ference
1.12
secut
1.08
clusions
1.08
verting
1.06
cerning
1.04
verted
1.04
joined
1.04
Activations Density 0.014%