INDEX
Explanations
words related to medical conditions, particularly strokes
mentioned medical conditions and strokes
New Auto-Interp
Negative Logits
starter
-0.69
aco
-0.68
source
-0.65
Nexus
-0.65
mate
-0.64
anomaly
-0.63
Bomb
-0.63
nexus
-0.62
link
-0.62
Radio
-0.61
POSITIVE LOGITS
strokes
4.06
Stro
1.48
brushes
1.43
stro
1.31
tones
1.28
kisses
1.26
stitches
1.23
motions
1.22
blows
1.21
wrinkles
1.20
Activations Density 0.014%