INDEX
Explanations
instances where the word "sit" is mentioned
instances of the word "sit."
New Auto-Interp
Negative Logits
iler
-0.76
ctr
-0.67
Adds
-0.66
raid
-0.62
iction
-0.59
Scarlet
-0.59
operation
-0.59
ilers
-0.59
---------------
-0.58
ologically
-0.58
POSITIVE LOGITS
lie
0.90
seiz
0.89
uate
0.85
atop
0.85
comfortably
0.84
aution
0.84
anic
0.83
chel
0.81
ducks
0.79
ivas
0.79
Activations Density 0.029%